Senior Site Reliability Engineer - Remote

USD 141,000-208,000 per year
SENIOR
✅ Remote

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 4 Ansible @ 7 Docker @ 4 Go @ 4 Kubernetes @ 4 Terraform @ 7 Python @ 4 SQL @ 1 AWS @ 7 Azure @ 7 Communication @ 4 Debugging @ 7 Puppet @ 7 Cloud Computing @ 7 Observability @ 4 AI @ 4 ClickHouse @ 4

Details

Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

The company’s sustained momentum was recently validated by a $400M Series D financing round. Customers include Capital One, Lovable, Decagon, Polymarket, Airwallex, Meta, Cursor, Sony, and Tesla.

We’re on a mission to transform how companies use data. This role expands the central Site Reliability Engineering team to ensure the reliability, availability, scalability, and performance of ClickHouse Cloud. You will build and lead processes, collaborate with Control Plane, Data Plane, Core, Security, Support and Operations teams, and own incident management, post-mortems, and continuous improvement of cloud services. You will leverage software engineering skills to develop platforms and tools that optimize operational and engineering efficiency for ClickHouse Cloud.

Responsibilities

  • Collaborate with engineering teams to design and implement scalable, secure, highly available systems for ClickHouse Cloud.
  • Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
  • Ensure monitoring and alerting across infrastructure components (Data Plane, Control Plane, ClickHouse Core, etc.) to enable timely detection and resolution of incidents.
  • Enhance incident response processes and conduct blameless post-mortem analysis for outages, including coordinating communications with impacted customers.
  • Continuously improve reliability and performance of ClickHouse services.
  • Plan, enable, and drive Chaos engineering initiatives across engineering teams.
  • Manage on-call processes, coordinate escalations, and establish best practices to minimize downtime.

Requirements

  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • At least 8 years of experience in Site Reliability Engineering or a related field.
  • Hands-on experience with Go and/or Python.
  • Strong knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
  • Excellent understanding of distributed databases and SQL; ClickHouse experience is a major plus.
  • Hands-on experience with container orchestration tools such as Kubernetes or Docker Swarm.
  • Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
  • Strong production debugging and problem-solving skills.
  • Passion for efficiency, availability, scalability, and data governance.
  • Ability to thrive in a fast-paced environment and partner with the business to move it forward.
  • High level of responsibility, ownership, and accountability.
  • Excellent communication and interpersonal skills.

Compensation

  • Typical starting salary for this role in the United States: $141,000 - $208,000 USD.
  • Typical starting salary for this role in US Premium Markets (e.g., San Francisco Bay Area, New York City Metro Area): $157,000 - $230,000 USD.

Perks

  • Flexible work environment; ClickHouse is globally distributed and remote-friendly.
  • Employer contributions towards healthcare.
  • Stock options for new team members.
  • Flexible time off in the US; generous entitlement in other countries.
  • $500 home office setup for remote employees.
  • Global company-wide gatherings/offsites.

Culture & Equal Opportunity

  • As part of the first 500 employees, you will help shape company culture. Learn more about values and company life on the ClickHouse careers page and blog.
  • ClickHouse provides equal employment opportunities and prohibits discrimination and harassment based on legally protected characteristics.

Other

  • Role is remote-friendly and listed as United States (remote). #LI-Remote