Used Tools & Technologies
Not specified
Required Skills & Competences ?
Grafana @ 3 Kubernetes @ 3 Prometheus @ 3 Terraform @ 3 GCP @ 3 ArgoCD @ 3 Distributed Systems @ 3 AWS @ 3 Azure @ 3 Communication @ 6 Helm @ 3 Debugging @ 6 OpenTelemetry @ 3Details
ClickHouse is looking for an experienced engineer to join the Observability team that builds and operates the telemetry platform powering both internal monitoring and observability features for ClickHouse Cloud customers. The team handles ingestion of trillions of events per day with sustained throughput in the tens of millions per second. Engineers on this team are hybrid software, systems, and infrastructure engineers responsible for reliability, scalability, and efficiency of the platform, collaborating closely with product and infrastructure teams and contributing to major engineering initiatives.
Responsibilities
- Design, build, and operate distributed systems that power observability across ClickHouse Cloud
- Own reliability, performance, and cost-efficiency of telemetry pipeline and storage systems
- Participate in on-call rotation; drive root-cause resolution and long-term fixes
- Build tooling and automation to eliminate repetitive operational work
- Identify bottlenecks and scaling challenges and help shape the observability roadmap
- Collaborate with other engineering teams to improve their observability posture
- Contribute to design discussions, architecture reviews, and mentor teammates
Requirements
- 5+ years building and running production systems at scale
- Proficiency in Golang
- Experience with Kubernetes, Helm, ArgoCD, and Terraform or similar Infrastructure-as-Code tools
- Comfortable working with at least one major cloud provider (AWS, GCP, Azure)
- Experience with OpenTelemetry, Prometheus, Grafana, or similar observability tools
- Experience with ClickHouse preferred
- Strong production debugging skills and a problem-solving mindset
- Strong communication skills; comfortable in a remote, async-friendly environment
- Experience balancing system performance, reliability, and cost; ability to iterate quickly (build MVPs, gather feedback, improve)
Compensation
- Typical starting salary for this role in the United States: $115,000 - $185,000 USD
- Typical starting salary for US Premium Markets (e.g., Los Angeles, San Francisco Bay Area, Seattle, New York City Metro): $135,000 - $210,000 USD
Benefits & Perks
- Flexible remote-friendly work environment; ClickHouse operates in ~20 countries
- Employer contributions towards healthcare
- Stock options for new team members
- Flexible time off in the US; generous entitlement in other countries
- $500 home office setup for remote employees
- Global company gatherings and offsites
Culture & Equal Opportunity
- Opportunity to shape company culture as an early employee
- ClickHouse is an equal opportunity employer and prohibits discrimination and harassment; candidate privacy practices are provided in the applicant privacy notice.