Staff Backend Engineer - Grafana Enterprise

📍 Canada
📍 United States
CAD 186,400-223,600 per year
SENIOR
✅ Remote

Required Skills & Competences

Security @ 4, Go @ 4, Grafana @ 4, Kubernetes @ 4, MySQL @ 7, Prometheus @ 4, TypeScript @ 4, CI/CD @ 4, Distributed Systems @ 4, Communication @ 7, gRPC @ 4, PostgreSQL @ 7, Protobuf @ 4, React @ 4, IaaS @ 4, API @ 4, Compliance @ 4, OpenTelemetry @ 4, Observability @ 4, AI @ 4

Details

Grafana Labs is a remote-first, open-source powerhouse. There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate change in the Alps. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, available as Grafana Cloud or the Grafana Enterprise Stack.

This is a remote position open to candidates in the United States and Canada. Meetings tend to happen between 14:00 and 17:00 UTC. Grafana's backend is written in Go, and engineers contribute to open source communities as well as to enterprise features.

Responsibilities

  • Design and lead development of backend services, distributed systems, and enterprise features at scale.
  • Architect and implement distributed backend services in Go with focus on correctness, observability, and performance at scale.
  • Design APIs and service contracts used by thousands of enterprise operators and cloud service providers (gRPC/protobuf use cases).
  • Drive projects from ideation through the development lifecycle to production; own operational health and participate in on-call rotations (a weekday rotation of 12 hours a day for 5 days and a separate weekend rotation of 24 hours a day for 2 days).
  • Contribute to scalability, reliability, security, and multi-tenancy of the Grafana platform.
  • Collaborate with Product, UX, and frontend engineers to ship end-to-end solutions.
  • Engage directly with large enterprise customers and cloud service providers to understand requirements and translate them into engineering solutions.
  • Hire and develop engineers; drive continuous improvement of engineering and operational practices.

Requirements

  • Deep professional experience writing production services from ideation through production operations at scale.
  • Strong distributed systems fundamentals: replication, consistency models, partitioning, fault tolerance, and trade-offs when operating at scale.
  • Demonstrated experience designing and operating systems for large-scale, high-traffic, high-availability, or multi-tenant environments (infrastructure, observability, or software delivery platforms preferred).
  • Professional experience building and consuming gRPC/protobuf APIs and designing clean service contracts across service boundaries.
  • Strong database skills (PostgreSQL and/or MySQL), including schema design, query optimization, and schema migrations at scale.
  • Experience with large-scale CI/CD systems and build tooling; designing, operating, or integrating with continuous delivery pipelines that serve large engineering organizations or external operators at scale.
  • Comfort working with Kubernetes and containerized deployment environments, including patterns for operating stateful workloads and multi-tenant clusters.
  • Experience with observability tooling: OpenTelemetry, Prometheus metrics, structured logging, and distributed tracing.
  • Familiarity with dependency injection patterns (e.g., Google Wire) and clean, testable service architecture.
  • Strong teamwork and written/interpersonal communication skills; ability to operate in a remote-first environment and engage with external customers.

Nice to have

  • Experience with TypeScript and React for contributing to frontend features.
  • Experience with Grafana's LGTM+ observability stack (Loki, Mimir, Tempo, Pyroscope, Alloy).
  • Prior experience at or building for large-scale cloud service providers, IaaS providers, or global enterprises with demanding SLA requirements.
  • Experience designing or operating large-scale build infrastructure: artifact registries, distributed build caches, hermetic build systems (e.g., Bazel), or developer platform tooling.

Compensation & Equity

  • In Canada, the compensation range for this role is $186,368 - $223,642 CAD per year.
  • All roles include Restricted Stock Units (RSUs).
  • Compensation ranges are country specific; candidates applying from different locations will discuss their market's defined pay range with a recruiter.

Other details

  • Remote-first company culture; in-person onboarding is provided.
  • Grafana Labs provides a company-funded AI coding assistant usage budget and encourages pragmatic AI-assisted development within security guidelines.
  • Global annual leave policy of 30 days (3 of which are reserved for Grafana Shutdown Days), in compliance with local legislation where applicable.
  • Equal opportunity employer.