Senior Fullstack Engineer - Observability Real User Monitoring (RUM)

📍 Canada
📍 United States
USD 154,400-185,300 per year
SENIOR
✅ Remote

Used Tools & Technologies

PostgreSQL

Required Skills & Competences

Security @ 4, Docker @ 3, Grafana @ 4, Kafka @ 4, Kubernetes @ 3, TypeScript @ 4, GCP @ 3, Distributed Systems @ 4, AWS @ 3, Azure @ 3, Communication @ 7, React @ 4, Android @ 3, Debugging @ 4, API @ 4, iOS @ 3, Cassandra @ 4, OpenTelemetry @ 3, Observability @ 4, AI @ 4, Data Modeling @ 4, ClickHouse @ 4

Details

Grafana Labs is a remote-first, open-source company building observability tools used by millions. The Real User Monitoring (RUM) initiative captures, stores, and queries high-volume user interaction data from browsers and mobile devices to help teams understand real-world user experiences at scale. The role focuses on building systems that ingest and process massive amounts of telemetry (sessions, events, traces, and logs) and make it explorable in real time, with attention to performance, cost-efficiency, and developer experience.

Responsibilities

  • Build and evolve fullstack features for RUM across backend services, APIs, storage systems, and frontend user experiences.
  • Design and implement systems that ingest, store, and query high-cardinality, high-volume telemetry data using columnar/analytical databases.
  • Develop performant query layers and APIs to power real-time exploration of user sessions, traces, and events.
  • Contribute to frontend applications that visualize RUM data to help users debug performance issues and understand behavior.
  • Work on data modeling, indexing strategies, and query optimization to ensure low-latency, cost-efficient analytics at scale.
  • Collaborate with SDK engineers (browser and mobile) to ensure high-quality data ingestion and schema evolution.
  • Own projects end-to-end: design, implementation, deployment, monitoring, and iteration.
  • Break down complex, ambiguous problems into incremental deliverables and iterate quickly based on feedback.
  • Ensure quality through testing, system observability, documentation, and smooth upgrade paths.
  • Collaborate cross-functionally with backend, frontend, product, and solutions engineering teams.
  • Support teammates, participate in technical design discussions, and help shape the RUM roadmap.
  • Use modern AI coding assistants as part of the workflow (company-funded usage budget), within security guidelines.

Requirements

  • 5+ years of fullstack engineering experience with strong backend fundamentals.
  • Backend experience (Go preferred) and frontend experience using TypeScript and React.
  • Experience building or operating distributed systems in production (examples include Kafka, WarpStream, ClickHouse, Cassandra, Postgres).
  • Familiarity with cloud-native systems: Docker, Kubernetes, AWS, GCP, Azure.
  • Experience working with high-throughput, high-cardinality data (logs, metrics, traces, events).
  • Strong understanding of data modeling, query optimization, and performance tradeoffs.
  • Experience designing and building APIs and distributed services.
  • Experience building data-heavy UIs (dashboards, query tools, debugging interfaces).
  • Familiarity with observability concepts (traces, logs, metrics) and/or OpenTelemetry.
  • Strong communication skills and ability to work effectively in a remote, distributed team.
  • Pragmatic, self-driven, comfortable navigating ambiguity, and customer-focused with a passion for developer experience.

Bonus / Nice-to-Have

  • Experience with browser or mobile instrumentation (RUM SDKs, telemetry collection).
  • Mobile development experience (iOS or Android) or familiarity with mobile performance and telemetry.
  • Contributions to OpenTelemetry or other observability open-source projects.
  • Experience building developer-facing platforms or observability products.
  • Familiarity with session replay, sampling strategies, or user behavior analytics systems.

Compensation and Benefits

  • In the United States, the compensation range for this role is $154,445 - $185,334 USD. Actual compensation may vary based on level, experience, and skillset.
  • All roles include Restricted Stock Units (RSUs).
  • 100% remote company culture; in-person onboarding to meet fellow new hires.
  • Global annual leave policy of 30 days per annum (with 3 days reserved for Grafana Shutdown Days), subject to local legislation.

Company & Culture

  • Grafana Labs focuses on open standards, developer experience, and shipping reliable software at scale. The company emphasizes transparency, autonomy, and a collaborative remote-first culture.
  • Grafana Observability integrates with OpenTelemetry and Grafana Cloud and focuses on performance, cost efficiency, and developer productivity.
  • Grafana Labs is an equal opportunity employer and may use AI tools in recruitment to assist in matching CVs to job postings.