Senior Site Reliability Engineer

๐Ÿ“ United States
๐Ÿ“ Brazil
๐Ÿ“ Canada
๐Ÿ“ Argentina
๐Ÿ“ Chile
๐Ÿ“ Peru
๐Ÿ“ Colombia
๐Ÿ“ Mexico
USD 140,000-180,000 per year
SENIOR
โœ… Remote

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Go @ 4 MySQL @ 4 Terraform @ 6 GCP @ 6 Distributed Systems @ 4 Hiring @ 4 Bash @ 6 Communication @ 4 SRE @ 7 React @ 4 Debugging @ 7

Details

Customer.io helps teams send smarter, more relevant messages using real-time behavioral data. Over 7,500 companies use our platform to send billions of emails, push notifications, in-app messages, and SMS every day. Under the hood: Go, React, Ember and AI help us ship fast and scale with confidence.

Weโ€™re looking for a Site Reliability Engineer to help us scale our infrastructure, reduce operational toil, and increase reliability as we grow. If youโ€™ve worked on high-scale systems and love making platforms better for developers and customers alike, weโ€™d love to meet you.

What We Value

  • Ownership โ€” You own problems end to end. You move fast, act like an owner, and thrive in ambiguity. You've led complex projects before, whether officially or not, and you're ready to do it again.
  • Engineers with product taste โ€” You think like a user, not just an engineer. You think about performance, reliability, and how systems impact the customer experience.
  • A healthy skepticism for โ€œthe way things are doneโ€ โ€” You bring rigor and creativity. Best practices matter โ€” but never more than forward motion.

Responsibilities

  • Build and scale infrastructure to support billions of messages per day and real-time events
  • Automate deployments, alerting, and incident response
  • Improve on-call experience: create clear alerts, solid documentation, and faster resolution
  • Tune MySQL and other datastore performance and improve reliability across distributed systems
  • Collaborate across teams to debug, ship, and support systems in production
  • Share knowledge publicly and internally through short videos, thoughtful writing, and mentorship
  • Leverage AI tools to prototype, move faster, and make better decisions

Requirements

  • 7+ years in SRE or infrastructure roles, improving production systems at scale
  • Deep MySQL experience โ€” schema design, performance tuning, and operational tooling
  • Fluency in cloud-native technologies (GCP a plus) and Terraform
  • Proficiency in Go and Bash for scripting and systems programming
  • Strong skills in observability, incident response, and debugging distributed systems
  • Preference for action over perfection and pride in owning technical decisions

Compensation & Benefits

  • Starting salary: $140,000 - $180,000 USD (or equivalent in local currency) depending on experience and subject to market rate adjustment
  • 100% coverage of medical, dental, vision, mental health, and supplemental insurance premiums for you and your family
  • 16 weeks paid parental leave, unlimited PTO, stipends for remote work and wellness, and a professional development budget

Our Process

  • Application โ€” We review everyone with care. Tell us why you're interested.
  • Recruiter Call (30 mins)
  • Behavioral Interview (60 mins)
  • Take-Home Assignment โ€” Short, realistic task similar to the work here
  • Technical Interview + Assignment Review Call (90 mins)
  • Final candidates complete background check and employment verifications

Inclusion

Customer.io recognizes the stifling impact of systemic injustice on diverse communities and commits to increasing inclusion and equity within the tech industry. We strive to build an inclusive team culture and implement bias-free hiring practices.

Join us

We believe in empathy, transparency, responsibility, and, yes, a little awkwardness. If youโ€™re excited by what you read and want to build software that makes communication better for everyoneโ€”apply now.