Used Tools & Technologies
Not specified
Required Skills & Competences ?
Go @ 4 MySQL @ 4 Terraform @ 6 GCP @ 6 Distributed Systems @ 4 Hiring @ 4 Bash @ 6 Communication @ 4 SRE @ 7 React @ 4 Debugging @ 4Details
About Customer.io
Over 7,500 companies - from scrappy startups to global brands - use our platform to send billions of emails, push notifications, in-app messages, and SMS every day. Customer.io powers automated communication that people actually want to receive.
We help teams send smarter, more relevant messages using real-time behavioral data. Under the hood: Go, React, Ember and AI help us ship fast and scale with confidence.
We’re looking for a Site Reliability Engineer to help us scale our infrastructure, reduce operational toil, and increase reliability as we grow. If you’ve worked on high-scale systems and love making platforms better for developers and customers alike, we’d love to meet you.
What We Value
- Ownership
You own problems end to end. You move fast, act like an owner, and thrive in ambiguity. You've led complex projects before, whether officially or not, and you're ready to do it again. - Engineers with product taste
You think like a user, not just an engineer. You think about performance, reliability, and how systems impact the customer experience. - A healthy skepticism for “the way things are done”
You bring rigor and creativity. Best practices matter - but never more than forward motion.
Responsibilities
- Build and scale infrastructure to support billions of messages per day and real-time events
- Automate deployments, alerting, and incident response
- Make our on-call better - clear alerts, solid documentation, and faster resolution
- Tune MySQL and other datastore performance and improve reliability across distributed systems
- Collaborate across teams to debug, ship, and support systems in production
- Share knowledge and raise the bar through sharing your progress publicly with short videos, thoughtful writing, and mentorship
- Leverage AI tools to prototype, move faster, and make better decisions
Requirements
- 7+ years in SRE or infrastructure roles, improving production systems at scale
- Deep MySQL experience - schema design, performance tuning, and operational tooling
- Fluency in cloud-native tech (GCP a plus) and Terraform
- Proficiency in Go and Bash for scripting and systems programming
- Skill in observability, incident response, and debugging distributed systems
- A preference for action over perfection, and pride in owning technical decisions
Compensation & Benefits
- Starting salary: $140,000 - $180,000 USD (or equivalent in local currency) depending on experience and subject to market rate adjustment.
- 100% coverage of medical, dental, vision, mental health, and supplemental insurance premiums for you and your family
- 16 weeks paid parental leave
- Unlimited PTO
- Stipends for remote work and wellness
- Professional development budget
- Inclusive benefits package and other perks (see full benefits link in original post)
Interview Process
- Application review
- Recruiter call (30 mins)
- Behavioral interview (60 mins)
- Take-home assignment
- Technical interview + assignment review call (90 mins)
- Background check and employment verifications for final candidates
Diversity & Misc
Customer.io recognizes systemic injustice's impact on diverse communities and is committed to inclusion and equity, bias-free hiring practices, and building an inclusive team culture.
Zoom is used for virtual interviews. Offers are extended in writing on official Customer.io letterhead. For questions contact [email protected].
Join us!
We believe in empathy, transparency, responsibility, and, yes, a little awkwardness. If you’re excited by what you read and want to build software that makes communication better for everyone—apply now.