Senior Site Reliability Engineer
Required Skills & Competences
Software Development, MySQL, Terraform, Hiring, Bash, Communication, SRE
Details
We are looking for a talented Senior Site Reliability Engineer to join our SRE team!
You will tackle complex challenges by designing and implementing scalable, reliable infrastructure and services that power the future of customer engagement technology. In this pivotal role, you'll leverage your extensive expertise in backend systems and infrastructure management to enhance the performance and reliability of our platforms. Your contributions will directly shape the architecture and operational excellence our product needs to thrive.
Responsibilities
- Architect and maintain critical infrastructure to enable Customer.io to scale and handle real-time processing of billions of messages.
- Strategically plan and implement repeatable infrastructure growth to meet evolving demands.
- Streamline and automate processes for efficiency and reliability, removing manual toil.
- Participate in on-call rotations to swiftly address availability incidents and support technical engineers with customer-related issues.
- Develop observability to ensure comprehensive monitoring and effective alerting of infrastructure and applications.
- Troubleshoot and resolve production issues across various services and stack levels.
- Contribute to a collaborative and supportive team environment, fostering individual, professional, and team growth.
- Engage in continuous learning and knowledge sharing through code reviews, pair programming, and team collaborations to refine best practices.
Requirements
- 7+ years of professional experience as a Site Reliability Engineer, with proven experience leading large, complex projects affecting production SaaS environments.
- Professional experience with relational database systems, particularly MySQL, including managing servers and tuning performance.
- Proven experience handling scale, reliability, and performance challenges for distributed applications on cloud infrastructure (Google Cloud Platform experience is advantageous), using both managed and self-hosted solutions.
- Proven ability to build cloud infrastructure using Terraform and develop operational tooling in various languages including Golang and Bash.
- Deep knowledge of UNIX environments and modern collaborative development practices.
- Excellent communication skills, both verbal and written, with a collaborative mindset.
- Ability to work autonomously in your timezone with minimal guidance.
- Demonstrated ability to influence product direction and contribute technical insights that help drive business value.
- Strong focus on proactive identification and resolution of issues in production environments.
- Self-starter who thrives in both synchronous and asynchronous work environments.
- AI-forward mindset: excited by how AI reshapes software development and actively integrating AI tools into your workflow.
Benefits
- Unlimited PTO with encouragement for at least 20 vacation days, plus holidays and sick days.
- 16 weeks paid parental leave (including adoption and foster care).
- 100% paid insurance premiums for employees and dependents.
- 401(k) retirement matching up to 5%.
- $1,500 annual healthy lifestyle budget.
- $250 monthly Remote Work Stipend for home internet, cell phone, and other remote work costs.
- $300 monthly co-working space rental reimbursement.
- $2,000 yearly professional education budget for conferences, courses, workshops, books, etc.
- $1,500 home office budget for ergonomic workspace setup.
- 1 month sabbatical after 5 years at Customer.io.
- Opportunities to meet peers in person throughout the year.
All final candidates undergo background checks and employment verifications.
Customer.io is committed to inclusion, equity, and building an inclusive team culture with bias-free hiring practices.