Site Reliability Engineer (SRE) - Multiple Teams

at X

📍 Palo Alto, United States

$127,000-297,000 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Security @ 4 System Administration @ 7 Go @ 6 Linux @ 7 Python @ 6 Distributed Systems @ 4 Communication @ 7 Networking @ 4 Perl @ 6 SRE @ 4 Debugging @ 7

Details

Are you prepared to join the X team and help build the ultimate real-time information-sharing app, revolutionizing how people connect? At X, we’re on a mission to become the trusted global digital public square, committed to protecting freedom of speech and building the future unlimited interactivity. Our goal is to empower every user to freely create and share ideas, fostering open public discourse without barriers. Join us in shaping this thrilling journey where your contribution will be invaluable to our success!

Responsibilities

Join X's dynamic Site Reliability Engineering teams across various domains and locations. As an SRE, you will play a crucial role in ensuring the high performance, reliability, and security of our systems. Each team focuses on different aspects of our infrastructure.
CDN Senior/Staff Site Reliability Engineer - Edge Services Network Engineering: Shape the vision and delivery of CDN services at X. Act as the point of contact for CDN edge services operations. Design and implement multi-tier content delivery, caching systems, and more. Plan, coordinate, manage, and scale CDN platform node deployments.
Staff SRE Engineer - Traffic Front End: Oversee the operation of software and services in the Traffic Front End team. Leverage deep expertise in networking, routing, and traffic patterns. Develop monitoring, alerting, and incident response solutions. Contribute to ongoing enhancement efforts and champion reliability engineering best practices.
Senior/Staff Site Reliability Engineer - Distributed Systems: Leverage deep expertise in service discovery, distributed services, and cloud infrastructure. Enhance our systems for managing and deploying application configuration across infrastructure. Develop monitoring, alerting, and incident response solutions. Drive continuous improvements in performance, reliability, and efficiency across the distributed systems infrastructure.

Requirements

Highly motivated team player with initiative.
Strong debugging, documentation, and communication skills.
Ability to work collaboratively in a dynamic environment.
Availability for occasional travel (up to 20%).
Bachelor's degree or above in Computer Science, Engineering, or related field.
5+ to 10+ years of experience in site reliability engineering or related roles.
Expertise in relevant technologies, such as CDN operations, containerization, incident management, traffic routing, and distributed systems.
Proficiency in scripting and automation (Python, Perl, Go).
Strong knowledge of Unix/Linux system administration at scale.

Benefits

Join us in shaping the future of X!