Site Reliability Engineer - Cybersecurity

at xAI
USD 180,000-360,000 per year
MIDDLE
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 3 Grafana @ 3 Kubernetes @ 3 Prometheus @ 3 Terraform @ 3 Python @ 3 GitHub @ 3 GitHub Actions @ 3 CI/CD @ 3 Distributed Systems @ 3 AWS @ 3 Communication @ 6 SRE @ 3 Prioritization @ 6 Puppet @ 3 AI @ 3 Data Pipelines @ 3

Details

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence. The organization values hands-on contributors, initiative, strong prioritization, and clear communication.

About the role

The Cybersecurity / SRE team focuses on ensuring the security and reliability of X Money (and cross-over with the X Social platform). This role centers on securing and maintaining the reliability of X Money’s infrastructure, working closely with cross-functional teams to enhance security measures, improve system resilience, and implement automation-first best practices. The ideal candidate will have experience in banking, money transmission, and P2P payments and will work with large distributed systems and security platforms at scale.

Responsibilities

  • Build and secure mission-critical applications in a hybrid cloud environment.
  • Manage identities and roles effectively.
  • Monitor and remediate infrastructure to comply with regulations and best practices (e.g., PCI, NIST CSF).
  • Maintain a SIEM and data pipelines needed for reliable alerting.
  • Design and implement secure container standards and automation to enable frictionless developer workflows.
  • Maintain Kubernetes security aligned with current best practices.
  • Build, deploy, and maintain security operations infrastructure using Python, Terraform, and Puppet.
  • Secure and enhance CI/CD pipelines and integrate/maintain code scanning platforms.
  • Develop dashboards and alerts from security metrics.
  • Own security projects: identify issues and implement solutions.
  • Apply critical analysis and problem-solving skills.

Requirements (Basic Qualifications)

  • Proven experience securing hybrid AWS/on-premises environments, including IAM and overall security posture.
  • Strong proficiency in Python, Terraform, and Puppet.
  • Certifications like CISA, CRISC, CGEIT, Security+, CASP+, or similar preferred.
  • Deep expertise in Kubernetes and container security.
  • Hands-on expertise building GitHub Actions and workflows.
  • Extensive experience with Prometheus, Grafana, CloudWatch, and Karma.
  • Well versed in management and integrations of Wazuh.
  • Hands-on experience with security scanning tools (Semgrep, Trivy, Falco).
  • Proactive mindset with strong ownership, critical thinking, and analytical problem-solving skills.
  • Located in the SF Bay Area or willing to relocate.

Compensation and benefits

  • Base salary: $180,000 - $360,000 USD
  • Total rewards package also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Additional notes

  • Role emphasizes automation-first approach, large distributed systems, and security platforms at scale.