Intermediate Site Reliability Engineer, Environment Automation
Used Tools & Technologies
Go IaCRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Security @ 3
Ansible @ 2
Kubernetes @ 3
Terraform @ 2
Distributed Systems @ 3
Git @ 3
SRE @ 3
Debugging @ 3
Observability @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
GitLab is the intelligent orchestration platform for DevSecOps. In this role on the Dedicated team you'll help power hundreds of isolated GitLab environments for customers by treating everything as code and contributing to automation across the entire lifecycle — from initial provisioning to day-to-day operations. You will collaborate with senior SREs to keep environments reliable, scalable, secure, and consistent while managing many tenant environments in parallel.
Responsibilities
- Contribute to automating operational tasks across many GitLab environments, from provisioning and configuration updates to upgrades and routine maintenance, reducing manual work and improving reliability under guidance of senior team members.
- Design, deploy, and maintain GitLab environments across cloud providers using infrastructure as code, deployment packages, and Kubernetes.
- Help build and refine the observability stack for multi-tenant environments to monitor Kubernetes, cloud services, and GitLab applications for early issue detection and capacity tracking.
- Assist in responding to platform alerts and incidents, collaborating with Environment Automation SREs and engineering teams to troubleshoot production issues across multiple tenants and document findings.
- Support planning and implementation of infrastructure changes, capacity expansions, and new service rollouts for Dedicated and other managed GitLab environments.
- Develop and maintain scripts, automation tools, and infrastructure-as-code workflows that manage parts of the GitLab environment lifecycle to enable more repeatable, self-service operations.
- Apply and help implement best practices for running GitLab on Kubernetes and cloud platforms, focusing on reliability, performance, and security while keeping environments consistent.
- Participate in the on-call rotation for production GitLab environments, helping triage and mitigate incidents and contributing to post-incident reviews.
- Document operational tasks, runbooks, and lessons learned so they become clear, repeatable processes and candidates for future automation.
Requirements
- Experience working as an SRE or in a similar role operating production infrastructure, with interest in automating lifecycle of many environments or tenants in parallel.
- Hands-on experience with backend programming languages such as Golang, able to read, understand, and modify infrastructure tools.
- Hands-on experience running Kubernetes-based workloads in production, including deployments, rollouts, and debugging issues like crash loops, failed health checks, and scheduling problems.
- Familiarity with infrastructure automation and configuration management tools such as Terraform and Ansible, including modules, variables, and safely managing state for multiple environments.
- Solid understanding of Git-based workflows and infrastructure-as-code practices; ability to contribute to reusable modules, templates, and pipelines.
- Experience in distributed systems or cloud-based production environments (ideally SaaS or managed services), with comfort participating in incident response and on-call rotations under guidance.
- Proactive mindset focused on automation and documentation; look for opportunities to remove manual steps and turn repetitive tasks into self-service tools.
- Comfort working asynchronously across distributed teams and alignment with GitLab values of collaboration, transparency, and iteration.
About the team
The team owns the lifecycle of GitLab environments that power GitLab Dedicated: automated single-tenant GitLab instances and supporting services. The team operates fully distributed and asynchronously across regions and focuses on infrastructure automation, environment lifecycle design, incident response, capacity planning, and reducing complexity by leveraging cloud managed products and services.
Compensation (United States Salary Range)
$103,600 - $222,000 USD
How GitLab Supports Full-Time Employees
- Benefits to support health, finances, and well-being
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and Development Fund
- Parental leave
- Home office support