Senior Infrastructure Engineer

at Groq
πŸ“ United States
USD 132,100-279,800 per year
SENIOR
βœ… Remote

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Go @ 4 Kubernetes @ 4 Linux @ 4 Terraform @ 4 Python @ 4 Hiring @ 4 Bash @ 4 Git @ 4 Networking @ 4

Details

At Groq, we are building a custom cloud from the ground up - one data center at a time. The Compute Storage team owns the systems that turn racks of bare metal into production-ready Kubernetes clusters powering the next generation of AI workloads. This role is hands-on and focused on fully automating deployment and lifecycle management of the Groq Cloud server fleet. You will work closely with data center, network, and platform teams to define and develop tools and automation that enable seamless deployment and management of Groq compute nodes and storage clusters.

Responsibilities

  • Develop robust, scalable automation solutions (Go, Python, Bash) to streamline and standardize deployment workflows across global data center environments.
  • Collaborate cross-functionally with data center operations, networking, and platform teams to ensure infrastructure is fully integrated and production-ready.
  • Develop automation to ensure production machines and clusters consistently meet optimal health standards in a timely manner.
  • Define best practices and standards for infrastructure-as-code and configuration management using Git, Flux, Terraform, and related tools.
  • Set technical direction and maintain high-quality system documentation, operational runbooks, and internal tooling to improve resilience, repeatability, and observability of the infrastructure stack.

Requirements

  • Experience deploying and supporting Linux and Kubernetes systems at scale.
  • Familiarity with infrastructure-as-code and Git-based workflows (for example: Terraform, Flux, Kustomize).
  • Ability to write and maintain basic tooling in modern languages such as Go and Python; comfortable with Bash scripting.
  • Understanding of networking fundamentals, including IPAM, VLANs, DHCP, and DNS.
  • Working knowledge of storage concepts (block vs object, NFS, RAID, etc.).
  • Strong sense of ownership and willingness to work through ambiguity.

Nice to Have

  • Experience provisioning physical machines in a data center environment.
  • Exposure to Talos Linux, Kubernetes bootstrapping, or Kubernetes platform engineering.
  • Previous collaboration with facilities, hardware, or network teams in an operational role.

Attributes of a Groqster

  • Humility β€” Egos are checked at the door
  • Collaborative & Team Savvy β€” We make up the smartest person in the room, together
  • Growth & Giver Mindset β€” Learn it all versus know it all; we share knowledge generously
  • Curious & Innovative β€” Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness β€” No limit thinking, fueling informed risk taking

Compensation & Other Details

  • Base salary range (United States): $132,100 to $279,800, determined by location, skills, qualifications, experience, and internal benchmarks. Compensation for candidates outside the USA will depend on the local market.
  • Remote-friendly role (#LI-Remote). Groq is an Equal Opportunity Employer and is committed to inclusive hiring and reasonable accommodations.