Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Kubernetes @ 3
GCP @ 6
Distributed Systems @ 3
AWS @ 6
Azure @ 6
Mentoring @ 5
Networking @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
We are seeking a Cloud Infrastructure Engineer to help design and evolve the platforms that power OpenAI’s products. This role is both deeply technical and highly strategic, requiring strong ownership, sound judgment, and the ability to partner effectively across engineering, product, and research organizations.
Responsibilities
- Design and build scalable, reliable, and secure infrastructure platforms that power OpenAI products
- Evolve cloud infrastructure abstractions that enable rapid product development across teams
- Architect systems to support significant growth, performance, and operational complexity
- Improve server orchestration, networking, and distributed systems reliability
- Influence technical direction and infrastructure strategy across multiple teams
- Partner closely with product, research, and engineering teams to align infrastructure with evolving needs
- Own operational excellence, including participation in on-call rotations and incident response
- Mentor engineers and raise the overall technical bar of the organization
- Contribute to a culture of high ownership, low ego, and thoughtful collaboration
Requirements
- 8+ years of experience building and operating large-scale infrastructure systems
- Deep expertise in Kubernetes and container orchestration at scale
- Strong experience designing cloud abstractions and platform infrastructure (AWS, GCP, Azure, or similar)
- Proven track record of leading complex technical initiatives across teams
- Experience operating highly reliable, secure, and scalable distributed systems
- Strong systems thinking with the ability to balance velocity, reliability, and simplicity
- Comfortable operating in ambiguous, fast-moving environments
- Experience mentoring engineers and influencing technical direction
- Passion for building infrastructure that enables impactful products
Why This Role Matters
This role is critical to enabling OpenAI to scale its products and infrastructure to the next level. Your work will directly impact the reliability, scalability, and velocity of OpenAI’s technology—helping bring advanced AI capabilities to millions of users safely and effectively.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. The company values diverse perspectives and is an equal opportunity employer. Background checks will be administered in accordance with applicable law. OpenAI is committed to providing reasonable accommodations to applicants with disabilities.
Benefits
- Medical, dental, and vision insurance with employer contributions to Health Savings Accounts
- Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses
- 401(k) retirement plan with employer match
- Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave
- Flexible paid time off and paid company holidays
- Mental health and wellness support
- Employer-paid basic life and disability coverage
- Annual learning and development stipend
- Daily meals in offices and meal delivery credits as eligible
- Relocation support for eligible employees
- Additional taxable fringe benefits such as charitable donation matching and wellness stipends