Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Distributed Systems @ 3
Hiring @ 3
Networking @ 3
Debugging @ 6
Reporting @ 3
GPU @ 3
Observability @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
We are seeking a Tokens-as-a-Service (TaaS) Engineer to help build the systems that convert large-scale infrastructure capacity into measurable, reliable token throughput for OpenAI workloads.
In this role, you will work across performance benchmarking, tokenomics, model porting, infrastructure integration, systems tooling, and operational monitoring. You will help connect partner and first-party compute environments into OpenAI’s infrastructure stack, ensuring GPU capacity can be onboarded, measured, monitored, and optimized against real workload outcomes.
Responsibilities
- Develop systems and tooling to measure, monitor, and improve token throughput across first-party and partner-owned compute environments.
- Support performance benchmarking, tokenomics analysis, and model porting across heterogeneous infrastructure environments.
- Build tooling to integrate external or partner infrastructure into OpenAI’s internal compute, observability, and workload management systems.
- Develop and monitor operational metrics including billing, usage, SLAs, utilization, reliability, and throughput.
- Identify bottlenecks across hardware, networking, software, and workload enablement that prevent capacity from becoming productive tokens.
- Partner with compute, infrastructure, networking, finance, and operations teams to translate raw capacity into usable workload-serving capacity.
- Build dashboards, automation, and reporting systems that provide clear visibility into TaaS capacity, performance, and business outcomes.
Requirements
- Strong software engineering background with experience building systems, tooling, automation, or infrastructure platforms.
- Experience working across compute infrastructure, distributed systems, performance engineering, or production operations.
- Ability to reason about token throughput, utilization, benchmarking, infrastructure efficiency, and workload performance.
- Comfortable integrating external systems or partner environments into internal infrastructure stacks.
- Strong analytical and debugging skills across hardware, networking, software, and operational domains.
Preferred Skills
- Experience with GPU clusters, AI infrastructure, performance benchmarking, or workload optimization.
- Familiarity with model porting, inference/training workloads, token economics, or compute efficiency analysis.
- Experience building monitoring systems for billing, usage, SLAs, utilization, or infrastructure reliability.
- Background in systems engineering, infrastructure software, observability, distributed systems, or platform engineering.
Location and Workplace
- Primary location: San Francisco, United States
- Secondary location: Seattle, United States
- Workplace type: Hybrid
Compensation and Benefits
- Base salary range listed: $293,000 - $455,000 per year. Offers equity.
- Benefits include medical, dental, and vision insurance; HSA contributions; pre-tax FSAs; 401(k) with employer match; paid parental and medical leave; flexible PTO; paid company holidays and office closures; mental health and wellness support; employer-paid basic life and disability coverage; annual learning and development stipend; daily meals and meal credits; relocation support for eligible employees; and additional taxable fringe benefits.
About OpenAI
OpenAI is an AI research and deployment company focused on ensuring that general-purpose artificial intelligence benefits all of humanity. The company emphasizes safety, diverse perspectives, and inclusive hiring practices. Background checks are conducted in accordance with applicable law. OpenAI is an equal opportunity employer and provides reasonable accommodations to applicants with disabilities.