Compute Optimization Researcher/Engineer

at OpenAI

📍 San Francisco, United States
📍 Seattle, United States

USD 293,000-455,000 per year

MIDDLE

✅ Hybrid

✅ Relocation

Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level. The levels are defined as follows:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Python @ 5, SQL @ 5, Spark @ 5, Communication @ 3, Mathematics @ 3, Networking @ 3, Planning @ 3, Pandas @ 5, GPU @ 3, AI @ 3

Details

OpenAI’s infrastructure organization builds and operates the systems that power frontier AI workloads at global scale. As compute expands across first-party data centers, cloud providers, and strategic partners, efficient capacity planning and resource allocation are critical. The Compute Optimization team develops models, decision systems, and planning frameworks that optimize how compute resources are deployed, scheduled, and scaled across a rapidly growing global environment.

About the Role

We are seeking a Compute Optimization Researcher/Engineer to build the systems that maximize the value of OpenAI’s global compute capacity. You will work on optimization problems spanning capacity allocation, demand forecasting, cluster planning, workload placement, and infrastructure utilization. This role combines mathematical modeling, software systems, and cross-functional execution to improve planning and consumption across GPU clusters, networking, storage, and data center environments. The role is based in San Francisco, CA, with a hybrid work model (3 days in office per week), and offers relocation assistance.

Responsibilities

Build optimization models for compute allocation, workload scheduling, and cluster utilization.
Develop planning systems that balance supply, demand, cost, latency, and reliability constraints.
Create forecasting frameworks for GPU demand, infrastructure growth, and capacity needs.
Design decision tools for allocating compute across internal teams, products, and strategic priorities.
Partner with architecture, infrastructure engineering, finance, and operations teams to translate business needs into mathematical models.
Integrate multiple operational data sources into planning systems and optimization workflows.
Improve utilization of GPUs, networking, power, cooling, and storage infrastructure.
Analyze tradeoffs across first-party data centers, cloud providers, and hybrid environments.
Build dashboards, metrics, and operational tooling for capacity decision-making.
Lead ambiguous, cross-functional initiatives that improve infrastructure efficiency at scale.
Present recommendations clearly to technical leaders and executives.
Continuously refine models based on changing workloads, supply constraints, and business priorities.

Requirements

Doctoral degree in Computer Science, Engineering, Mathematics, Operations Research, Economics, or a related field.
5+ years of experience in optimization, planning, infrastructure analytics, or systems engineering.
Strong experience with linear programming, mixed-integer optimization, convex optimization, simulation, or forecasting methods.
Proficiency in Python and data tooling (SQL, Pandas, Spark, etc.).
Experience translating real-world business constraints into scalable optimization systems.
Strong analytical problem-solving skills and comfort operating in ambiguous environments.
Ability to influence cross-functional stakeholders without formal authority.
Excellent communication skills with both technical and non-technical audiences.

Preferred Qualifications

Experience with large-scale infrastructure, cloud capacity planning, or data center operations.
Familiarity with tools such as Gurobi, CPLEX, CVXPY, Pyomo, or similar solvers.
Experience optimizing GPU fleets, networking systems, or distributed compute environments.
Background in supply-demand planning, logistics, marketplace optimization, or resource scheduling.
Experience working in fast-scaling technology environments.

Benefits

The base pay range is listed for this role; total compensation also includes equity and potential performance-related bonus(es).
Medical, dental, and vision insurance with employer contributions to Health Savings Accounts.
Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses.
401(k) retirement plan with employer match.
Paid parental leave and paid medical and caregiver leave.
Paid time off (flexible PTO for exempt employees; up to 15 days annually for non-exempt employees).
13+ paid company holidays and additional coordinated company office closures; paid sick or safe time as required by law.
Mental health and wellness support; employer-paid basic life and disability coverage.
Annual learning and development stipend.
Daily meals in offices and meal delivery credits as eligible.
Relocation support for eligible employees.
Additional taxable fringe benefits such as charitable donation matching and wellness stipends.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of AI capabilities and seek to safely deploy them through our products. OpenAI is an equal opportunity employer and provides reasonable accommodations to applicants with disabilities. Background checks will be administered in accordance with applicable law.