Software Engineer, Inference - Performance Optimization

at OpenAI
USD 295,000-555,000 per year
MIDDLE
✅ On-site
✅ Relocation

Required Skills & Competences

  • Distributed Systems @ 3
  • Networking @ 3
  • AI @ 3
  • Profiling @ 3

Details

Our team analyzes inference stack performance across the application, model, and fleet layers to identify bottlenecks and drive faster, cheaper inference. We combine systems profiling, benchmarking, and analysis to understand where time and cost are spent, then turn that understanding into performance optimizations and models that project performance and capacity needs for future launches.

About the Role

In this role, you will model inference performance across application, model, and fleet layers with higher fidelity. You will build cost-to-serve estimates from microbenchmarks and create tools that help cross-functional teams reason about latency, capacity, utilization, and cost tradeoffs.

Responsibilities

  • Build and refine performance models that translate microbenchmark results into cost-to-serve estimates.
  • Analyze inference workloads end to end across applications, models, and fleet infrastructure.
  • Enhance tooling that identifies latency and throughput bottlenecks across layers.
  • Partner with other teams to turn performance insights into concrete improvements and project how future changes affect inference.

Requirements / Who You Are

  • Enjoy reasoning from first principles about distributed systems, model inference, and hardware efficiency.
  • Are comfortable working across abstraction layers, from application behavior to kernels, accelerators, networking, and fleet scheduling.
  • Have deep expertise in performance profiling, benchmarking, analysis, and optimization.
  • Enjoy collaborating with engineering and research teams to improve real production systems.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. We are an equal opportunity employer and provide reasonable accommodations to applicants with disabilities.

Benefits

  • Medical, dental, and vision insurance with employer contributions to Health Savings Accounts.
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses.
  • 401(k) retirement plan with employer match.
  • Paid parental leave and paid medical/caregiver leave.
  • Flexible paid time off for exempt employees and up to 15 days annually for non-exempt employees.
  • 13+ paid company holidays and additional company office closures.
  • Mental health and wellness support; employer-paid basic life and disability coverage.
  • Annual learning and development stipend.
  • Daily meals in offices and meal delivery credits as eligible.
  • Relocation support for eligible employees.
  • Additional taxable fringe benefits such as charitable donation matching and wellness stipends.