Research Engineer, Frontier Evals & Environments - Finance

at OpenAI

📍 San Francisco, United States

USD 200,000-370,000 per year

MIDDLE

✅ Hybrid

✅ Relocation

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Hiring @ 3 Communication @ 3 ChatGPT @ 3

Details

The Frontier Evals team builds north-star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer training, safety, and launch decisions. The team has open-sourced evaluations such as SWE-bench Verified, MLE-bench, PaperBench, and SWE-Lancer, and has built and run frontier evaluations for models including GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models and steering them towards good, this is the team for you.

Responsibilities

Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas.
Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evaluations to measure it end-to-end.
Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities.

Requirements

Strong engineering and statistical analysis skills, with at least 2–3 years of full-time technical experience.
Passion for Excel spreadsheets and/or finance.
Detail-oriented and thorough approach to work.
Team player comfortable doing a variety of tasks to move the team forward.
Passionate and knowledgeable about AGI/ASI measurement.
Ability to operate effectively in a dynamic and extremely fast-paced research environment and to scope and deliver projects end-to-end.

It would be great if you also have:

Prior background or domain expertise in finance, especially investment banking or private equity (e.g., through internships or prior jobs).
Ability to work cross-functionally.
Excellent communication skills.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. The company emphasizes safety and human needs in AI development and is an equal opportunity employer. Background checks will be administered consistent with applicable law. OpenAI is committed to providing reasonable accommodations to applicants with disabilities.

Compensation & Benefits

Compensation range: $200K–$370K (base range provided) and the role offers equity.
Base pay may vary depending on market location, job-related knowledge, skills, and experience. Total compensation may include equity and performance-related bonuses for eligible employees.
Benefits include medical, dental, and vision insurance with employer contributions to Health Savings Accounts; pre-tax Health FSA and Dependent Care FSA; commuter expense accounts; 401(k) with employer match; paid parental and medical/caregiver leave; paid time off (flexible PTO for exempt employees and up to 15 days annually for non-exempt employees); 13+ paid company holidays and periodic company office closures; mental health and wellness support; employer-paid basic life and disability coverage; annual learning and development stipend; daily meals in offices and meal delivery credits as eligible; relocation support for eligible employees; and additional taxable fringe benefits (charitable donation matching, wellness stipends, etc.).

More details about benefits are available to candidates during the hiring process.