Data Engineer, Analytics

at OpenAI
USD 255,000-405,000 per year
MIDDLE
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Marketing @ 3 Security @ 3 Python @ 5 Scala @ 5 Spark @ 3 ETL @ 3 Java @ 5 Airflow @ 3 Flink @ 3 Data Science @ 3 Dagster @ 3 Data Engineering @ 3 Hadoop @ 3 ChatGPT @ 3 Compliance @ 3

Details

About the team

The Applied team works across research, engineering, product, and design to bring OpenAI’s technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the role

We're seeking a Data Engineer to take the lead in building our data pipelines and core tables for OpenAI. These pipelines are crucial for powering analyses, safety systems that guide business decisions, product growth, and preventing bad actors. This role provides the opportunity to collaborate closely with the researchers behind ChatGPT and help them train new models to deliver to users. As we continue our rapid growth, we value data-driven insights, and your contributions will play a pivotal role in our trajectory.

This role is exclusively based in our San Francisco HQ. We offer relocation assistance to new employees.

Responsibilities

  • Design, build, and manage data pipelines, ensuring all user event data is seamlessly integrated into our data warehouse.
  • Develop canonical datasets to track key product metrics including user growth, engagement, and revenue.
  • Work collaboratively with Infrastructure, Data Science, Product, Marketing, Finance, and Research to understand data needs and provide solutions.
  • Implement robust and fault-tolerant systems for data ingestion and processing.
  • Participate in data architecture and engineering decisions, contributing experience and knowledge.
  • Ensure the security, integrity, and compliance of data according to industry and company standards.

Requirements

  • 3+ years of experience as a data engineer and 8+ years of any software engineering experience (including data engineering).
  • Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java.
  • Experience with distributed processing technologies and frameworks, such as Hadoop, Flink, and distributed storage systems (e.g., HDFS, S3).
  • Expertise with ETL schedulers such as Airflow, Dagster, Prefect, or similar frameworks.
  • Solid understanding of Spark and ability to write, debug, and optimize Spark code.
  • Strong experience in designing fault-tolerant data ingestion and processing systems and participating in data architecture decisions.

Benefits

  • Competitive base pay (see job posting for range), equity, and potential performance-related bonuses.
  • Medical, dental, and vision insurance with employer HSA contributions.
  • Pre-tax accounts (Health FSA, Dependent Care FSA, commuter benefits).
  • 401(k) with employer match.
  • Paid parental, medical, and caregiver leave.
  • Flexible PTO for exempt employees and paid days off for non-exempt employees.
  • 13+ paid company holidays and coordinated office closures.
  • Mental health and wellness support; employer-paid basic life and disability coverage.
  • Annual learning and development stipend.
  • Daily meals in offices and meal delivery credits as eligible.
  • Relocation support for eligible employees.

Location

This role is based at OpenAI's San Francisco headquarters (on-site). Relocation assistance is offered for new employees.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of AI capabilities and seek to safely deploy them to the world through our products. We value diverse perspectives and are an equal opportunity employer.

Additional notes

Background checks will be administered in accordance with applicable law. OpenAI is committed to providing reasonable accommodations to applicants with disabilities.