Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Python @ 5
SQL @ 6
Spark @ 3
dbt @ 3
ETL @ 3
Airflow @ 3
Dagster @ 3
Data Engineering @ 5
ELT @ 3
Reporting @ 3
Snowflake @ 3
Observability @ 3
AI @ 3
Data Pipelines @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
OpenAI's Scaling Analytics team builds the data foundations that enable infrastructure deployment, hardware operations, supply chain, capacity planning, and site execution. The team develops data models, pipelines, metrics, and reporting systems to transform fragmented operational data into actionable insights as OpenAI's Industrial Compute expands across global data center campuses.
This is a full-time, hybrid role based in San Francisco. The team partners with Hardware Operations, Capacity Planning, Supply Chain, Infrastructure Delivery, Finance, and Engineering to create reliable data products that support critical operational and strategic decisions.
Responsibilities
- Design, build, and maintain scalable data pipelines supporting infrastructure deployment, operations, capacity planning, and supply chain functions.
- Develop trusted datasets and reporting systems that provide visibility into hardware inventory, deployment status, site readiness, capacity utilization, and operational performance.
- Partner with cross-functional stakeholders to define metrics, establish data standards, and improve decision-making across infrastructure organizations.
- Create scalable data models that enable consistent reporting and analytics across multiple data sources and operational systems.
- Improve data quality, lineage, observability, and governance practices across critical infrastructure datasets.
- Support executive reporting, operational reviews, forecasting exercises, and strategic planning initiatives through reliable analytical foundations.
- Collaborate with engineering teams to integrate new data sources and operational telemetry into existing analytics ecosystems.
- Build solutions that reduce manual reporting efforts and improve the speed and accuracy of infrastructure decision-making.
- Document systems, processes, and analytical frameworks to improve long-term maintainability and organizational resilience.
Requirements
- 5+ years of experience building and maintaining production data pipelines and analytical systems.
- Strong proficiency in SQL and experience designing scalable data models.
- Proficiency in Python or another programming language commonly used for data engineering.
- Experience working with modern data warehouses (e.g., Snowflake, BigQuery, Redshift) and orchestration frameworks (e.g., Airflow, Dagster).
- Experience designing reliable ETL/ELT workflows with a focus on maintainability, performance, and operational excellence.
- Experience partnering with cross-functional stakeholders to translate business requirements into technical solutions.
- Experience implementing data quality checks, monitoring, and observability practices in production environments.
Preferred Skills
- Experience supporting infrastructure, hardware operations, supply chain, manufacturing, logistics, or capacity planning organizations.
- Familiarity with large-scale operational telemetry and business-critical reporting environments.
- Experience with distributed processing frameworks such as Spark.
- Experience with transformation frameworks such as dbt.
- Experience developing executive reporting and operational review metrics.
- Experience operating in fast-paced, ambiguous environments with evolving priorities.
- Interest in building the analytical foundations that support large AI infrastructure deployments.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. OpenAI emphasizes safety, diverse perspectives, and equitable outcomes in building and deploying AI systems.
Benefits
- Base pay varies by market location, knowledge, skills, and experience. Total compensation includes equity and performance-related bonuses for eligible employees.
- Medical, dental, and vision insurance with employer contributions to Health Savings Accounts.
- Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses.
- 401(k) retirement plan with employer match.
- Paid parental leave and paid medical/caregiver leave.
- Flexible PTO for exempt employees and up to 15 days annually for non-exempt employees.
- 13+ paid company holidays and additional paid office closures.
- Mental health and wellness support; employer-paid basic life and disability coverage.
- Annual learning and development stipend.
- Daily meals in offices and meal delivery credits as eligible.
- Relocation support for eligible employees.
- Additional taxable fringe benefits (charitable donation matching, wellness stipends).