Software Engineer, Data Acquisition

at OpenAI
USD 325,000-405,000 per year
MIDDLE
✅ On-site
✅ Relocation

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 5 Kubernetes @ 3 Algorithms @ 3 Distributed Systems @ 3 Communication @ 6 Compliance @ 3

Details

The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support model training operations. The team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. This role involves building highly scalable data acquisition systems that handle petabytes of data and working closely with legal and compliance teams when needed.

Responsibilities

  • Own and lead engineering projects in data acquisition, including web crawling, data ingestion, and search.
  • Collaborate with sub-teams such as Data Processing, Architecture, and Scaling to ensure smooth data flow and system operability.
  • Work closely with the legal team to handle compliance and data privacy-related matters.
  • Develop and deploy highly scalable distributed systems capable of handling petabytes of data.
  • Architect and implement algorithms for data indexing and search capabilities.
  • Build and maintain backend services for data storage, including work with key-value databases and synchronization.
  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.
  • Conduct and analyze experiments on data to provide insights into system performance.

Requirements

  • BS/MS/PhD in Computer Science or a related field.
  • 4+ years of industry experience in software development.
  • Experience with large web crawlers is a plus.
  • Strong expertise in large stateful distributed systems and data processing.
  • Proficiency in Kubernetes and Infrastructure-as-Code concepts.
  • Willingness and enthusiasm for trying new approaches and technologies.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong written and verbal communication skills.

Compensation & Benefits

  • Base salary range: $325,000 - $405,000 (offers equity). Total compensation may include equity and performance-related bonuses.
  • Medical, dental, and vision insurance with employer contributions to Health Savings Accounts.
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses.
  • 401(k) with employer match.
  • Paid parental leave and paid medical/caregiver leave.
  • Flexible PTO for exempt employees and up to 15 days annually for non-exempt employees.
  • 13+ paid company holidays and additional company office closures.
  • Mental health and wellness support; employer-paid basic life and disability coverage.
  • Annual learning and development stipend.
  • Daily meals in offices and meal delivery credits as eligible.
  • Relocation support for eligible employees.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring general-purpose artificial intelligence benefits all of humanity. The company focuses on safe development and deployment of AI systems and seeks diverse perspectives and experiences. OpenAI is an equal opportunity employer and provides reasonable accommodations to applicants with disabilities.