AI Training Infrastructure Engineer - Post Training
San Francisco, United States
USD 220,000-290,000 per year
Used Tools & Technologies
Not specified
Required Skills & Competences
Python @ 6, Algorithms @ 3, LLM @ 3, PyTorch @ 3, CUDA @ 6
Details
Perplexity is seeking experienced AI Research Engineers and Scientists to continue improving our in-house online LLMs, the Sonar models. You will work with the team to build a robust and effective training framework (on top of Megatron/PyTorch), especially for post-training LLMs.
Responsibilities
- Build a post-training framework that can run cutting-edge model training jobs at scale
- Implement the infrastructure and components needed to support the latest models and algorithms, such as SFT, RL (DPO/GRPO), and more
- Own the full-stack data, training, and eval pipelines required to post-train LLMs
- Work closely with engineering teams to integrate Sonar models into our product
Requirements
- Proven experience building frameworks for large-scale LLMs
- Strong in Python/PyTorch; C++/CUDA is a plus
- Self-starter with a willingness to take ownership of tasks
- Passion for tackling challenging problems
- Minimum of 6 years of experience working on relevant projects
Bonus
- PhD in AI/ML/Systems or related areas
- Experience building LLM training frameworks, especially for post-training
The cash compensation range for this role is $220,000 - $290,000.
Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.
Equity: In addition to the base salary, equity is part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.