AI Systems Engineer - SF or Palo Alto
USD 190,000-250,000 per year
Required Skills & Competences
TensorFlow, API, LLM, PyTorch
Details
We are looking for an AI inference engineer to join our growing team.
Responsibilities
- Develop APIs for AI inference that will be used by both internal and external customers
- Help set up our infrastructure and deploy efficiently and reliably
- Benchmark our inference stack and reduce bottlenecks throughout it
- Improve the reliability and observability of our systems and respond to system outages
Requirements
- Experience with ML systems and deep learning frameworks (e.g., PyTorch, TensorFlow, ONNX)
- High-level familiarity with LLM architecture and its key pieces (multi-head and multi-/grouped-query attention, as well as common layers)
- Experience deploying reliable, distributed, real-time systems at scale
Compensation
The cash compensation range for this role is $190,000 - $250,000 per year. Equity may be part of the total compensation package.
Benefits
Benefits include comprehensive health, dental, and vision insurance for you and your dependents, plus a 401(k) plan.