AI Systems Engineer - SF Or Palo Alto

USD 190,000-250,000 per year
MIDDLE
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

TensorFlow @ 3 API @ 3 LLM @ 2 PyTorch @ 3

Details

We are looking for an AI Inference engineer to join our growing team.

Responsibilities

  • Develop APIs for AI inference that will be used by both internal and external customers
  • Help set up our infrastructure and deploy efficiently and reliably
  • Benchmark and reduce bottlenecks throughout our inference stack
  • Improve the reliability and observability of our systems and respond to system outages

Requirements

  • Experience with ML systems and some deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
  • High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)
  • Experience with deploying reliable, distributed, real-time systems at scale

Compensation

The cash compensation range for this role is $190,000 - $250,000 per year. Equity may be part of the total compensation package.

Benefits

Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.