Senior Machine Learning Engineer, Quantized Training
at Nvidia
📍 Seattle, United States
$180,000-339,200 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Machine Learning @ 4 Communication @ 7 LLM @ 4 PyTorch @ 4Details
NVIDIA is seeking machine learning engineers to support next-generation recipes for mixed-precision training. In this role you will (1) distill LLM research literature into its core, (2) translate literature into experiments at scale, (3) create insights to support or refute the efficacy of a technique, and (4) generate reproducible training recipes.
Responsibilities
- Review state-of-the-art literature in quantized training
- Build robust, reproducible, and portable training recipes
- Provide engineering support to customers using HW and SW approaches
- Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in quantization
- Work with production SW teams to realize recipes in production workflows
Requirements
- Experience with PyTorch or similar frameworks such as jax/xla/etc
- Proficient in the math of machine learning
- Familiarity with FP8 for training
- Published research or significant contributions to the field of AI, particularly in algorithm development for hardware-software co-design
- PhD, M.S. degree or equivalent experience in Computer Science or a related field
- 5+ YoE working in ML / AI
- Strong written and oral communication skills
- Strong programming skills and ability to debug ML systems
Ways to stand out from the crowd
- Experience in LLM training, fine-tuning and optimization (quantization, sparsity)
- Familiarity with MX formats for training
- Experience with Transformer Engine, Megatron-LM, or NeMo
GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. This opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!