Senior AI Software Engineer, GenAI Framework

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 6, GCP @ 4, AWS @ 4, Azure @ 4, Debugging @ 6, LLM @ 4, PyTorch @ 4, Cloud Computing @ 4, GPU @ 4

Details

NVIDIA NeMo is an open-source, scalable, and cloud-native framework designed for researchers and developers working on Large Language Models (LLM), Multimodal (MM), and Speech AI. The framework supports end-to-end model training including data curation, alignment, customization, evaluation, deployment, and tooling to optimize performance and user experience.

Responsibilities

Develop the open-source GenAI NeMo framework and Megatron Core.
Solve large-scale, end-to-end AI training and inference deployment challenges such as data curation, pre-processing, orchestrating and running model training and tuning, and model serving.
Work at the intersection of deep learning applications, libraries, frameworks, and the full software stack.
Tune and optimize the performance of deep learning frameworks and software components.
Research, prototype, and develop effective AI tools and pipelines.

Requirements

MS, PhD, or equivalent experience in Computer Science, AI, Applied Math, or a related field, with 5+ years of industry experience.
Experience with AI frameworks such as PyTorch or JAX, and/or inference and deployment environments such as TensorRT (TRT), ONNX, or Triton.
Proficient in Python programming, software design, debugging, performance analysis, test design, and documentation.
Proven record of working effectively across multiple engineering initiatives and advancing AI libraries with new innovations.
Solid understanding of deep learning fundamentals and techniques.

Ways to Stand Out

Experience with large-scale AI training and an understanding of compute system concepts, including latency/throughput bottlenecks, pipelining, multiprocessing, and performance tuning.
Prior experience with generative AI techniques applied to LLMs and multimodal learning (image, video, speech).
Knowledge of GPU/CPU architecture and numerical software.
Experience with cloud computing for AI training and inference pipelines on cloud service providers such as AWS, Azure, and GCP.
Contributions to open-source deep learning frameworks.

Benefits

Eligible for equity and employee benefits.
Work in an innovative, diverse, and inclusive environment at one of technology's most desirable employers.

The base salary range is 184,000 USD to 356,500 USD, determined based on location, experience, and peer pay.