Solutions Architect, AI Models
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 3 Linux @ 5 Python @ 5 Algorithms @ 3 Data Science @ 3 TensorFlow @ 5 Mathematics @ 3 Debugging @ 5 API @ 3 PyTorch @ 5 GPU @ 3Details
Do you want to be part of the team that brings innovative Artificial Intelligence (AI) from research to reality? We are looking for a Solution Architect to join the NVIDIA AI Enterprise (NVAIE) SA Segment team. We specialize in the newest technology and advances in deep learning, Generative AI, and Cloud. The vision of the NVAIE Segment team is to use our deep expertise, at the intersection of research and engineering, to guide and enable the successful adoption of NVIDIA AI software in the enterprise.
If you are passionate about AI and how it can be applied to address real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI and is looking for developers like you to design and build enterprise AI solutions using our newest technology. As a member of the Solution Architecture team, you will work closely with customers and partners to solve hard problems in customizing and deploying AI workloads at scale.
Responsibilities
- Develop end-to-end AI solutions for enterprise use cases and help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise.
- Tackle sophisticated AI challenges across the AI model lifecycle β from data preprocessing and orchestration to training, post-training, evaluation, and optimized deployment.
- Guide customers in implementing next-generation model distillation, domain adaptation, reinforcement learning (RL) and post-training algorithms, using NVIDIA frameworks.
- Help improve NVIDIA products and build creative solutions to overcome scaling challenges at the intersection of computer architecture, libraries, and AI applications.
- Contribute to the wider organization and community by sharing expert knowledge (open-source contributions, product engineering, publishing findings, delivering hands-on training).
Requirements
- BS, MS, or Ph.D. in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).
- 5+ years of experience with AI frameworks such as PyTorch, JAX, or TensorFlow, and libraries like Hugging Face Transformers.
- Proficiency in Python programming, software design, debugging, and performance analysis, with at least 5+ years of experience in a Linux environment.
- Hands-on experience with full AI model lifecycle, including pre-training, supervised fine-tuning, post-training techniques such as reinforcement learning (RL), and model evaluation.
- Expertise in distributed computing methodologies, including model and data parallelism.
- Experience with distributed computing tools, like SLURM and Kubernetes, for training large models on GPUs.
- Ability to learn fast and quickly adapt to change.
- Clear written and oral communications skills with the ability to effectively collaborate with executives and engineering teams.
Ways to stand out
- Experience with and/or contributions to open-source NVIDIA AI Enterprise deep learning libraries and frameworks, particularly NeMo, Megatron Core, or NeMo-RL.
- Hands-on experience in large-scale foundation model training, accuracy, and performance profiling.
- Prior experience with AI model training techniques applied to multi-modal data (audio, image, and video).
- Knowledge of NVIDIA GPU/CPU architecture and its impact on software performance.
- Willingness and ability to dig into unfamiliar territories to solve complex problems relying on experience from previous work.
Compensation & Benefits
- Base salary range: 148,000 USD - 235,750 USD (determined based on location, experience, and pay of employees in similar positions).
- Eligible for equity and benefits (see NVIDIA benefits page).
Other details
- Applications for this job will be accepted at least until October 21, 2025.
- NVIDIA is an equal opportunity employer committed to fostering a diverse work environment and does not discriminate on the basis of protected characteristics.