Required Skills & Competences
Marketing @ 4, Kubernetes @ 3, Python @ 7, GitHub @ 7, MLOps @ 3, Communication @ 7, Mathematics @ 4, Debugging @ 4, PyTorch @ 4, CUDA @ 4, GPU @ 4
Details
NVIDIA is looking for an AI Solutions Architect with hands-on experience in efficient AI model training and/or deployment for a customer-facing role. The role involves accelerating customer workloads, leading technical engagements around NVIDIA software and technologies with top technology companies, and keeping track of emerging industry trends.
Responsibilities
- Collaborate closely with customers to improve workload performance and reduce infrastructure costs.
- Lead and develop proofs-of-concept for AI solutions in the Consumer Internet industry, including LLMs and recommenders, and build collateral (notebook/code) as needed.
- Develop and debug software for NVIDIA and open-source AI frameworks and libraries.
- Partner with NVIDIA’s software engineering, product, and sales teams to secure design wins and drive development of innovative solutions based on customer feedback.
Requirements
- BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related field, or equivalent experience.
- 8+ years as an AI/Software Engineer, with proven coding experience in Python and/or C++ using popular AI libraries and GPUs.
- Experience profiling and optimizing model training/inference on GPUs.
- Experience developing and optimizing GPU kernels for deep learning, with a focus on GEMM and attention kernels.
- Strong communication skills to convey ideas and code via GitHub, documentation, and presentations.
- Team player who collaborates with cross-functional teams including Engineering, Research, Sales, Product, and Marketing.
- Self-starter passionate about growth, continuous learning, and sharing insights.
Ways to Stand Out
- Full-stack experience, from the DL framework level (e.g., PyTorch/JAX) down to lower-level libraries (e.g., CUDA/CUTLASS/cuDNN/NCCL).
- Experience working with enterprise developers and strong customer-facing skills.
- Familiarity with MLOps technologies like containers, Kubernetes, and data center deployments.
- Experience with large-scale production data pipelines and AI model training/deployment.
- Creative problem-solving skills for debugging and resolving complex issues.
The role supports remote work with occasional travel for onsite visits and conferences.
Compensation and Benefits
- Base salary range: $184,000 - $287,500 USD per year, influenced by location and experience.
- Eligible for equity and comprehensive benefits.
NVIDIA is an equal opportunity employer fostering diversity and inclusion in its workforce.