Principal Datacenter New Product Introduction Architect - DGX Cloud
at NVIDIA
Santa Clara, United States
USD 272,000-425,500 per year
Required Skills & Competences
DevOps @ 4, Distributed Systems @ 7, Hiring @ 4, Communication @ 7, Mathematics @ 4, SRE @ 4, Planning @ 7
Details
NVIDIA is hiring engineers to scale up New Product Introduction (NPI) for its AI Infrastructure. This role sits on the DGX Cloud Software Team and focuses on architecting, designing, and implementing next-generation DGX cloud clusters, with emphasis on hybrid deployments between cloud and on-prem. The team expects deep understanding of NPIs, distributed systems, software testing and deployment, and strong communication and planning abilities. The position is for engineers who are creative, execution-oriented, and eager to work on infrastructure solutions for large-scale AI applications.
Responsibilities
- Lead technical activities for new product introductions at scale with focus on hybrid deployments between cloud and on-prem.
- Provide expertise in NPI planning spanning infrastructure workflows, hardware, software release, workload orchestration and application tuning.
- Provide fast and creative solutions for complex problems and write clear, reliable architecture specifications.
- Translate requirements into vision, architecture and roadmap.
- Work with engineering teams across NVIDIA to ensure new product architectures integrate seamlessly from hardware up to AI training applications.
Requirements
- 15+ years of overall experience.
- BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics) or equivalent experience.
- Proven track record of impactful project deliveries focused on cloud infrastructure or cloud application services.
- Experience with at-scale infrastructure, DevOps and/or SRE practices and/or Platform Engineering.
- System-level experience with both hardware and software.
- Familiarity with distributed systems, software testing and deployment, workload orchestration, and application tuning.
- Systematic problem-solving approach, strong communication skills, sense of ownership and drive.
- Motivated self-starter able to work concurrently with multiple groups locally and abroad.
Ways to stand out / Preferred
- Experience developing ML/AI infrastructure or bare metal as a service (BMaaS) systems.
- Experience building multi-cloud infrastructure services.
- Strong interest in, and experience with, crafting, analyzing and fixing large-scale distributed systems.
Compensation & Benefits
- Base salary range: 272,000 USD - 425,500 USD (final base salary determined based on location, experience, and pay of employees in similar positions).
- Eligible for equity and company benefits (link provided in original posting).
Other information
- Applications for this job will be accepted at least until September 14, 2025.
- NVIDIA is an equal opportunity employer and values diversity in its workforce.