Used Tools & Technologies
Not specified
Required Skills & Competences
System Administration @ 3, Linux @ 3, Leadership @ 3, Communication @ 3, Networking @ 3, Planning @ 3, Audit @ 3, Agile @ 3
Details
NVIDIA is seeking a Solutions Architect in Data Center Infrastructure to join the Infrastructure Specialists team. You will lead planning and deployments of AI data centers including power/cooling systems, cabling and network provisioning, bring-up and validation. The role focuses on data center audit, planning, deployment and validation to ensure integrity of NVIDIA platform infrastructure against NVIDIA reference architectures, operational requirements, and industry standards. The infrastructure scope includes architectural systems, power distribution, liquid/air cooling systems, compute, network and cabling (fiber and copper), and telemetry systems.
Responsibilities
- Collaborate with NVIS Datacenter Engineering and other teams to plan and implement data center infrastructure solutions based on NVIDIA Datacenter reference architectures, including power distribution, cooling systems, network architecture, server hardware, and storage systems.
- Plan and manage deployment of rack-scale, liquid-cooled compute and networking hardware systems for AI/HPC workloads in fast-paced environments.
- Conduct pre-deployment planning: review cluster and data center architecture, plan network port mapping and fiber optic cabling BOM, identify risks, train vendors, and find areas for improvement.
- Evaluate customers' and partners' infrastructure design proposals for consistency with industry standards and regulatory requirements; provide recommendations for performance, scalability, and cost-effectiveness.
- Perform testing, troubleshooting, bring-up and validation of compute systems in collaboration with product and engineering teams.
- Establish and enforce quality assurance processes to verify deployments meet specifications and performance benchmarks.
- Drive continuous improvement initiatives to enhance data center efficiency, resilience, and sustainability; streamline processes and automate repetitive tasks where possible.
- Act as a domain expert and mentor for the NVIS team: provide guidance and support, and serve as the point of contact for infrastructure-related inquiries and blocking issues.
- Collaborate and communicate across internal teams, external vendors, and customers to facilitate seamless integration of data center infrastructure solutions.
Requirements
- Bachelor's degree (or equivalent experience) in Engineering, Computer Science, Information Technology, or a related field.
- At least 3 years of experience in enterprise and/or hyperscale data centers with continuous infrastructure deployment experience, preferably for high-density AI/HPC data centers.
- Working experience in data center operations or infrastructure management roles focusing on large-scale data center deployments.
- Strong technical knowledge of the data center stack: power distribution, liquid cooling, servers, networking, storage and pre-deployment planning.
- Demonstrated technical and project leadership in fluid situations and ability to adapt to change.
- Excellent analytical, problem-solving, decision-making, communication and interpersonal skills.
- Strong organization and time management; able to plan, schedule, and organize tasks to meet goals on time.
- Willingness to travel (up to 40%).
- Relevant certifications preferred.
Ways to stand out
- Linux system administration skills.
- Strong knowledge of the whole data center infrastructure stack.
- Flexible, agile, and enjoys solving challenging problems.
Compensation & Benefits
- Base salary ranges by level: Level 3: 120,000 USD - 189,750 USD; Level 4: 148,000 USD - 235,750 USD.
- Eligible for equity and other NVIDIA benefits.
Additional information
- Applications for this job will be accepted at least until September 9, 2025.
- NVIDIA is an equal opportunity employer and values diversity in its workforce.