Senior Solution Network Architect - Enterprise AI

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Security @ 7 Ansible @ 4 Cumulus Linux @ 3 Docker @ 4 Grafana @ 4 Kubernetes @ 4 Linux @ 6 Prometheus @ 4 Terraform @ 4 Python @ 6 GitHub @ 6 Datadog @ 4 Distributed Systems @ 4 Leadership @ 4 Bash @ 6 Communication @ 7 Networking @ 4 Performance Monitoring @ 4

Details

Join our NVIDIA Solutions Architecture team to support the Enterprise Products team as a Senior Network Engineer, where your passion and expertise in networking, compute hardware, storage, and cloud-native software will be pivotal. We are looking for a multifaceted professional with a profound understanding of large network design, distributed systems, and datacenter architecture. You will collaborate in a multi-disciplinary approach to craft scalable datacenter implementations for enterprise-grade AI systems and automate repetitive networking activities.

Responsibilities

Own the deployment of scalable datacenter networking for enterprise AI/ML systems.
Deploy and validate cluster designs, optimizing them for enterprise facilities.
Collaborate closely with experts in networking, compute, software, and storage to drive innovation.
Lead multi-disciplinary projects, addressing high-level goals and complex challenges.
Engineer on-premises cloud-native solutions that integrate with diverse cloud providers.
Serve as a pivotal contributor in compute and hardware architecture domain.
Demonstrate multidisciplinary understanding of Ethernet, InfiniBand, datacenter LAN, WAN, and software-defined networks.
Conduct TCO analysis and optimize datacenter efficiency for cost-effectiveness.
Identify operational improvements and collaborate with teams to build solutions improving network operations and sustainability.

Requirements

Bachelor's degree or equivalent experience, with 8-10+ years in hardware or infrastructure architecture.
Proven expertise designing and deploying on-prem cloud-native platforms; deep understanding of scaling and resilience at chassis, rack, cluster, and datacenter levels.
In-depth knowledge of networking protocols and technologies: Ethernet, TCP/IP, VLAN, VXLAN, BGP, EVPN, MPLS, QoS, and InfiniBand.
Extensive experience with optical networking and cabling, fiber types, and transceiver modules (SFP/SFP+, QSFP, OSFP), including signal modulation, FEC, and multi-platform compatibility.
Strong grasp of cloud-native systems focused on high availability, scalability, and security; demonstrated system-level thinking to enhance reference designs.
Hands-on experience with infrastructure-as-code and monitoring tools: Base Command Manager (BCM), Ansible, Terraform, Grafana, Prometheus.
Proficient with Linux (including Cumulus OS) and scripting languages such as Python and Bash.
Familiarity with NVIDIA networking products including Mellanox switches, Cumulus Linux, BlueField DPUs, and InfiniBand technologies.
Demonstrated leadership in cluster design (networking, security, remote access management); experienced working independently and with distributed teams across time zones.
Strong written and verbal communication skills; capable of creating documentation such as Methods of Procedure (MoPs) and deployment guides.

Ways to stand out

Broad experience across Networking, Compute, Storage, and Platform Sizing, with focus on Infrastructure Cost Optimization and TCO analysis.
Strong understanding of network topologies, load balancing, and congestion control; experience with standards and open-source communities.
Proficient in Python with a personal GitHub showcasing relevant projects. Skilled in Kubernetes, Docker, and performance monitoring tools such as Grafana, Prometheus, and Datadog.
Hands-on experience with networking simulators including NVIDIA Air, GNS3, and EVE-NG for digital twin and virtual network testing.
Strong collaboration, communication skills, and an accountable work ethic.

Compensation & Benefits

Base salary (location, experience, and level dependent):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
Eligible for equity and company benefits.

Additional details

Applications accepted at least until September 21, 2025.
NVIDIA is an equal opportunity employer and values diversity. The company does not discriminate on the basis of protected characteristics.