Senior Solution Network Architect - Enterprise AI
at Nvidia
π Santa Clara, United States
USD 184,000-356,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 7 Ansible @ 4 Cumulus Linux @ 3 Docker @ 4 Grafana @ 4 Kubernetes @ 4 Linux @ 6 Prometheus @ 4 Terraform @ 4 Python @ 6 GitHub @ 6 Datadog @ 4 Distributed Systems @ 4 Leadership @ 4 Bash @ 6 Communication @ 7 Networking @ 4 Performance Monitoring @ 4Details
Join our NVIDIA Solutions Architecture team to support the Enterprise Products team as a Senior Network Engineer, where your passion and expertise in networking, compute hardware, storage, and cloud-native software will be pivotal. We are looking for a multifaceted professional with a profound understanding of large network design, distributed systems, and datacenter architecture. You will collaborate in a multi-disciplinary approach to craft scalable datacenter implementations for enterprise-grade AI systems and automate repetitive networking activities.
Responsibilities
- Own the deployment of scalable datacenter networking for enterprise AI/ML systems.
- Deploy and validate cluster designs, optimizing them for enterprise facilities.
- Collaborate closely with experts in networking, compute, software, and storage to drive innovation.
- Lead multi-disciplinary projects, addressing high-level goals and complex challenges.
- Engineer on-premises cloud-native solutions that integrate with diverse cloud providers.
- Serve as a pivotal contributor in compute and hardware architecture domain.
- Demonstrate multidisciplinary understanding of Ethernet, InfiniBand, datacenter LAN, WAN, and software-defined networks.
- Conduct TCO analysis and optimize datacenter efficiency for cost-effectiveness.
- Identify operational improvements and collaborate with teams to build solutions improving network operations and sustainability.
Requirements
- Bachelor's degree or equivalent experience, with 8-10+ years in hardware or infrastructure architecture.
- Proven expertise designing and deploying on-prem cloud-native platforms; deep understanding of scaling and resilience at chassis, rack, cluster, and datacenter levels.
- In-depth knowledge of networking protocols and technologies: Ethernet, TCP/IP, VLAN, VXLAN, BGP, EVPN, MPLS, QoS, and InfiniBand.
- Extensive experience with optical networking and cabling, fiber types, and transceiver modules (SFP/SFP+, QSFP, OSFP), including signal modulation, FEC, and multi-platform compatibility.
- Strong grasp of cloud-native systems focused on high availability, scalability, and security; demonstrated system-level thinking to enhance reference designs.
- Hands-on experience with infrastructure-as-code and monitoring tools: Base Command Manager (BCM), Ansible, Terraform, Grafana, Prometheus.
- Proficient with Linux (including Cumulus OS) and scripting languages such as Python and Bash.
- Familiarity with NVIDIA networking products including Mellanox switches, Cumulus Linux, BlueField DPUs, and InfiniBand technologies.
- Demonstrated leadership in cluster design (networking, security, remote access management); experienced working independently and with distributed teams across time zones.
- Strong written and verbal communication skills; capable of creating documentation such as Methods of Procedure (MoPs) and deployment guides.
Ways to stand out
- Broad experience across Networking, Compute, Storage, and Platform Sizing, with focus on Infrastructure Cost Optimization and TCO analysis.
- Strong understanding of network topologies, load balancing, and congestion control; experience with standards and open-source communities.
- Proficient in Python with a personal GitHub showcasing relevant projects. Skilled in Kubernetes, Docker, and performance monitoring tools such as Grafana, Prometheus, and Datadog.
- Hands-on experience with networking simulators including NVIDIA Air, GNS3, and EVE-NG for digital twin and virtual network testing.
- Strong collaboration, communication skills, and an accountable work ethic.
Compensation & Benefits
- Base salary (location, experience, and level dependent):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
- Eligible for equity and company benefits.
Additional details
- Applications accepted at least until September 21, 2025.
- NVIDIA is an equal opportunity employer and values diversity. The company does not discriminate on the basis of protected characteristics.