Senior Solution Network Architect - Enterprise AI

at NVIDIA
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 7, Ansible @ 4, Cumulus Linux @ 3, Docker @ 4, Grafana @ 4, Kubernetes @ 4, Linux @ 6, Prometheus @ 4, Terraform @ 4, Python @ 6, GitHub @ 6, Datadog @ 4, Distributed Systems @ 4, Leadership @ 4, Bash @ 6, Communication @ 7, Networking @ 4, Performance Monitoring @ 4

Details

Join our NVIDIA Solutions Architecture team to support the Enterprise Products team as a Senior Network Engineer, where your passion and expertise in networking, compute hardware, storage, and cloud-native software will be pivotal. We are looking for a multifaceted professional with a profound understanding of large network design, distributed systems, and datacenter architecture. You will collaborate across disciplines to craft scalable datacenter implementations for enterprise-grade AI systems and automate repetitive networking activities.

Responsibilities

  • Own the deployment of scalable datacenter networking for enterprise AI/ML systems.
  • Deploy and validate cluster designs, optimizing them for enterprise facilities.
  • Collaborate closely with experts in networking, compute, software, and storage to drive innovation.
  • Lead multi-disciplinary projects, addressing high-level goals and complex challenges.
  • Engineer on-premises cloud-native solutions that integrate with diverse cloud providers.
  • Serve as a pivotal contributor in the compute and hardware architecture domain.
  • Demonstrate multidisciplinary understanding of Ethernet, InfiniBand, datacenter LAN, WAN, and software-defined networks.
  • Conduct TCO analysis and optimize datacenter efficiency for cost-effectiveness.
  • Identify operational improvements and collaborate with teams to build solutions improving network operations and sustainability.

Requirements

  • Bachelor's degree or equivalent experience, with 8-10+ years in hardware or infrastructure architecture.
  • Proven expertise designing and deploying on-prem cloud-native platforms; deep understanding of scaling and resilience at chassis, rack, cluster, and datacenter levels.
  • In-depth knowledge of networking protocols and technologies: Ethernet, TCP/IP, VLAN, VXLAN, BGP, EVPN, MPLS, QoS, and InfiniBand.
  • Extensive experience with optical networking and cabling, fiber types, and transceiver modules (SFP/SFP+, QSFP, OSFP), including signal modulation, FEC, and multi-platform compatibility.
  • Strong grasp of cloud-native systems focused on high availability, scalability, and security; demonstrated system-level thinking to enhance reference designs.
  • Hands-on experience with infrastructure-as-code and monitoring tools: Base Command Manager (BCM), Ansible, Terraform, Grafana, Prometheus.
  • Proficient with Linux (including Cumulus Linux) and scripting languages such as Python and Bash.
  • Familiarity with NVIDIA networking products including Mellanox switches, Cumulus Linux, BlueField DPUs, and InfiniBand technologies.
  • Demonstrated leadership in cluster design (networking, security, remote access management); experienced working independently and with distributed teams across time zones.
  • Strong written and verbal communication skills; capable of creating documentation such as Methods of Procedure (MoPs) and deployment guides.

Ways to stand out

  • Broad experience across Networking, Compute, Storage, and Platform Sizing, with focus on Infrastructure Cost Optimization and TCO analysis.
  • Strong understanding of network topologies, load balancing, and congestion control; experience with standards and open-source communities.
  • Proficient in Python with a personal GitHub showcasing relevant projects. Skilled in Kubernetes, Docker, and performance monitoring tools such as Grafana, Prometheus, and Datadog.
  • Hands-on experience with networking simulators including NVIDIA Air, GNS3, and EVE-NG for digital twin and virtual network testing.
  • Strong collaboration and communication skills, and an accountable work ethic.

Compensation & Benefits

  • Base salary (location, experience, and level dependent):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and company benefits.

Additional details

  • Applications accepted at least until September 21, 2025.
  • NVIDIA is an equal opportunity employer and values diversity. The company does not discriminate on the basis of protected characteristics.