Senior Manager, Network Site Reliability - GeForce Now

at Nvidia

📍 Santa Clara, United States

USD 248,000-396,800 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Security @ 4 Ansible @ 4 Cumulus Linux @ 4 Grafana @ 4 Linux @ 4 Prometheus @ 4 IaC @ 4 Terraform @ 4 GCP @ 7 Distributed Systems @ 3 Leadership @ 7 AWS @ 7 Azure @ 7 Networking @ 8 SRE @ 4 Debugging @ 4 Compliance @ 4

Details

GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader who is dedicated to optimizing network performance and ensuring a smooth user experience. The position focuses on managing Network SRE to streamline network operations, minimize manual tasks, and achieve service level objectives (SLOs). In this position, you will have the opportunity to tackle challenges through active troubleshooting and a commitment to network automation, observability, documentation, and operational excellence.

Responsibilities

Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture of collaboration, accountability, and technical excellence, along with offering mentorship.
Manage the design, implementation, and maintenance of robust and scalable network infrastructure across data centers, cloud environments, and edge locations to ensure consistent connectivity and performance.
Apply proactive reliability engineering techniques to reduce network disruptions and decrease Mean Time to Recovery (MTTR), improving overall service reliability and user satisfaction.
Work closely with Security and Compliance teams to ensure that all network infrastructure meets regulatory standards and internal policies, maintaining a secure operational environment.
Lead initiatives to improve network observability by integrating advanced monitoring and alerting systems, collaborating with multi-functional teams to implement network solutions that support business objectives and enhance user experiences.

Requirements

Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent experience.
12+ years overall experience in host and infrastructure networking.
6+ years in leadership roles managing teams focused on high-performance Software Defined Networking (SDN) solutions.
Strong understanding of networking protocols, with hands-on experience in kernel development and technologies including routing, switching, load balancers, firewalls, VPNs, and cloud platforms such as AWS, GCP, and Azure.
Skilled in Infrastructure as Code (IaC) using automation tools like Ansible and Terraform.
Experience with monitoring and observability tools such as Prometheus, Grafana, and NetBox.
Proven ability to design network architectures for cloud and distributed systems, with practical experience in large-scale configurations and familiarity with SR-IOV, Xen virtualization, and Open Virtual Switch (OVS) or similar SDN technologies.

Ways to stand out

Extensive experience in managing hybrid cloud environments and large-scale distributed systems.
Strong understanding of Site Reliability Engineering (SRE) concepts, including SLAs, SLOs, and incident management best practices.
Proven ability to use operational signals like SNMP, Syslog, and Streaming Telemetry for efficient issue identification and resolution.
Comprehensive knowledge of Open Virtual Switch (OVS) and SR-IOV RDMA.
Experience in debugging and improving code, automating repetitive tasks, and working with Mellanox/Cumulus Linux, Palo Alto firewalls, and Netscaler load balancers.

Compensation and Timing

Base salary range: 248,000 USD - 396,750 USD (base salary determined based on location, experience, and pay of employees in similar positions).
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until August 8, 2025.

Benefits and Other Information

NVIDIA offers a generous benefits package and equity eligibility.
NVIDIA is an equal opportunity employer and does not discriminate on the basis of protected characteristics.