Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 3 System Administration @ 6 Ansible @ 3 Docker @ 3 Kubernetes @ 3 Linux @ 6 Vault @ 3 DevOps @ 6 Python @ 3 Leadership @ 6 AWS @ 3 Communication @ 6 Helm @ 3 Networking @ 3 SRE @ 6 API @ 3 JWT @ 3 Cassandra @ 3 PKI @ 3Details
NVIDIA is seeking a Software Infrastructure Engineer to develop, validate, and direct software security requirements, implementations, and integrations within the Cloud Data Services team. The role focuses on building and operating high-performance, large-scale infrastructure to support cloud-delivered services and gaming workloads.
Responsibilities
- Build, develop, and maintain infrastructure to support high-performance, large-scale applications and services.
- Operate and run hundreds of distributed database instances deployed across 13 regions.
- Implement and lead deployment automation processes for platforms including databases, proxies, applications, and containers.
- Set up security solutions protecting database access, data storage, backups, and restore processes.
- Build and support automation tools that configure NVIDIA's security systems implementing JWT, OIDC, PKIs, policies, and database secrets.
- Develop and maintain comprehensive documentation detailing the configurations of deployed solutions.
- Optimize processes and flows to streamline new service deployments, regular maintenance, and replacement tasks.
- Mentor and influence teammates; communicate technical direction effectively across varied teams.
Requirements
- BS in Computer Science or Information Systems, or equivalent experience.
- 8+ years in large-scale systems engineering roles with exposure to DevOps and SRE or equivalent experience.
- Hands-on experience with DevOps tooling such as Python and Ansible.
- Experience with containers and container tooling (Docker/Containers) and container orchestration workflows.
- Experience with GitOps or similar workflows (Argo CD mentioned explicitly).
- Proven Linux system administration expertise (Ubuntu/Debian preferred) with a strong emphasis on security.
- Understanding of AWS technologies (EC2, ELB, ECS, RDS, API Gateway, WAFs, Lambda) and provisioning automation.
- Networking knowledge (VPC, Subnets, Route Tables, Internet Gateways, NATs, etc.).
- Experience with HashiCorp Vault and PKI certificate chains.
- Competence in measuring system performance and implementing iterative improvements.
- Strong communication skills and demonstrated mentorship/leadership within teams.
Nice to have / Ways to stand out
- Operational experience running HashiCorp Vault integrated into deployment processes and application security (encryption, key management, DLP, HSMs).
- Experience running large Apache Cassandra clusters and designing scalable Cassandra schemas and integrations.
- Experience launching and running Kubernetes clusters in cloud environments and managing deployments with Helm and related tooling.
- Demonstrated initiative and ability to work independently.
Compensation & Benefits
- Base salary ranges (location, level, and experience dependent):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
- Eligible for equity and company benefits. (See NVIDIA benefits page for details.)
Location & Application
- Location: Santa Clara, California, United States.
- Employment type: Full time.
- Applications for this job will be accepted at least until September 7, 2025.
NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.