Used Tools & Technologies
Go
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Software Development @ 7
Docker @ 4
Grafana @ 4
Kubernetes @ 4
Prometheus @ 4
Python @ 4
Distributed Systems @ 4
Helm @ 4
React @ 4
Node.js @ 4
Rust @ 4
API @ 4
CUDA @ 4
GPU @ 4
AI @ 4
Slurm @ 4
HPC @ 4
Details
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.
Responsibilities
- Gather use cases and requirements, translate those into software roadmaps, and execute those roadmaps across internal NVIDIA teams and external partners.
- Report project status, risks, help needed, and roadmap pivots to internal and external executives via status reports and in-person meetings.
- Broker technical discussions between highly technical subject matter experts.
- Leverage AI tools and workflows to quickly iterate on designs, prototypes, documentation, tests, and code.
- Architect distributed, robust, and scalable GoLang and Rust system software, deployed to monitor and manage large datacenters.
Requirements
- BS or higher in Computer Science or equivalent experience; 15+ years of meaningful industry experience with a strong scalable system software development background.
- Development experience with Rust, Python, and/or GoLang.
- Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment.
- Experience with APIs and interface design.
- Experience with AI tools and development workflows.
- Outstanding written and verbal communication and interpersonal skills; business-level English.
- Ability to manage time in a fast-paced, heavily multitasked environment and to quickly understand unfamiliar technical domains, identify core problems, and translate ambiguous requirements into actionable engineering plans.
- Skilled at producing clear technical documentation, design docs, and status updates that keep cross-functional partners aligned.
- Track record of identifying process inefficiencies and introducing automation, tooling, or AI-powered workflows that measurably improve team output.
Preferred / Ways to stand out
- Development experience in relevant coding languages like GoLang and Rust.
- Experience with SCADA or data center power related software.
- Background with containers (e.g. Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks using tools such as Prometheus, Loki and Grafana.
- Experience with modern UI development in React and Node.js or similar frameworks.
- Experience developing Kubernetes operators or Helm charts.
- Experience with HPC job schedulers like Slurm or Run.AI and familiarity with Kubernetes internals.
- Exposure to GPU programming with CUDA.
Compensation & Benefits
- Base salary range: 272,000 USD - 431,250 USD (determined based on location, experience, and pay of employees in similar positions).
- Eligible for equity and benefits.
Other information
- Applications for this job will be accepted at least until April 26, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.