Senior Software Engineer, AI Agent Runtime and Open Source Infrastructure
at NVIDIA
USD 224,000-431,250 per year
Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Security @ 4
Docker @ 4
Kubernetes @ 4
Linux @ 4
TypeScript @ 4
CI/CD @ 4
Communication @ 4
JavaScript @ 4
Mentoring @ 3
Node.js @ 4
Rust @ 4
Debugging @ 4
LLM @ 3
macOS @ 4
GPU @ 4
AI @ 4
Agentic AI @ 4
Details
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. The NemoClaw team works at the forefront of agentic AI, developer infrastructure, and runtime security, composing open-source stacks that allow developers to harness always-on agents in secure, efficient environments.
Responsibilities
- Build and implement production-grade features across NemoClaw, focusing on onboarding flows, policy controls, inference routing, and sandbox lifecycle.
- Develop and sustain secure agent runtime infrastructure, ensuring strong network policy administration, credential management, and failure recovery.
- Engage in daily open-source workflows: author pull requests, conduct technical reviews, address issues, write tests, and contribute to documentation.
- Use AI-assisted development tools to improve the engineering loop, while applying rigorous verification and security measures.
- Develop tools, test harnesses, automation scripts, and CI/CD workflows to boost team efficiency.
- Diagnose complex failures across various platforms and environments, including TypeScript/Node.js, containers, Linux, macOS, WSL, and GPU-backed systems.
- Collaborate with internal teams and external communities, including OpenShell and AI platform partners.
Requirements
- BS, MS, or equivalent experience in Computer Science, Software Engineering, or a related technical field.
- 12+ years of experience in developing and managing production software systems, developer infrastructure, or open-source platforms.
- Strong systems engineering fundamentals with a proven track record of solving multifaceted problems.
- Skilled in at least one prominent programming language and capable of rapidly learning TypeScript, JavaScript, Node.js, and Rust.
- Comfort working in large codebases, with experience in reading unfamiliar code, conducting detailed reviews, and improving maintainability.
- Demonstrated experience with open-source practices, including managing tasks, pull requests, code reviews, and public technical discussions.
- Experience with AI-supported development tools and a solid understanding of validating generated code.
- Security-conscious engineering approaches, particularly concerning secrets management, sandboxing, and network policy enforcement.
- Solid testing, continuous integration and delivery, and debugging abilities, with the capability to replicate failures, determine root causes, and clearly convey results.
- Excellent written and verbal communication skills, capable of explaining technical concepts to diverse audiences.
Ways to stand out
- Contributions to open-source developer infrastructure, AI tooling, or large public software projects.
- Hands-on experience with AI coding agents, workflow automation, or multi-agent systems.
- Experience with containers and Linux isolation technologies including Docker, Kubernetes, and network policy management.
- Demonstrated experience in developing dependable CI, comprehensive validation, and test infrastructure for dynamic software.
- Familiarity with LLM inference, GPU-backed workloads, or performance-sensitive AI infrastructure.
- Demonstrated ability to elevate engineering standards through thoughtful reviews, clear documentation, and effective mentoring.
Compensation & Other
- Base salary range depends on location and level: 224,000 USD - 356,500 USD for Level 5; 272,000 USD - 431,250 USD for Level 6.
- Eligible for equity and benefits.
- Applications accepted at least until May 15, 2026.
- NVIDIA uses AI tools in its recruiting processes and is an equal opportunity employer.