Senior Infrastructure Software Engineer, Deep Learning Libraries
Required Skills & Competences
- Software Development @ 4
- System Administration @ 4
- Docker @ 4
- Jenkins @ 4
- Kubernetes @ 4
- Linux @ 4
- Python @ 3
- Distributed Systems @ 4
- LLM @ 4
- Cloud Computing @ 4
- GPU @ 4
Details
NVIDIA’s Deep Learning Libraries Group is seeking a Senior Infrastructure Software Engineer to enable the next wave of NVIDIA’s highest-performing deep learning libraries. The role spans multiple products, including TensorRT, TensorRT-LLM, and cuDNN. The mission is to design and develop scalable, modular infrastructure that streamlines development, builds, and testing across NVIDIA’s diverse set of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models.
Responsibilities
- Design and develop software for testing and analysis of codebases.
- Build scalable automation for build, test, integration, and release processes for publicly distributed deep learning libraries.
- Develop throughout the software stack, from the user experience down to the cluster and database layers.
- Configure, maintain, and build upon deployments of industry-standard tools (e.g. Kubernetes, Jenkins, Docker, CMake, GitLab, etc.).
- Enable new platforms, which may include preparing hardware for testing and bringing those platforms into automated testing.
Requirements
- BS or higher degree in Computer Science or Computer Engineering, or equivalent experience.
- 5+ years of relevant experience.
- Strong familiarity with Python (or similar) and experience with building C/C++ codebases.
- System administration experience maintaining both Linux and Windows systems.
- Experience setting up, maintaining, and automating continuous integration systems.
- A pragmatic approach to problem solving and collaboration.
Ways to stand out / Preferred Qualifications
- Experience designing and developing automation in Jenkins with Groovy (or similar).
- Background with distributed systems and cluster/cloud computing, especially with Kubernetes.
- Knowledge of GPU computing systems.
- Experience with mobile/embedded platforms and multiple operating systems (Ubuntu, RedHat, Windows, QNX, or similar).
- Track record of identifying useful new technologies and incorporating them into software development flows.
Compensation & Benefits
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.
Additional Information
Applications for this job will be accepted at least until September 21, 2025.
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.