Senior Systems Software Engineer, Tao Software And API

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 4 Ansible @ 6 Docker @ 6 Jenkins @ 6 Kubernetes @ 6 Terraform @ 6 Python @ 7 GCP @ 4 CI/CD @ 4 Machine Learning @ 4 Hiring @ 4 AWS @ 4 Azure @ 4 Debugging @ 4 API @ 4 QA @ 4

Details

We are the NVIDIA-Metropolis TAO (Train-Adapt-Optimize) group seeking a Senior Systems Software Engineer to join the TAO Software and API team. The TAO toolkit offers scalable, user-friendly modules for training, fine-tuning, and optimizing machine learning models, especially in Computer Vision and Multi-Modal AI.

Responsibilities

  • Develop and maintain advanced machine learning software and frameworks, optimizing for performance and scalability.
  • Enhance CI/CD pipelines to streamline development, testing, and deployment of large-scale machine learning models.
  • Implement and manage cloud infrastructure for continuous integration, delivery, and deployment with high availability and scalability.
  • Collaborate with engineering, QA, and research teams to improve workflows and software delivery quality.
  • Troubleshoot and resolve complex issues in software development, containerization, and cloud infrastructure within production environments.
  • Write and maintain documentation for development and deployment processes.
  • Communicate effectively with technical and non-technical stakeholders regarding release and deployment expectations.
  • Lead code reviews, testing, and debugging to ensure high-quality code.
  • Mentor junior engineers to foster professional growth and enhance team capabilities.

Requirements

  • Bachelor’s or master’s degree or equivalent experience in Computer Science, Information Systems, Engineering, or related fields.
  • 7+ years of software engineering experience, especially with CI/CD, cloud infrastructure, and advanced machine learning frameworks.
  • Proficiency with automation and orchestration tools such as Docker, Kubernetes, Jenkins, Terraform, Ansible, or similar.
  • Experience with cloud platforms like AWS, Azure, or GCP.
  • Strong programming skills in Python and/or other relevant languages.
  • Experience developing and deploying scalable software solutions.
  • Strong analytical and problem-solving skills with practical and scalable focus.
  • Ability to work collaboratively and manage multiple tasks.
  • Familiarity with version control systems and configuration management.

Ways to Stand Out

  • Demonstrated rapid learning and adaptation to new technologies.
  • Strong academic record including competitive programming achievements.
  • Extensive experience with advanced AI tools and frameworks, including LLMs and NVIDIA Blueprints.
  • Contributions to open-source projects showing a collaborative and innovative mindset.
  • Experience deploying machine learning models on edge devices or platforms.

Benefits

Competitive salary range: $184,000 - $356,500 USD annually, depending on location, experience, and peer compensation.
Eligibility for equity and benefits.

NVIDIA embraces diversity and is committed to equality in hiring and promotion.