Senior Systems Software Engineer, Tao Software And API
at Nvidia
USD 184,000-356,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 4 Ansible @ 6 Docker @ 6 Jenkins @ 6 Kubernetes @ 6 Terraform @ 6 Python @ 7 GCP @ 4 CI/CD @ 4 Machine Learning @ 4 Hiring @ 4 AWS @ 4 Azure @ 4 Debugging @ 4 API @ 4 QA @ 4Details
We are the NVIDIA-Metropolis TAO (Train-Adapt-Optimize) group seeking a Senior Systems Software Engineer to join the TAO Software and API team. The TAO toolkit offers scalable, user-friendly modules for training, fine-tuning, and optimizing machine learning models, especially in Computer Vision and Multi-Modal AI.
Responsibilities
- Develop and maintain advanced machine learning software and frameworks, optimizing for performance and scalability.
- Enhance CI/CD pipelines to streamline development, testing, and deployment of large-scale machine learning models.
- Implement and manage cloud infrastructure for continuous integration, delivery, and deployment with high availability and scalability.
- Collaborate with engineering, QA, and research teams to improve workflows and software delivery quality.
- Troubleshoot and resolve complex issues in software development, containerization, and cloud infrastructure within production environments.
- Write and maintain documentation for development and deployment processes.
- Communicate effectively with technical and non-technical stakeholders regarding release and deployment expectations.
- Lead code reviews, testing, and debugging to ensure high-quality code.
- Mentor junior engineers to foster professional growth and enhance team capabilities.
Requirements
- Bachelor’s or master’s degree or equivalent experience in Computer Science, Information Systems, Engineering, or related fields.
- 7+ years of software engineering experience, especially with CI/CD, cloud infrastructure, and advanced machine learning frameworks.
- Proficiency with automation and orchestration tools such as Docker, Kubernetes, Jenkins, Terraform, Ansible, or similar.
- Experience with cloud platforms like AWS, Azure, or GCP.
- Strong programming skills in Python and/or other relevant languages.
- Experience developing and deploying scalable software solutions.
- Strong analytical and problem-solving skills with practical and scalable focus.
- Ability to work collaboratively and manage multiple tasks.
- Familiarity with version control systems and configuration management.
Ways to Stand Out
- Demonstrated rapid learning and adaptation to new technologies.
- Strong academic record including competitive programming achievements.
- Extensive experience with advanced AI tools and frameworks, including LLMs and NVIDIA Blueprints.
- Contributions to open-source projects showing a collaborative and innovative mindset.
- Experience deploying machine learning models on edge devices or platforms.
Benefits
Competitive salary range: $184,000 - $356,500 USD annually, depending on location, experience, and peer compensation.
Eligibility for equity and benefits.
NVIDIA embraces diversity and is committed to equality in hiring and promotion.