Senior Software Engineer, Fleet Management - DGX Cloud
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Docker @ 4 Go @ 6 Kubernetes @ 4 Linux @ 3 Python @ 6 GCP @ 7 Java @ 6 Distributed Systems @ 4 Hiring @ 4 AWS @ 7 Azure @ 7 Communication @ 4 JavaScript @ 3 PostgreSQL @ 4 Next.js @ 3 React @ 3 Angular @ 3 Debugging @ 7 API @ 4 Cloud Computing @ 4 GPU @ 4Details
NVIDIA is widely recognized as one of the most desirable employers, with some of the most talented people in the world working for us. If you're passionate about building scalable, efficient systems to power cloud operations, we invite you to join our team.
We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a critical role in designing scalable cloud services that integrate with diverse systems including GPU telemetry in datacenters, and enabling operational automation across global cloud operations.
Responsibilities
- Design and develop RESTful APIs to ingest telemetry from AI datacenters.
- Build scalable cloud services for high-volume ingestion, processing, and storage of large datasets.
- Build and manage data pipelines for online and offline data storage.
- Collaborate across teams to codify business processes into scalable, self-measuring systems.
- Optimize the reliability and efficiency of cloud services and operations.
- Lead and ship impactful technical projects, ensuring quality and scalability at every stage.
Requirements
- At least 6+ years of industry experience with a Bachelor’s degree (or equivalent experience); Master’s degree preferred.
- Expertise in building scalable REST APIs backed by PostgreSQL-compatible data stores.
- Proficiency in programming languages such as Go, Java, or Python.
- Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).
- Strong understanding of cloud infrastructure (AWS, GCP, Azure, etc.) and container technologies like Docker and Kubernetes.
- Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.
- Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.
- A passion for delivering scalable and efficient cloud services.
- Familiarity with Linux operating systems.
Ways to Stand Out
- A track record of delivering and managing high-performance cloud services at Internet scale.
- Experience operating NVIDIA datacenter GPUs.
- Strong debugging and problem-solving skills in distributed environments.
NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on cutting-edge technology that powers the future of AI and cloud computing.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.
You will also be eligible for equity and benefits. Applications for this job will be accepted at least until September 29, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.