Senior Software Engineer - NIM Platform SDK and Framework

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 4 Docker @ 4 Kubernetes @ 4 Python @ 4 GCP @ 7 Algorithms @ 7 AWS @ 7 Azure @ 7 Helm @ 4 Rust @ 4 Microservices @ 4 API @ 4 LLM @ 4 Compliance @ 4 GPU @ 4 Observability @ 4 AI @ 4 Agentic AI @ 4 vLLM @ 4 Data Pipelines @ 4 TensorRT @ 4 SGLang @ 4

Details

NVIDIA seeks a senior engineer to own and evolve the core NIM Platform SDK and microservice framework that powers NVIDIA Inferencing Microservices (NIM). This is a hands-on, deeply technical role focused on building foundational platform libraries and production-grade microservices for AI inference at scale across cloud, on-prem, and Kubernetes environments.

Responsibilities

  • Develop and advance the inference microservice framework: OpenAI-compatible API endpoints, inference backend integrations (vLLM, SGLang, TensorRT-LLM, Dynamo), middleware, observability instrumentation, and production hardening across cloud, on-prem, and Kubernetes.
  • Architect significant new features in open-source codebases and shepherd them through project acceptance into production.
  • Build and optimize high-performance model download and caching pipelines across multiple cloud storage backends (NGC, HuggingFace, S3, GCS) including parallel transfers and integrity verification for multi-cloud operability.
  • Implement model profile and manifest systems to ensure NIMs are optimized for every NVIDIA GPU platform (profile selection, validation, multi-GPU configuration).
  • Develop and refine cloud microservice patterns: service discovery, health checking, graceful degradation, API gateway integration, and end-to-end request lifecycle management for reliable operation at scale.
  • Produce high-quality code across Python, Rust, and C/C++, and champion practices such as test-driven development, agentic AI-assisted development, and rigorous code review.
  • Mentor teammates and establish engineering standards for container quality, security, and operability.

Requirements

  • BS or MS in Computer Science, Computer Engineering, or related field (or equivalent experience).
  • 8+ years of demonstrated experience developing performant microservice, cloud software, and/or platform infrastructure.
  • Deep technical expertise in cloud-native microservice architecture, including service mesh, API gateways, load balancing, and distributed system patterns.
  • Expertise in high-performance data pipelines with parallel I/O, caching strategies, and integrity verification across distributed storage systems.
  • Solid understanding of containerized application delivery (Docker, Kubernetes, Helm).
  • Understanding of application security principles: secure coding, vulnerability mitigation, secrets management, and supply chain integrity for containerized environments.
  • Strong problem-solving skills grounded in first-principles reasoning.
  • Excellent programming skills in Python and Rust, with strong foundations in algorithms and software engineering principles.

Ways to stand out

  • Direct involvement in open-source inference backends such as vLLM, TRTLLM, or SGLang.
  • Direct involvement in disaggregated serving frameworks like NVIDIA Dynamo.
  • Experience building and operating production microservices at scale.
  • Deep knowledge of multi-cloud deployment strategies across AWS, GCP, Azure, and OCI.
  • Experience operating in regulated, air-gapped, or disconnected environments with strict security and compliance controls.

Compensation & Benefits

  • Base salary range: 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
  • Eligible for equity and company benefits (link provided in posting).

Additional Information

  • Applications accepted at least until April 4, 2026.
  • This posting is for an existing vacancy.
  • NVIDIA uses AI tools in its recruiting processes.
  • NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.