Senior System Software Engineer - Performance

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 7 Leadership @ 4 Performance Optimization @ 7 Data Analysis @ 7 Debugging @ 6 CUDA @ 3 GPU @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. The Tegra system-on-chip (SoC) software organization is seeking a highly motivated technical engineer to work on ARM software ecosystem and system software architecture with a charter to enable datacenter-scale performance and efficiency for next-generation datacenter products, including CPUs and CPU+GPU Superchips. Work will span both pre-silicon and post-silicon phases.

Responsibilities

  • Design, develop, test, and optimize software for next-generation SoCs in both pre-silicon and post-silicon phases.
  • Review architectural performance bottlenecks for system-wide workloads and identify HW/SW policies to drive performance and performance-per-watt leadership.
  • Produce and present architecture and analysis documents; communicate technical findings to internal and external audiences.
  • Perform competitive analysis comparing microarchitecture and workload performance metrics on NVIDIA ARM SoCs against other vendors' processors.
  • Influence and drive full-stack adoption of performance optimizations and best practices across NVIDIA software products and open-source SDKs.

Requirements

  • BS or MS in Computer Engineering, Computer Science, or a related degree (or equivalent experience).
  • 6+ years of relevant computer architecture or software development experience.
  • Proven leadership skills and strong ownership on past projects.
  • Hands-on technical experience with complex software and hardware designs.
  • Strong understanding of multicore hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers, and real-time programming.
  • Strong skills in performance analysis, data analysis, and performance optimization.

Preferred / Ways to stand out

  • Deep expertise in ARM architecture and the ARM software ecosystem.
  • Proficient in analyzing, debugging, and tuning performance of complex system software stacks.
  • Experience with CPU server system workloads and performance analysis.
  • Familiarity with CUDA programming and/or GPUs.
  • Experience with HPC or large-scale computing environments.

Compensation & Benefits

  • Base salary ranges (dependent on level, location, and experience):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and company benefits (see NVIDIA benefits).

Additional information

  • Applications for this job will be accepted at least until August 14, 2025.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.