Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — expert. Able to teach others and familiar with the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Python @ 7
GPU @ 4
AI @ 4
Performance Analysis @ 4
Details
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. As a member of the GeForce NOW cloud team, you will help enable users to play high-quality PC games on various devices without the need for a dedicated gaming PC or console. GeForce NOW is built on NVIDIA GPU technology, proprietary GPU architectures, and software optimizations to deliver efficient, high-quality gaming and AI experiences in the cloud.
Responsibilities
- Architect next-generation cloud infrastructure optimized for AI workloads alongside gaming.
- Perform deep performance and power analysis of GPU/CPU microarchitecture features for AI inference and gaming workloads.
- Deploy, optimize, and benchmark AI/gaming kernels in the cloud across various system configurations.
- Build models and tools to guide platform decisions balancing performance, power, and cost.
- Present findings to cross-functional teams including product, engineering, and executives.
- Collaborate with engineers, architects, and researchers to implement world-class solutions.
- Contribute to the development of sophisticated computing systems that redefine the limits of performance and efficiency.
Requirements
- Bachelor's, Master's, or PhD degree, or equivalent experience, in Computer Engineering, Computer Science, Electrical Engineering, AI, or related fields.
- 8+ years of experience in CPU or GPU performance, working on microarchitecture bottleneck analysis.
- Demonstrated expertise in hardware power and performance analysis and understanding of microarchitecture design trade-offs.
- Experience characterizing and optimizing AI, gaming, and/or cloud workloads, including software and compiler-level optimizations.
- Strong programming skills in C, C++, Python, and scripting languages; hands-on experience configuring, deploying, and running AI models.
- Good understanding of performance analysis methodologies including code instrumentation, sampling, and roofline analysis.
- Problem-solving skills, ability to analyze complex data, form hypotheses, and communicate conclusions concisely to various audiences.
Compensation & Benefits
- Base salary range: 184,000 USD - 287,500 USD (determined based on location, experience, and pay of employees in similar positions).
- Eligible for equity and benefits (link provided in original posting).
Additional Information
- Applications for this job will be accepted at least until February 10, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.