Used Tools & Technologies
Not specified
Required Skills & Competences ?
Algorithms @ 4 Distributed Systems @ 4 Machine Learning @ 4 Data Science @ 4 Communication @ 7 Microservices @ 4Details
Are you passionate about generative AI and building agentic workflows to solve real problems? Are you interested in learning more about computer hardware architecture? NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our chips and systems power applications within artificial intelligence, computer graphics, autonomous vehicles, robotics, gaming, virtual reality, and high-performance computing.
We are now looking for a senior SW AI architect to help redefine our engineering flows. Come join the NVIDIA hardware architecture team to build agentic systems to improve our future chip designs, or put differently, use AI to design our next-generation AI systems!
Responsibilities
- Serve as an expert in implementing and deploying AI applications based on large language models (LLMs), internal and external agentic frameworks, and custom models.
- Work with hardware architects to identify how to best design, customize, and deploy AI-based solutions to their specific problem domains.
- Collaborate with infrastructure engineers to improve existing automated workflows by incorporating LLMs and establishing best practices for future solutions.
- Develop and optimize retrieval and generation algorithms for enterprise data (text, code, and images) to build advanced AI applications.
- Interact with internal research groups on how to solve complex chip design problems in new ways by leveraging machine learning (ML) and deep learning (DL).
- Research emerging AI technologies and engineering best practices to continuously evolve our development ecosystem and maintain a competitive edge.
Requirements
- MSc or PhD in Data Science, Computer Science/Engineering, Electrical Engineering, or equivalent experience.
- 5+ years of industry or research experience.
- Deep practical knowledge of LLMs, DL/ML, and agent development.
- Well versed in agentic literature and eager to continue learning.
- Strong background in implementing AI solutions to solve real-world engineering problems.
- Experience with training/fine-tuning custom models, building multi-agent systems, retrieval augmented generation (RAG) pipelines, and vector databases.
- Strong analytical, communication, and interpersonal skills.
Ways to stand out
- Background in computer architecture or hardware development.
- Good understanding of distributed systems and microservice architecture.
- Hands-on experience with NVIDIA Inference Microservices (NIMs).
Compensation & Benefits
- Base salary ranges (dependent on level and location):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
- You will also be eligible for equity and benefits (see NVIDIA benefits page).
Additional information
- Location: Santa Clara, CA (US). #LI-Hybrid
- Employment type: Full time.
- Applications for this job will be accepted at least until September 22, 2025.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.