Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 8 Ceph @ 3 Chef @ 3 Docker @ 6 ElasticSearch @ 4 Kafka @ 3 Kubernetes @ 3 MySQL @ 4 Python @ 7 SQL @ 4 Java @ 7 NoSQL @ 4 Algorithms @ 7 Distributed Systems @ 4 Machine Learning @ 7 Git @ 3 MongoDB @ 4 OpenStack @ 3 API @ 7 Hadoop @ 3 Puppet @ 3 Cassandra @ 4 Agile @ 4Details
NVIDIA is seeking an AI Solutions Architect to join its Infrastructure Planning and Process Team. This role focuses on scaling key AI solutions for NVIDIA's internal cloud infrastructure. IPP is a global organization collaborating with teams across Graphics Processors, Mobile Processors, Deep Learning, AI, and Driverless Cars to support infrastructure needs, managing a cloud with nearly half a million automated jobs daily on 5,000 servers.
Responsibilities
- Serve as an architect developing internal AI systems used globally within NVIDIA.
- Identify gaps and issues in tools and resolve whether AI or conventional solutions are appropriate.
- Research and evaluate "buy vs build" options for AI systems.
- Align with various teams to establish and break down AI system goals into objectives.
- Mentor and motivate sub-system leads for agile improvements.
- Identify bottlenecks and optimize AI development and testing system speed and cost efficiency.
- Plan software/hardware capacity across internal and public cloud.
- Introduce technologies for massively parallel systems to improve turnaround times.
- Collaborate with AI product vendors to gain industry insights.
Requirements
- BS in EE/CS or equivalent with 10+ years in systems software development; at least 1 year in AI development/exploration.
- Experience with Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Fine-Tuning LLMs, AI Agentic workflows, LangChain, LangGraphs, Cascading models.
- Experience deploying hybrid, multi-cloud architectures and edge computing.
- Expertise in architecting and shipping large-scale distributed software systems.
- Strong programming skills in Java, Python, Shell scripting; good understanding of distributed systems and REST APIs.
- Experience with SQL/NoSQL databases such as MySQL, Cassandra, MongoDB, or Elasticsearch.
- Proficient with Docker containers, Virtual Machines.
- Familiarity with cloud technologies like OpenStack, Docker, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git, Perforce, JFrog, Kafka.
- Excellent cross-team collaboration skills in multi-national, multi-time-zone corporate environments.
Ways to Stand Out
- MS or PhD in EE/CS.
- Deep knowledge of AI, Machine Learning, Deep Learning algorithms.
- Strong interpersonal skills and experience guiding others.
- Experience with large-scale service-oriented architectures under real-time performance needs.
- Proven track record designing scalable, high-performance software systems focused on hardware cost optimization.
Competitive salaries and a generous benefits package are offered. NVIDIA is an equal opportunity employer committed to diversity and inclusion. The base salary range is $184,000 - $356,500 USD, varying by location and experience. Additional equity and benefits are provided.