Distinguished Software Architect - Deep Learning And HPC Communications
at Nvidia
📍 Germany
USD 100,000-180,000 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 6 Algorithms @ 7 TensorFlow @ 7 Communication @ 4 Networking @ 4 Parallel Programming @ 4 Performance Optimization @ 7 Debugging @ 7 PyTorch @ 7 GPU @ 4Details
NVIDIA leads in groundbreaking developments in Artificial Intelligence, High Performance Computing (HPC), and Visualization. Their GPU innovations serve as the visual cortex of modern computers, powering AI, autonomous cars, and scientific discovery.
Responsibilities
- Research new communication technologies (e.g., expand the GPUDirect technology portfolio) and design new features for GPU communication libraries.
- Propose innovative hardware and software solutions for next-gen platforms, co-designing with GPU, Networking, and Software architects.
- Inspire changes based on quantitative data from proof-of-concepts or technical analysis/modeling.
- Drive adoption of new communication technologies across application verticals.
- Collaborate with diverse teams including deep learning researchers and customers.
Requirements
- PhD in Computer Science, Computer Engineering, or related field, or equivalent experience.
- 15+ years of relevant experience in academia or industry.
- Expertise in HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
- Deep understanding of high performance networking: network technologies (Infiniband, Ethernet), design, topologies, debugging, and performance analysis.
- Strong skills in ML/DL fundamentals related to communications, parallel algorithms, fault tolerance, resiliency, performance optimization on large clusters, and DL frameworks (PyTorch, TensorFlow).
- Programming fluency with C or C++ for systems software development.
- Ability to work and communicate effectively across hardware/software teams and multiple time zones.
Ways To Stand Out
- Industry recognized leader in HPC/DL communications with patents, publications, and conference talks.
- Influential role in industry standards (MPI, OpenSHMEM) and open source software (PyTorch, UCX, Open MPI).
NVIDIA is recognized as a leading tech employer with a creative and autonomous culture, inviting talented professionals to join them.