Senior Solutions Architect, Infiniband And Networking Ethernet - NVIS
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Ansible @ 4 Cumulus Linux @ 4 Linux @ 4 Python @ 4 GCP @ 3 CI/CD @ 4 Leadership @ 4 AWS @ 3 Azure @ 3 Mathematics @ 4 Networking @ 4 iOS @ 4 Salt @ 4 GPU @ 4Details
NVIDIA is seeking a Senior Networking (ETH/IB) Solutions Architect to join its NVIDIA Infrastructure Specialist Team. The team supports academic and commercial groups worldwide using NVIDIA products to revolutionize deep learning, data analytics, and data center performance. The role involves working on dynamic customer-focused projects requiring excellent interpersonal skills to interact with customers, partners, and internal teams. The candidate will analyze, define, and implement large-scale networking projects spanning networking, system design, and automation.
Responsibilities
- Build AI/HPC infrastructure for new and existing customers.
- Support operational and reliability aspects of large-scale AI clusters with focus on performance, real-time monitoring, logging, and alerting.
- Engage in and improve the entire lifecycle of services from design through deployment, operation, and refinement.
- Maintain live services by measuring and monitoring availability, latency, and system health.
- Provide feedback to internal teams by documenting bugs, workarounds, and suggesting improvements.
Requirements
- BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.
- Over 5 years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture.
- Proficiency in configuring, testing, and resolving issues in LAN and InfiniBand networks, especially in medium to large-scale HPC/AI environments.
- Advanced knowledge of EVPN, BGP, OSPF, VXLAN protocols.
- Hands-on experience with network switch/router platforms including Cumulus Linux, SONiC, IOS, JunosOS, and EOS.
- Extensive experience delivering automated network provisioning solutions using Ansible, Salt, and Python.
- Ability to develop CI/CD pipelines for network operations.
- Strong focus on customer satisfaction.
- Self-motivated with leadership skills for collaboration with customers and internal teams.
- Strong written, verbal, and listening skills in English.
Ways to Stand Out
- Familiarity with cloud networks such as AWS, GCP, Azure.
- Linux or Networking certifications.
- Experience with high-performance computing architectures and job schedulers like Slurm and PBS.
- Knowledge of cluster management technologies, with bonus credit for BCM (Base Command Manager).
- Experience with GPU-focused hardware and software.
NVIDIA is considered one of the world’s most desirable technology employers, known for forward-thinking and hardworking individuals. Creative and autonomous candidates are encouraged to apply.