Senior HPC Support Engineer - InfiniBand And Nvlink

at Nvidia
USD 108,000-201,200 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Marketing @ 4 System Administration @ 4 Linux @ 4 Python @ 4 AWS @ 4 Bash @ 4 Networking @ 4 Debugging @ 4 Customer Support @ 6 ChatGPT @ 4 GPU @ 4

Details

We are seeking a motivated Senior HPC Support Engineer focusing on InfiniBand and NVLink technology, passionate about data center and networking technologies, to provide comprehensive solutions for sophisticated installations, maintenance, or operations for a broad scope of groundbreaking networking products. As a primary point of contact for our customers; assisting them with technical questions, debugging and resolving their issues. As a member of our Technical Support team, you are a conscientious, proficient communicator who is fundamentally interested in taking ownership in resolving issues, while ensuring a high level of customer satisfaction is maintained and delivered. Significant part of the role is also to interact with Engineering, Marketing, and Support teams regularly on technical issues.

Responsibilities

  • Resolve sophisticated customer concerns and technical issues through meticulous research, reproduction, and solving problems for customers installing products and supporting systems using Linux OS (Multi-distro), focusing on NVIDIA InfiniBand, NVLink and GPU Technology and End-to-End Solutions.
  • Respond to customer product support inquiries via telephone, email, or conference calls.
  • Resolve customer issues during installation, operation, maintenance or product application or interoperability with other vendors.
  • Participate in multi-functional team meetings and provide feedback to engineering and marketing regarding product requirements, customer experience, support tools, etc.
  • Develop, re-define and document standard methodologies to share with internal teams for support processes and improvements.
  • Conduct site visits and conference calls with customers.

Requirements

  • 5+ years providing in-depth customer support and debugging for hardware and software products.
  • Exceptional interpersonal skills to maintain and own resolution of critical issues raised by customers.
  • Linux OS including System Administration and Networking on LFCS/RHCSA level.
  • Networking Technology, protocols and routing including IP, L2 and L3 on CCNP/CompTIA Networking+ and Cloud+ level.
  • Containerized solutions experience at DCA and/or CKA level, Virtualization (KVM/ESXi), and Cloud Infrastructure (AWS/OCI) technologies.
  • Debug networking protocols using tools such as TCPDUMP and Wireshark or similar tools.
  • Bash/Python scripting abilities.
  • Strong organizational and multitasking skills with limited supervision.
  • Integrate AI tools (Cursor, Gemini, ChatGPT, Copilot, Glean, etc.) into daily workflow.
  • Four-year degree in Computer Science, or Electrical or Computer Engineering, or equivalent experience.

Ways to Stand Out

  • NVIDIA Certifications related to AI Infrastructure, Operations and Networking.
  • Experience with InfiniBand, RDMA, NVLink, and NVIDIA GPU Technology.
  • Knowledge of Clustering or HPC Data-Center technologies including Upper Layer Protocols (MPI, NCCL).
  • Additional OS knowledge such as Microsoft Windows, VMware, Unix.
  • Configuration and operational expertise with traditional network switches/routers and Open platforms.

Compensation

  • Base salary range: 108,000 USD - 201,250 USD, determined by location, experience, and peer pay.
  • Eligible for equity and benefits.

NVIDIA is committed to diversity and equal employment opportunity across all protected characteristics.