Used Tools & Technologies
Not specified
Required Skills & Competences ?
ElasticSearch @ 4 Grafana @ 4 Linux @ 4 Prometheus @ 4 Kibana @ 4 Python @ 4 Machine Learning @ 4 Bash @ 4 Communication @ 7 Mathematics @ 4 Networking @ 4 Debugging @ 4 Splunk @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4Details
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Responsibilities
- Provide engineering solutions to enable deployment of world-class GPU computing products at scale, lead technical relationships with engineering teams, and assist system administrators, software and hardware engineers, and machine learning/deep learning engineers in building creative solutions.
- Lead aspects of performance analysis and scalable practices to support large scale infrastructure, delivering powerful tools, methodologies, and workflows to validate expectations.
- Deliver engineering solutions to deliver continuous insights into performance of AI workloads over evolving environments, generating quick insights to improvements and regressions over time.
- Decompose multi-faceted issues into minimal reproduction cases, working towards final root cause of underlying problems.
- Participate and engage with multiple team members to develop best practices for understanding trends in test results and presenting data clearly to develop data-driven actions.
Requirements
- 5+ years of experience running multinode workloads and identifying bottlenecks and implementing improvements.
- Proven understanding of high-performance computing based architectures and GPU accelerated computing software stacks and deep learning frameworks (CUDA, PyTorch).
- Experience with CPU architectures.
- Experience with C/C++, Python, Bash programming/scripting.
- Strong teamwork and communication skills.
- Ability to multitask in a dynamic environment.
- Action driven with strong analytical skills.
- BS in Engineering, Mathematics, Physics, or Computer Science; MS or PhD desirable (or equivalent experience).
Ways to Stand Out From the Crowd
- Experience tuning memory, storage, and networking settings for performance on Linux systems.
- Knowledge of modern cloud and container-based architectures.
- Hands-on experience deploying and debugging systems with NVIDIA NVLink and Infiniband.
- Experience with multiple monitoring stacks such as Prometheus+Grafana, Elasticsearch+Kibana, Splunk, Zabbix, etc.
- Demonstrated work with open-source software: building, debugging, patching, and contributing code.
Benefits
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you’re creative and autonomous, with a genuine passion for technology, we want to hear from you.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.