Manager, Development Operations - RAPIDS Data Science
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 3 System Administration @ 3 Docker @ 3 Jenkins @ 3 Linux @ 3 DevOps @ 3 Python @ 3 GitHub @ 3 GitHub Actions @ 3 CI/CD @ 3 Machine Learning @ 3 Data Science @ 3 AWS @ 3 Azure @ 3 Bash @ 3 Communication @ 3 Git @ 3 API @ 3 Compliance @ 3 CUDA @ 3Details
NVIDIA RAPIDS is an open-source software suite that leverages NVIDIA GPUs to accelerate data science and machine learning workflows. It provides libraries and APIs that enable users to perform data manipulation, machine learning, and graph analytics entirely on GPUs. This team supports the Data Science engineering team (including RAPIDS, NeMo Retriever, and NeMo curator), and is responsible for CI/CD systems, infrastructure deployment and maintenance, build systems, and security compliance.
Responsibilities
- Lead a team of DevOps engineers supporting multiple software projects in the data science and AI domain (many open-source).
- Collaborate with build engineers, developers, and management to ensure delivery of high-quality software.
- Take a hands-on approach and work directly with engineers on the team.
- Lead DevOps initiatives including CI/CD, security/legal compliance, and SysAdmin.
- Operate and run infrastructure and development processes; support build and release of CUDA/C++ and Python libraries and containers.
Requirements
- Bachelor of Science in Computer Engineering, Computer Science, or related technical field, or equivalent experience.
- 8+ years overall technical experience primarily related to DevOps, with 3+ years as a team or technical leader.
- Excellent communication and interpersonal skills.
- Detail-oriented and comfortable supporting and prioritizing across multiple teams.
- Experience administering, optimizing, and troubleshooting CI/CD and related tools (including Jenkins, Git, GitHub Actions).
- Linux system administration experience (Ubuntu strongly preferred).
- Knowledge of programming and automation with scripting languages (Bash and Python preferred).
Ways to Stand Out
- Experience with cloud services (AWS, Azure, and others), especially permissions, budget, and cost management.
- Experience with NVIDIA's technology stack, including CUDA toolkit and drivers.
- Background with Conda and/or PyPI packaging, and container technologies such as Docker (especially building and publishing).
- Experience with GitHub operations, including user, repository, and organization management and permissions, along with open-source development and community building on GitHub.
Compensation & Benefits
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 208,000 USD - 333,500 USD for Level 3, and 248,000 USD - 396,750 USD for Level 4. You will also be eligible for equity and benefits.
Additional Information
- Applications for this job will be accepted at least until July 29, 2025.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.
#deeplearning