Used Tools & Technologies
Machine LearningRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Grafana @ 4
Kafka @ 4
Kubernetes @ 4
Kibana @ 4
Python @ 4
SQL @ 4
R @ 4
Spark @ 4
Statistics @ 4
Java @ 4
CI/CD @ 4
MLOps @ 4
Data Science @ 4
scikit-learn @ 4
TensorFlow @ 4
Leadership @ 4
Communication @ 7
Mathematics @ 4
KubeFlow @ 4
MLFlow @ 4
Technical Leadership @ 4
Databricks @ 4
LLM @ 4
PyTorch @ 4
Pandas @ 4
GPU @ 4
AI @ 4
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Join the NVIDIA GeForce NOW cloud team that enables users to play high-quality PC games on multiple devices without a dedicated gaming PC or console. GeForce NOW is built on NVIDIA GPU architectures and software optimizations to deliver efficient, high-quality experiences at high resolutions and frame rates with industry-leading low latencies.
Our team builds diagnostic, prescriptive and AI-augmented analytics solutions for processing, visualization, anomaly detection, root-cause and predictive modeling for millions of end users. Active projects include real-time demand forecasting, constraint-optimized capacity allocation, dynamic per-session prescriptions, customer onboarding and Voice of Customer analytics, targeted outreach driven by retention models, personalized diagnostics, and an LLM chatbot. The technology stack uses industry-standard components and tools including Python, R, Pandas, JupyterLab, Spark, SQL, Databricks, MLflow, Delta Lake, Grafana, Kibana, Kubeflow, Elyra, Kubernetes, GitLab, CI/CD, MLOps, Kafka, and SQS.
Responsibilities
- Provide technical leadership to data scientists and engineers for global, large-scale GPU compute service deployments.
- Collaborate with leadership and stakeholders to gather high-level requirements, build technical roadmaps, design solutions, and guide delivery.
- Acquire and apply domain knowledge of the product and platform to lead design, implementation, and deployment of AI/ML solutions that generate actionable insights and real-time prescriptive analytic pipelines for production services.
- Build and deploy real-time, scalable solutions for user diagnostics, LLM chatbot, dynamic suspicious activity detection, feedback-based clustering/alerting, and LLM-based actionable insight generation for engineering and management.
- Process and wrangle petabytes of data using statistical, AI/ML and LLM models to deliver actionable, real-time insights and improve organizational productivity.
- Apply forecasting models and constraint-optimization solvers to improve capacity management, server efficiency, and end-user latency.
- Develop ML/AI predictive models with explainability for user retention/churn and design targeted outreach campaigns.
- Create innovative multi-agent self-learning harnesses to improve analytics and deployment productivity.
Requirements
- Master's/PhD or equivalent experience in Data Science, Statistics, Mathematics, Physics, Operations Research, or a related quantitative field.
- 15+ years of software experience delivering large-scale, reliable production deployments and 8+ years of proven experience in statistics/AI/ML.
- Hands-on expertise with Python, SQL, Java and modeling frameworks such as scikit-learn, PyTorch, and TensorFlow for large projects.
- Experience with data storage and processing tools (e.g., Spark, Pandas, Delta Lake) and operating large-scale software across big networks.
- Strong verbal and written communication skills to convey complex data insights to technical and non-technical stakeholders.
- Demonstrated track record leading successful, large-scale data science projects.
- Experience in user retention modeling, LLMs, time series forecasting, or operations research is a plus.
Benefits
- Competitive base salary range: 248,000 USD - 379,500 USD (final base depends on location, experience, and comparable pay).
- Eligibility for equity and company benefits.
Additional details
- Applications accepted at least until June 26, 2026. This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer committed to fostering an inclusive work environment and does not discriminate based on protected characteristics.
#deeplearning