Used Tools & Technologies
Machine LearningRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Ansible @ 5
Kubernetes @ 3
IaC @ 5
Terraform @ 5
Python @ 3
TensorFlow @ 3
AWS @ 2
Azure @ 2
Mentoring @ 3
KubeFlow @ 3
Project Management @ 6
PyTorch @ 3
CUDA @ 3
Cloud Computing @ 3
GPU @ 3
Deep Learning @ 3
AI @ 3
OpenCL @ 3
Slurm @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside experienced and innovative leaders and engineers.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team includes over 800 employees and more than 400 engineers with expertise across hardware and software engineering, and an in-house AI R&D team. You are welcome to work remotely from the United States for this role.
Role description
We are looking for a Customer Engineer to support key and strategic Nebius GPU Cloud services customers. In this role you will be a trusted technical advisor, helping clients design, deploy, and scale AI solutions while managing large-scale GPU workloads involving hundreds to thousands of GPUs. You will collaborate with sales and product teams to drive growth and enhance customer satisfaction.
Responsibilities
- Serve as the primary technical point of contact for troubleshooting and resolving complex AI/ML issues.
- Guide customers in optimizing GPU performance for ML training and inference workloads, ensuring seamless integration and scalability.
- Partner with the sales team to identify new opportunities, promote the latest products, and deliver technical presentations.
- Act as a bridge to product teams, providing customer feedback, relaying feature requests, and ensuring alignment with customer requirements.
- Engage with internal and external stakeholders, negotiate solutions, and effectively drive alignment to address customer challenges.
Requirements
- Experience: 5+ years in roles like Solutions Architect, Technical Account Manager, or Customer Engineer, with hands-on experience in cloud services and AI/ML workloads.
- Proficiency in Infrastructure as Code (IaC) tools like Terraform and Ansible.
- Experience with Kubernetes and Python programming.
- Solid understanding of GPU computing, including ML training and inference workloads, and GPU stacks (e.g., CUDA, OpenCL).
- Customer-centric approach with a proven ability to build trust and foster long-term relationships.
- Strong ability to explain technical concepts to technical and non-technical audiences.
It will be an added bonus if you have
- Hands-on experience with HPC/ML orchestration frameworks (e.g., Slurm, Kubeflow).
- Experience with deep learning frameworks (e.g., PyTorch, TensorFlow).
- Familiarity with ML tools from NVIDIA, AWS, Azure, and Google Cloud providers.
- Strong project management skills, with the ability to prioritize tasks and deliver on deadlines.
- Proven experience mentoring technical teams and driving team growth.
- Expertise in stakeholder negotiation to support problem resolution and ensure seamless collaboration.
Benefits
- Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) Plan: Up to 4% company match with immediate vesting.
- Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote Work Reimbursement: Up to $85/month for mobile and internet.
- Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.
- Competitive salary and comprehensive benefits package, opportunities for professional growth, flexible working arrangements, and a dynamic, collaborative work environment.
Compensation
We offer competitive salaries, ranging from $225k - $275k OTE (On-Target Earnings) and equity based on your experience, skills, and location.