Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by the "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 – basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 – daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 – you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 – exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Security @ 4
Kubernetes @ 4
Linux @ 4
Python @ 4
GCP @ 4
Machine Learning @ 4
Leadership @ 4
AWS @ 4
Azure @ 4
Bash @ 4
Communication @ 4
Networking @ 4
Performance Optimization @ 4
Technical Leadership @ 4
GPU @ 4
AI @ 4
Details
About Nebius:
Nebius is leading a new era in cloud infrastructure for the global AI economy. The company is building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment. Nebius focuses on large-scale GPU orchestration, inference optimization, and owns problems across compute, storage, networking and applied AI. The company is listed on Nasdaq (NBIS) and headquartered in Amsterdam with R&D hubs across Europe, the UK, North America and Israel.
Role overview
You will resolve complex technical issues escalated by Nebius clients and Technical Account Managers. This senior role requires advanced expertise, strong problem-solving skills and a customer-focused approach to ensure seamless operations. As a senior team member, you will lead process improvements, mentor junior staff and develop scalable support practices. The role is shift work and can be performed remotely from the United States.
Responsibilities
- Lead diagnosis and resolution of advanced technical issues across Linux, networking, security, Kubernetes and cloud environments.
- Serve as a technical escalation point, providing guidance and support for complex troubleshooting scenarios.
- Investigate and resolve storage-related issues, leveraging deep expertise in data storage architectures and performance optimization.
- Apply ML knowledge to support and optimize model deployment, performance tuning and integration challenges within ML pipelines.
- Design and develop Python and Bash automation scripts to streamline workflows and improve operational efficiency.
- Deliver timely updates and detailed explanations to customers, ensuring clear communication of issue status, root cause analysis and resolutions.
- Proactively escalate unresolved problems while collaborating with cross-functional teams to minimize service disruptions and improve processes.
- Maintain and continuously enhance technical documentation, reflecting best practices.
- Provide technical leadership and mentorship to mid-level support colleagues.
- Collaborate with engineering and product teams to identify patterns, propose enhancements and contribute to the evolution of systems and services.
Requirements / Skills
- Linux administration – expert
- Networking – advanced
- Kubernetes – advanced
- Cloud platforms (AWS, Azure, GCP) – advanced
- Python and Bash scripting – advanced
- Machine learning training, inference and pipelines – intermediate
- Data storage concepts and performance optimization – advanced
- Troubleshooting & problem-solving – expert
- Experience: 7+ years in technical support, 5+ years of hands-on cloud experience
Qualifications
- Education: Bachelor's degree in Computer Science, Information Technology or a related field.
- Certifications: Relevant certifications in cloud platforms (AWS, Azure, GCP), Linux administration or Kubernetes are highly desirable.
Compensation
- Compensation Range: $150,000 – $200,000 USD (competitive package based on experience).
Benefits
- 100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) plan with up to 4% company match and immediate vesting.
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote work reimbursement: Up to $85/month for mobile and internet.
- Company-paid short-term, long-term, and life insurance coverage.
- Competitive compensation, career growth, flexibility, and opportunity to work on impactful AI projects.
Other details
- Remote from the United States (work authorization required). Applicants must be authorized to work in the country in which they apply and will be required to provide proof of employment eligibility as a condition of hire.
- Shift work required.
- Nebius is an equal opportunity employer committed to an inclusive and diverse workplace.