Used Tools & Technologies
Machine LearningRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Linux @ 3
Data Analysis @ 3
System Architecture @ 3
Cloud Computing @ 3
GPU @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 engineers with deep expertise across hardware and software engineering and an in-house AI R&D team.
New data center development:
You will have the opportunity to work with cutting-edge technologies in data operations, cloud computing and infrastructure management. As global data center operations grow, there will be opportunities for career progression. Working in the data center directly impacts performance, customer satisfaction and efficiency, with the opportunity to contribute to new data center projects. You will collaborate with experts in AI data center development and operations and work on solutions that exceed industry standards in design and deployment.
Responsibilities
- Maintain and optimize IT infrastructure within the data center, including hands-on work with modern technologies such as the advanced H200 GPU cloud cluster.
- Troubleshoot and resolve firmware- and hardware-related issues with servers, requiring in-depth knowledge of system architecture and advanced troubleshooting.
- Take charge of hardware problem management, workarounds and solutions; act as a subject matter expert and escalation point for the team.
- Perform hardware and network diagnostics, carry out physical repairs, and participate in on-call rotations and travel between data centers as required.
- Create and improve processes, documentation and training materials for the IT hardware team.
- Collaborate with R&D to improve hardware designs and with vendors on warranty replacements (RMA).
Requirements
- Knowledge of datacenters and server equipment.
- Deep knowledge of IT hardware and practical experience troubleshooting hardware issues.
- Advanced skills working with Unix/Linux operating systems and command line.
- Experience with equipment monitoring, data analysis and presentation.
- Proactiveness and a strong sense of responsibility.
- High proficiency in spoken and written English.
It will be an added bonus if you have:
- Valid driver's license.
- Skills repairing electronics at component level (SMD).
- Knowledge of network equipment and troubleshooting.
You are welcome to work in our colocation in Vineland, New Jersey.
Benefits
- Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families (US).
- 401(k) plan: Up to 4% company match with immediate vesting.
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Company-paid short-term, long-term, and life insurance coverage.
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements and a dynamic, collaborative work environment.
Compensation
We offer competitive salaries, ranging from $90,000 to $100,000 per year.
Join Nebius Today! If you are excited about AI and ML and enjoy working on data center infrastructure, apply to join our team.