Used Tools & Technologies
Machine LearningRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Linux @ 3
Python @ 3
NoSQL @ 3
CI/CD @ 3
Bash @ 3
Communication @ 6
Networking @ 3
API @ 3
HTTP @ 3
Cloud Computing @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Employees work at the cutting edge of AI cloud infrastructure alongside experienced leaders and engineers.
Role overview
Nebius operates large-scale, mission-critical bare-metal infrastructure. As a Software Engineer (Python) you will design and build systems that provision, configure, test, and manage physical hardware at scale. Your work will sit close to the metal — interfacing directly with servers, networks, and management controllers — while supporting highly automated, reliable infrastructure operations. You will collaborate closely with hardware, networking, and data center operations teams to ensure platforms are robust, scalable, and production ready.
Responsibilities
- Design and develop backend services and automation in Python
- Build and maintain systems for hardware provisioning, testing, and lifecycle management
- Develop software that runs directly on bare-metal environments
- Integrate with Linux systems, using Bash and low-level tooling where needed
- Implement and maintain CI/CD pipelines for infrastructure-focused software
- Work with networking services including IPv4/IPv6, DHCP, DNS, network boot, and server boot workflows (PXE / network boot)
- Interface with BMC controllers and management protocols (IPMI-style protocols, HTTP-based standards)
- Enable reliable hardware interaction and automation across large fleets
- Support ARM64 / ARM64EC architectures
- Design and integrate NoSQL data stores for system state and orchestration data
- Write clear documentation and contribute to operational excellence
Requirements
- Strong professional experience as a software engineer, with a focus on Python
- Solid experience with Linux systems and shell scripting (Bash)
- Hands-on experience working with bare-metal servers or low-level infrastructure
- Strong understanding of networking fundamentals (IPv4/IPv6, DHCP, DNS, PXE / network boot)
- Experience interacting with hardware management interfaces (BMC, IPMI-like protocols, HTTP APIs)
- Familiarity with CI/CD systems and production deployment workflows
- Experience designing or working with NoSQL databases
- Ability to debug complex issues spanning software, hardware, and networks
- Strong ownership mindset and clear communication skills in a distributed team
Nice to have
- Experience operating or building systems for large-scale infrastructure
- Familiarity with ARM-based platforms in production environments
- Background in hardware testing, validation, or factory provisioning
- Experience with infrastructure automation or internal platform tooling
- Contributions to open-source or internal systems software projects
Working conditions
- Fully remote position (United States)
- Collaboration with globally distributed engineering and operations teams
Benefits
- Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
- 401(k) plan: up to 4% company match with immediate vesting
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
- Remote work reimbursement: up to $85/month for mobile and internet
- Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
- Competitive salary and comprehensive benefits package
- Opportunities for professional growth and flexible working arrangements
Compensation
We offer competitive salaries, ranging from $150k–$210k base, plus equity and quarterly performance bonuses.
Join Nebius and help build the software that powers the next generation of AI infrastructure.