Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Software Development @ 6
Distributed Systems @ 3
JVM @ 5
Communication @ 6
Rust @ 5
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence. The organization expects hands-on contributions, strong communication, initiative, and the ability to prioritize.
Role overview
We are building a new storage tier that powers training, inference, recommendations, and real-time data extraction for a platform used by ~600 million users per month. Systems in scope include an exabyte-scale S3-compatible object store, a multi-tenant transactional key/value store serving hundreds of millions of requests per second, a caching tier serving billions of requests per second, a scalable vector database for recommendations, and other specialized high-throughput databases.
Responsibilities
- Design, build, and launch production features and improvements to unify common components across storage systems.
- Diagnose and resolve performance issues; work with customers to meet latency, availability, and data durability requirements.
- Lead and drive incident responses and recoveries; contribute to and review incident postmortems.
- Mentor peers, perform code and design reviews, and help uplevel the team.
- Learn new technologies as needed and ground decisions with data and reasoning.
Requirements
- At least 7 years of software development experience, plus 2+ years building storage systems or databases.
- Strong focus on reliability, performance, and quality.
- Proficiency in high-performance C++, Rust, or JVM-based languages.
- Expertise building, running, and operating scalable and resilient distributed systems.
- Experience or familiarity with S3-compatible object stores, transactional key/value systems, caching tiers, and vector databases is relevant.
Compensation and benefits
- Base salary: $180,000 - $440,000 USD
- Total rewards also include equity, comprehensive medical/vision/dental coverage, access to a 401(k) plan, short & long-term disability insurance, life insurance, and other discounts and perks.
Location
- Palo Alto, CA (United States)