Used Tools & Technologies
LLM GPURequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Software Development @ 3
Kubernetes @ 3
Python @ 3
Spark @ 3
GCP @ 3
Distributed Systems @ 3
Machine Learning @ 3
Data Science @ 3
AWS @ 3
Azure @ 3
Rust @ 3
Hadoop @ 3
NLP @ 3
Cloud Computing @ 3
Deep Learning @ 3
AI @ 3
Computer Vision @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
xAI is seeking engineers to introduce innovative techniques and analyses to the AI field to facilitate breakthroughs in quantitative reasoning and language understanding. The team is small and focused on engineering excellence; employees are expected to be hands-on, communicate concisely, and contribute directly to the company mission. Telecommuting is permitted.
Responsibilities
- Stabilize large language model training, pipeline-parallel training of large language models, and fine-tuning large language models with truthful data.
- Perform cutting-edge research on advanced techniques from AI and deep learning, including neural network architectures, language modeling, and speech recognition.
- Design and implement hands-on technical solutions and tooling within company software and systems.
- Work closely with leaders across the company to deliver impactful projects that may involve machine learning, applied data science, recommendation systems, and information retrieval systems.
- Collaborate with distributed systems engineers and AI researchers to develop scalable technologies for NLP, computer vision, and speech recognition applications.
Requirements
- Bachelor’s degree or foreign equivalent in Computer Science, Computer Engineering, Mechanical Engineering, Machine Learning, or a related field, and 2 years of relevant experience.
- Experience, knowledge, or coursework in each of the following areas:
- Big Data systems such as Spark, Hadoop, BigQuery, and related technologies to build highly scalable data processing systems.
- Building large-scale Kubernetes clusters for data storage, processing, and analysis on on-prem systems and cloud computing.
- Applied machine learning techniques and deploying large-scale deep learning systems.
- Working with distributed system engineers and AI researchers on NLP, computer vision, and speech recognition applications.
- Programming in Rust, C++, or Python to build tooling and features within company software development code and standards.
- Building applications with hardware accelerators, such as GPUs and TPUs from GCP, Azure, and AWS.
- Employment and background checks may be required.
Additional Information
- Location: Palo Alto, CA (telecommuting permitted).
- Reference: 00100860
- Salary: $324,000 - $396,000 per year