Member of Technical Staff - Inference

at xAI

📍 Palo Alto, United States

USD 180,000-440,000 per year

MIDDLE

✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Leadership @ 3 Communication @ 6 Prioritization @ 6 GPU @ 3 AI @ 3 Reinforcement Learning @ 3

Details

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills and be able to concisely and accurately share knowledge with their teammates.

Location: Palo Alto, CA

Responsibilities

Optimizing the latency and throughput of model inference.
Building reliable and performant production serving systems to serve billions of users.
Accelerating research on scaling test-time compute and rollout in reinforcement learning training.
Model-hardware co-design for next-generation architectures.

Requirements

The posting lists the following basic qualifications:

Worked on system optimizations for model serving, such as batching, caching, load balancing, and parallelism.
Worked on low-level optimizations for inference, such as GPU kernels and code generation.
Worked on algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding, and low-precision numerics.
Worked on large-scale inference engines or reinforcement learning frameworks.
Worked on large-scale, high-concurrent production serving.
Worked on testing, benchmarking, and reliability of inference services.

Compensation and benefits

$180,000 - $440,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view their Recruitment Privacy Notice.