Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 3 Automated Testing @ 3 Python @ 5 Distributed Systems @ 3 Machine Learning @ 3 TensorFlow @ 3 Communication @ 6 Rust @ 3 Debugging @ 3 API @ 3 PyTorch @ 3 GPU @ 3Details
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. The Horizons team leads reinforcement learning research and development, contributing to advancements in AI autonomy, coding capabilities, scalable training infrastructure, and reasoning capabilities of large language models.
Responsibilities
- Architect and optimize core reinforcement learning infrastructure from training abstractions to distributed experiment management across GPU clusters.
- Design, implement, and test novel training environments, evaluations, and methodologies for reinforcement learning agents.
- Drive performance improvements by profiling, optimization, caching solutions, and debugging distributed systems.
- Collaborate with research and engineering teams to develop automated testing frameworks, clean APIs, and scalable infrastructure.
Requirements
- Proficient in Python and async/concurrent programming (e.g., Trio).
- Experience with machine learning frameworks such as PyTorch, TensorFlow, or JAX.
- Industry experience in machine learning research.
- Ability to balance research exploration with engineering implementation.
- Strong code quality, testing, and performance focus.
- Strong systems design and communication skills.
- Passionate about AI’s potential impact and committed to safe, beneficial AI systems.
Strong candidates may also have:
- Familiarity with large language model architectures and training methodologies.
- Experience with reinforcement learning techniques and environments.
- Experience with virtualization and sandboxed code execution.
- Experience with Kubernetes, distributed systems, or high-performance computing.
- Experience with Rust and/or C++.
Strong candidates need not have:
- Formal certifications or academic research history.
Benefits
Anthropic is a public benefit corporation based in San Francisco offering competitive compensation, equity donation matching, generous vacation and parental leave, flexible working hours, and a collaborative office environment. Visa sponsorship is available for qualifying candidates.
Logistics
- Education requirement: At least a Bachelor's degree or equivalent experience.
- Hybrid work policy: Staff expected to be in-office at least 25% of the time.
- Rolling application review.