Member of Technical Staff, RL Training Framework

xAI
Full-time
Palo Alto, CA
$180,000 - $440,000
Posted on a month ago

Job Description

xAI is seeking a highly motivated engineer to join their reasoning infrastructure team and build an end-to-end RL training framework for pretraining-scale RL. The role involves designing, implementing, profiling, debugging, and optimizing distributed RL systems, as well as software and algorithm co-design.

Responsibilities

  • Design and implement distributed RL systems
  • Profile, debug, and optimize system performance
  • Software and algorithm co-design

Requirements

  • Experience building, debugging, and optimizing large scale distributed training systems
  • Experience building async RL training frameworks
  • Experience in inference systems
  • Proficiency in Python, Jax, or Rust
  • Strong communication skills

Benefits

  • No benefits