Member of Technical Staff, Pre-Training Data Infrastructure

xAI
Full-time
Palo Alto, CA; San Francisco, CA
$180,000 - $440,000
Posted on a month ago

Job Description

xAI is seeking a highly motivated engineer to design, implement, and manage petabyte-scale data processing systems for pre-training and post-training AI models. The role involves building data pipelines, improving data quality, and ensuring data discoverability. xAI values engineering excellence, initiative, and strong communication skills.

Responsibilities

  • Design and implement petabyte-scale data processing systems
  • Design and implement tools for data pipeline orchestration
  • Improve data discoverability and quality
  • Build and manage data pipelines for training data creation

Requirements

  • Strong systems skills in distributed data processing
  • Experience building data processing systems
  • Experience preparing data for large language models
  • Data organization and meticulous bookkeeping skills
  • Proficiency in Python, Rust, or C++
  • Familiarity with Spark and Ray

Benefits

  • No benefits