Member of Technical Staff, Pre-Training Data

xAI
Full-time
Palo Alto, CA
$180,000 - $440,000
Posted a month ago

Job Description

xAI is seeking a highly motivated engineer to join the pre-training data team, focused on crafting data recipes for training an omni-model on text, image, video, and audio data. The role involves collaborating on data sourcing, building pipelines, developing evaluations, and devising new data scaling techniques.

Responsibilities

  • Collaborate with the crawling team to source datasets
  • Architect pipelines to transform datasets at petabyte scale
  • Develop evaluations for pre-training models
  • Craft experiments to assess dataset performance
  • Devise new data scaling techniques

Requirements

  • Strong engineering skills in Python, Spark, and Ray
  • Familiarity with ML and large model scaling
  • Ability to design ML experiments
  • Familiarity with AI training data curation techniques
  • Strong communication skills

Benefits

  • No benefits