Member of Technical Staff - Multimodal Post-Training

xAI
Full-time
Palo Alto, CA
$180,000 - $440,000
Posted a month ago

Job Description

xAI is seeking a highly motivated engineer to build the next generation of Multimodal Grok, focusing on reasoning and tool usage to solve challenging problems across the LLM stack. The role involves working with real-time video understanding, data curation, pipelines, and evaluation for video-based multimodal capabilities.

Responsibilities

  • Build next-generation Multimodal Grok
  • Work across the LLM stack (pre-training, SFT, RL)
  • Deliver the strongest model for end users
  • Work with real-time video understanding
  • Own data curation, pipelines, and evaluation for video-based multimodal capabilities

Requirements

  • Strong engineering skills
  • Strong understanding of large language models and data
  • Strong communication skills
  • Experience with data processing and training pipelines
  • Experience with, or publications on, (multimodal) large language models is a plus
  • Knowledge of reinforcement learning techniques is a plus
  • Experience in computer vision, video processing, or multimodal datasets is a plus

Benefits

  • No benefits