AI Engineer & Researcher - Pre-training Scaling, Data, and Eval

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
$180,000 - $440,000
Machine Learning
Senior Software Engineer
In-Person
11 - 50 Employees
5+ years of experience
AI

Description For AI Engineer & Researcher - Pre-training Scaling, Data, and Eval

xAI, a cutting-edge AI research company, is seeking an AI Engineer & Researcher specializing in pre-training scaling, data, and evaluation. The role is based in the Bay Area (San Francisco and Palo Alto) and offers a competitive salary range of $180,000 - $440,000 USD.

The company operates with a flat organizational structure where all employees are hands-on contributors to the mission of creating AI systems that can accurately understand the universe. The team is small, highly motivated, and focused on engineering excellence, making it an ideal environment for curious individuals who thrive on challenges.

The role involves working with state-of-the-art AI technologies, including training trillion-parameter neural networks and implementing cutting-edge methods from deep learning literature. The tech stack includes Python, JAX and XLA, Rust/C++, and Spark. The successful candidate will be responsible for innovating new ideas for pretraining and scaling paradigms, while also improving data quality across different modalities.

The ideal candidate should possess strong engineering skills, expertise in ML and large model scaling, and familiarity with distributed training and multi-GPU neural network optimization. They should also have experience with AI training data preparation and be skilled at organizing data across multiple clouds and modalities.

The interview process is thorough but efficient, typically completed within one week, consisting of a phone screening followed by four technical interviews including coding assessment, systems hands-on, project deep-dive, and team meet-and-greet. The company values strong communication skills and the ability to share knowledge effectively with teammates.

Last updated 6 days ago

Responsibilities For AI Engineer & Researcher - Pre-training Scaling, Data, and Eval

  • Training trillion parameter neural networks at scale, as well as a variety of smaller specialized models
  • Rapidly implementing the latest state-of-the-art methods from the deep learning literature
  • Innovating new ideas for pretraining and new scaling paradigm
  • Improving pretraining data quality at scale across different modalities

Requirements For AI Engineer & Researcher - Pre-training Scaling, Data, and Eval

Python
  • Strong engineering skills with passion to improve different aspects of data and model
  • Expert in ML and large model scaling, familiar with all kinds of scaling laws
  • Familiar with distributed training, multi-GPU neural network training and experience on optimizing ML training efficiency
  • Familiar with state-of-the-art techniques for preparing AI training data
  • Good at organizing and meticulously bookkeeping data across multiple clouds, of multiple modalities, and from many sources

Interested in this job?

Jobs Related To xAI AI Engineer & Researcher - Pre-training Scaling, Data, and Eval

AI Engineer & Researcher - Inference

Senior AI Engineer & Researcher position at xAI focusing on optimizing model inference and building scalable AI systems, offering $180-440K in San Francisco/Palo Alto.

Senior Software Engineer, AGI Automations

Senior Software Engineering role leading Amazon's AGI team in developing generative AI technologies and multimodal foundation models, requiring 5+ years of experience.

Senior Software Engineer, Machine Learning, YouTube

Senior Software Engineer position focused on machine learning at YouTube, developing AI technologies to enhance video platform capabilities.

Senior Machine Learning Engineer, Developer Productivity

Senior Machine Learning Engineer position at Apple focused on enhancing developer productivity through ML solutions, offering competitive compensation and comprehensive benefits.

Senior Software Engineer - Applied Sciences Group

Senior Software Engineer role at Microsoft's Applied Sciences Group in Belgrade, focusing on AI/ML development for next-gen Windows experiences. Hybrid work model with comprehensive benefits.