Software Engineer - Data Acquisition / Web Crawling

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
$180,000 - $440,000
Data
Senior Software Engineer
In-Person
11 - 50 Employees
5+ years of experience
AI

Description For Software Engineer - Data Acquisition / Web Crawling

xAI is at the forefront of artificial intelligence development, creating systems to understand the universe and advance human knowledge. This role offers an exciting opportunity to join their Data Acquisition team, where you'll be instrumental in building world-class systems that process hundreds of petabytes of diverse data - from web content to code, images, audio, and video.

As a Software Engineer specializing in Data Acquisition and Web Crawling, you'll be working on systems that power cutting-edge AI models like Grok 3. You'll collaborate closely with pre-training, reasoning, and multimodal teams, helping to define and implement precise data requirements and large-scale classifiers. The role demands expertise in building robust, high-throughput data pipelines and managing petabyte-scale challenges.

The position is based in the Bay Area, offering a competitive salary range of $180,000 - $440,000 USD. xAI operates with a flat organizational structure where initiative and excellence determine leadership opportunities. The team is small, highly motivated, and focused on engineering excellence, making it a good fit for those who thrive on curiosity and on challenging themselves.

The tech stack includes Python, Rust, Spark, and Kubernetes, and you'll be working with various data modalities and distributed systems. The role requires strong proficiency in compiled languages, experience with large-scale system optimization, and deep knowledge of internet fundamentals. This is an opportunity to directly impact the evolution of transformative AI while working with cutting-edge technology and brilliant minds in the field.

Last updated 6 days ago

Responsibilities For Software Engineer - Data Acquisition / Web Crawling

  • Building high-throughput data processing systems that manage hundreds of petabytes to exabytes of data
  • Designing and operating large-scale distributed systems and pipelines processing hundreds of thousands to millions of operations per second
  • Managing workloads across large cloud compute clusters
  • Pre-processing datasets for AI training
  • Building and operating large-scale crawlers

Requirements For Software Engineer - Data Acquisition / Web Crawling

  • Python
  • Kubernetes
  • Strong engineering skills with a passion for improving different aspects of data and model performance
  • Strong proficiency in at least one compiled language: Rust, Go, C++, or Java
  • Experience with one or more modalities other than text
  • Experience building bespoke data processing libraries from scratch
  • Experience designing and implementing distributed systems in Rust
  • Experience with performance optimization of large-scale systems
  • Experience with SQL/NoSQL databases
  • Great debugging skills
  • Deep knowledge of internet fundamentals, including DNS, the OSI model, and crawler architectures
