Taro Logo

Senior Software Engineer, AI/ML and Data Infrastructure, Central Technology

The Chan Zuckerberg Initiative was founded by Priscilla Chan and Mark Zuckerberg in 2015 to help solve some of society's toughest challenges — from eradicating disease and improving education to addressing the needs of our local communities. Our mission is to build a more inclusive, just, and healthy future for everyone.
$190,000 - $285,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Software Engineer - Qualcomm, Linkoping, Sweden

Senior Software Engineer position at Qualcomm Linkoping focusing on autonomous driving software and computer vision, offering comprehensive benefits and professional growth opportunities.

Senior Research Engineer for On-Device LLM Efficiency

Senior Research Engineer position at Qualcomm focusing on optimizing LLM efficiency for on-device applications.

Software Engineer - AI/ML

Senior Software Engineering role at Qualcomm focusing on AI/ML development for mobile and embedded devices, requiring 5+ years of experience and strong Python skills.

Senior Engineer, AI Orchestration

Senior Engineer role at Qualcomm focusing on AI Orchestration and machine learning implementation for Snapdragon platforms.

Senior Software Engineer - Avatar AI

Senior Software Engineer position at Roblox focusing on Avatar AI and machine learning implementation, offering competitive compensation and the opportunity to work on cutting-edge ML models.

Description For Senior Software Engineer, AI/ML and Data Infrastructure, Central Technology

The Chan Zuckerberg Initiative (CZI) is seeking a Senior Software Engineer specializing in AI/ML and Data Infrastructure for their Central Technology team. This role is part of CZI's mission to build a more inclusive, just, and healthy future for everyone.

As a Senior Software Engineer in AI/ML and Data Infrastructure, you will:

  1. Design and build efficient, stable, performant, scalable, and secure AI/ML and Data infrastructure engineering solutions.
  2. Work hands-on with applications and systems integrations for large-scale AI/ML GPU compute infrastructure and platforms across multiple clouds.
  3. Develop containerized applications and infrastructure using Kubernetes to support large-scale GPU Research clusters and heterogeneous AI/ML environments.
  4. Collaborate on Cloud-based AI/ML platform solutions, including Databricks Spark, Weaviate Vector Databases, and hosted Cloud GPU Compute services running containerized PyTorch on large-scale Kubernetes.
  5. Partner on data management solutions for complex datasets.
  6. Build tooling to optimize shared infrastructure for AI/ML efforts with world-class GPU Compute Cluster and other compute environments.

The ideal candidate will have:

  • BS or MS degree in Computer Science or related technical discipline, or equivalent experience
  • 5+ years of relevant coding experience
  • 3+ years of systems Architecture and Design experience across Data, AI/ML, Core Infrastructure, and Security Engineering
  • Expertise in scaling containerized applications on Kubernetes
  • Proficiency with cloud platforms (AWS, GCP, or Azure) and on-premises/colocation environments
  • Strong coding skills in systems languages (Rust, C/C++, C#, Go, Java, or Scala) and scripting languages (Python, PHP, or Ruby)
  • Experience with AI/ML Platform Operations, including large-scale Kafka and Spark deployments
  • MLOps experience with medium to large-scale GPU clusters in Kubernetes or HPC environments
  • Knowledge of Nvidia CUDA and AI/ML custom libraries
  • Understanding of Linux systems optimization and administration

This position offers a competitive salary range of $190,000 - $285,000, along with comprehensive benefits including 401(k) matching, annual benefits for various life needs, paid time off for volunteering, and more.

The role is based in Redwood City, CA, with a hybrid work arrangement requiring on-site presence a few days per week. CZI is committed to diversity, equity, and inclusion, and encourages applicants from all backgrounds to apply.

Join the Chan Zuckerberg Initiative and be part of a team working to leverage technology and innovation to address some of society's most pressing challenges in science, education, and social justice.

Last updated 9 months ago

Responsibilities For Senior Software Engineer, AI/ML and Data Infrastructure, Central Technology

  • Design and build efficient, stable, performant, scalable, and secure AI/ML and Data infrastructure engineering solutions
  • Work hands-on with applications and systems integrations for large-scale AI/ML GPU compute infrastructure and platforms across multiple clouds
  • Develop containerized applications and infrastructure using Kubernetes to support large-scale GPU Research clusters and heterogeneous AI/ML environments
  • Collaborate on Cloud-based AI/ML platform solutions, including Databricks Spark, Weaviate Vector Databases, and hosted Cloud GPU Compute services
  • Partner on data management solutions for complex datasets
  • Build tooling to optimize shared infrastructure for AI/ML efforts with world-class GPU Compute Cluster and other compute environments

Requirements For Senior Software Engineer, AI/ML and Data Infrastructure, Central Technology

Kubernetes
Python
Java
Go
Scala
Redis
Kafka
  • BS or MS degree in Computer Science or related technical discipline, or equivalent experience
  • 5+ years of relevant coding experience
  • 3+ years of systems Architecture and Design experience across Data, AI/ML, Core Infrastructure, and Security Engineering
  • Expertise in scaling containerized applications on Kubernetes
  • Proficiency with cloud platforms (AWS, GCP, or Azure) and on-premises/colocation environments
  • Strong coding skills in systems languages (Rust, C/C++, C#, Go, Java, or Scala) and scripting languages (Python, PHP, or Ruby)
  • Experience with AI/ML Platform Operations, including large-scale Kafka and Spark deployments
  • MLOps experience with medium to large-scale GPU clusters in Kubernetes or HPC environments
  • Knowledge of Nvidia CUDA and AI/ML custom libraries
  • Understanding of Linux systems optimization and administration

Benefits For Senior Software Engineer, AI/ML and Data Infrastructure, Central Technology

  • Competitive salary range
  • 401(k) matching
  • Annual benefit for various life needs
  • Paid time off for volunteering
  • CZI Life of Service Gifts
  • Funding for select family-forming benefits
  • Relocation support for employees moving to the Bay Area

Interested in this job?