Senior Site Reliability Engineer Cloud Platform

Zilliz is the industry's leading vector database company for enterprise-grade AI, founded by the engineers behind Milvus, the world's most popular open-source vector database.
$150,000 - $220,000
DevOps
Senior Software Engineer
Hybrid
4+ years of experience
AI · Enterprise SaaS

Description For Senior Site Reliability Engineer Cloud Platform

Zilliz, the pioneering force behind the world's most popular open-source vector database Milvus, is seeking a Senior Site Reliability Engineer to join their cloud platform team. This role represents a unique opportunity to work at the forefront of AI infrastructure, helping to build and maintain enterprise-grade vector database systems.

The position combines traditional SRE responsibilities with the excitement of working on cutting-edge AI technology. You'll be responsible for ensuring the reliability and performance of distributed database systems, implementing monitoring solutions, and automating operations. The role requires expertise in cloud platforms, container orchestration, and infrastructure as code, making it perfect for someone who enjoys working with modern DevOps tools and practices.

As part of the team, you'll have the chance to directly impact the success of a fast-growing startup while contributing to the open-source community. The company offers a competitive compensation package, including equity opportunities and comprehensive benefits, reflecting their commitment to attracting top talent.

The hybrid work environment (three days per week in office) provides a balance between collaborative in-person work and flexibility. Located in Redwood City, California, you'll be working in the heart of Silicon Valley's tech ecosystem. This is an excellent opportunity for an experienced SRE who wants to work on challenging problems in the AI space while helping to shape the future of vector databases.

Last updated 13 hours ago

Responsibilities For Senior Site Reliability Engineer Cloud Platform

  • Work at the intersection of development and site reliability
  • Ensure the reliability, availability, and performance of Zilliz's distributed database systems
  • Develop and implement strategies for monitoring, incident management, and disaster recovery
  • Automate system operations and maintenance tasks
  • Design and build tools to manage and monitor infrastructure
  • Collaborate with software engineers to enhance system reliability, scalability, and performance
  • Maintain and improve the CI/CD pipeline
  • Contribute to the Milvus Vector Database open-source community

Requirements For Senior Site Reliability Engineer Cloud Platform

Python
Go
Java
Kubernetes
  • 4+ years of experience in site reliability engineering or similar roles
  • Proficiency in scripting languages such as Python, Go, or Java
  • Strong knowledge of container orchestration technologies like Kubernetes and Docker
  • Expertise with cloud platforms such as AWS, GCP, or Azure
  • Experience with infrastructure as code tools such as Terraform or Ansible
  • Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo
  • Proven ability to troubleshoot complex distributed systems
  • Bachelor's degree in computer science, software engineering, or relevant disciplines
  • Ability to thrive in a fast-paced startup environment

Benefits For Senior Site Reliability Engineer Cloud Platform

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
  • Competitive compensation (cash + equity)
  • Regular bonus and equity refresh opportunities
  • Medical, dental, and vision insurance
  • Paid time off, including vacation, sick leave, and global reset/wellbeing days
  • Generous 401(k) and regional retirement plans

Interested in this job?

Jobs Related To Zilliz Senior Site Reliability Engineer Cloud Platform

Senior DevOps Engineer

Senior DevOps Engineer role at Mastercard in Pune, focusing on infrastructure automation, cloud platforms, and container orchestration to support global payment systems.

Production Service Developer 3

Senior Tech Architect role at Oracle focusing on Citrix and Windows environments, requiring 5-7 years experience in system administration and automation.

Senior Software Engineer, DevOps

Senior DevOps Engineer role at Capital One, focusing on cloud infrastructure, automation, and modern DevOps practices using Python, Golang, and AWS technologies.

Production Service Developer 3

Senior Tech Architect role responsible for Citrix and Windows-based environments architecture, implementation guidance, and technical leadership.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at SingleStore focused on Kubernetes and cloud infrastructure for managed database service.