Founding Site Reliability Engineer

Voice AI platform helping developers bridge the gap between raw models and voice AI in production.
$150,000 - $300,000
DevOps
Staff Software Engineer
In-Person
1 - 10 Employees
3+ years of experience
AI

Description For Founding Site Reliability Engineer

Apple Intelligence and Google Gemini are about to onboard 4 billion people to voice assistants that actually talk like people. This marks a new beginning for voice as the world's default interface. We will all want voice everywhere. Businesses are not ready. Products are not ready.

We're building the infrastructure that helps developers close the gap between the raw model and voice AI in production. We launched in March, scaled revenue, and raised a Series A from a top-tier firm this past week. We're a small team of 6 and growing. We're building out the founding team in-person in SF to scale what's working.

As our Founding Site Reliability Engineer, you'll play a crucial role in ensuring our real-time distributed systems remain robust, scalable, and reliable. You'll be responsible for:

  • Managing 24/7 on-call rotation and incident response
  • Maintaining 99.99% system uptime for our distributed infrastructure
  • Automating infrastructure management and deployment processes
  • Optimizing real-time system performance for AI model integration
  • Establishing incident response protocols
  • Building scalable infrastructure for rapid growth

Required technical skills include:

  • Experience with Kubernetes, Terraform, Ansible
  • Cloud platforms (AWS/GCP/Azure)
  • Programming in Python/Go/Bash
  • Distributed systems and real-time infrastructure
  • Infrastructure-as-code expertise
  • Observability tools (Prometheus, Grafana, ELK stack)

We're looking for a self-starter who can proactively identify and execute high-value tasks. Bonus points for previous experience as a technical founder in B2B SaaS or telecom background.

Last updated 3 hours ago

Responsibilities For Founding Site Reliability Engineer

  • Participate in an on-call rotation to ensure 24/7 availability and rapid incident response
  • Ensure 99.99% system uptime and reliability for our real-time distributed infrastructure
  • Automate infrastructure management, reducing deployment and recovery times
  • Optimize real-time system performance, enabling seamless integration with AI models
  • Establish robust incident response protocols, minimizing downtime
  • Build infrastructure and processes that scale effectively with rapid company growth

Requirements For Founding Site Reliability Engineer

Kubernetes
Python
Go
  • Proven experience with distributed systems, real-time infrastructure, and infrastructure-as-code
  • Proficiency in at least one scripting or programming language (Python, Go, Bash)
  • Familiarity with observability tools like Prometheus, Grafana, or ELK stack
  • Self-starter: You proactively identify and execute the highest-value tasks
  • 3+ years of experience

Interested in this job?

Jobs Related To Vapi Founding Site Reliability Engineer

Founding Infrastructure Software Engineer

Founding Infrastructure Engineer role at Vapi, building scalable voice AI infrastructure for developers.

Staff Software Engineer (Developer Productivity)

Staff Software Engineer position at Okta focusing on Developer Productivity, building and maintaining automated build and testing infrastructure with competitive compensation and benefits.

Sr. Engineer, Infrastructure, Hardware Compute Group

Senior Infrastructure Engineer position at Amazon Lab126, focusing on build systems and infrastructure for AI hardware development team.

Staff Operational Technology Engineer

Staff Operational Technology Engineer position at Smith & Nephew, focusing on operational technology and industrial systems in healthcare manufacturing.

Staff Operational Technology Engineer

Staff Operational Technology Engineer position at Smith & Nephew in Memphis, focusing on industrial automation and operational technology systems.