Senior Site Reliability Engineer (Remote - India)

A company providing cloud-native data infrastructure solutions.
Site Reliability
Senior Software Engineer
Remote
10+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer (Remote - India)

Join Dremio as a Senior Site Reliability Engineer in a fully remote position based in India. In this role you will maintain and improve mission-critical systems in a cloud-native environment, working with Kubernetes, service meshes, and modern observability tools across multiple cloud providers (AWS, GCP, Azure). You will help shape large-scale distributed systems used by global enterprises while working in a collaborative environment.

You'll be responsible for leading continuous improvements in cloud infrastructure, implementing robust SLIs/SLOs, and driving reliability engineering practices throughout the organization. The role combines technical expertise with leadership opportunities and offers a chance to make a significant impact on system reliability and scalability.

Benefits include competitive compensation, flexible work arrangements, comprehensive healthcare, and a strong emphasis on professional development. This is an ideal opportunity for experienced SRE professionals looking to work with modern cloud technologies in a fast-paced environment.

Last updated 13 days ago

Responsibilities For Senior Site Reliability Engineer (Remote - India)

  • Lead continuous improvements in Kubernetes usage, GitOps deployment strategies, and service mesh configuration across cloud platforms
  • Extend cross-cloud networking and connectivity solutions including VPNs, BGP, and partner interconnects
  • Collaborate with Engineering teams to ensure systems are production-ready
  • Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Drive observability efforts by enhancing logging, metrics, tracing, and system profiling
  • Optimize and debug code, and automate recurring tasks
  • Advocate for reliability engineering practices
  • Participate in on-call rotation and lead incident response
  • Promote scalable practices and support continuous delivery

Requirements For Senior Site Reliability Engineer (Remote - India)

Kubernetes
Python
Go
Java
  • 10+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure
  • Advanced proficiency in Kubernetes, Istio, Terraform, Terragrunt, and ArgoCD/Flux
  • Strong understanding of cloud-native networking, VPNs, and multi-cloud connectivity solutions
  • Demonstrated hands-on experience with cloud platforms including GCP, AWS, and Azure
  • Skilled in Python or Go, with the ability to debug and review Java
  • Proven ability to design, analyze, and troubleshoot large-scale distributed architectures
  • Strong communication, ownership, and problem-solving abilities
  • Experience managing Kubernetes clusters at large scale (1,000+ nodes)
  • Experience developing and managing production-grade SLIs/SLOs

Benefits For Senior Site Reliability Engineer (Remote - India)

Medical Insurance
Dental Insurance
Vision Insurance
  • Competitive compensation package
  • Flexible hybrid work environment with Workplace Wednesdays
  • Catered lunches or meal credits on in-office days
  • Local social events
  • Generous paid time off
  • Wellness initiatives
  • Comprehensive healthcare, including medical, dental, and vision coverage
  • Professional development opportunities
  • Support for continued learning
