Taro Logo

Senior Site Reliability Engineer

World's leading provider of enterprise open source software solutions, delivering Linux, cloud, container, and Kubernetes technologies.
Site Reliability
Senior Software Engineer
Hybrid
5,000+ Employees
5+ years of experience
Enterprise SaaS

Job Description

Red Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate their OpenShift managed cloud services. This role combines software engineering and operations expertise to ensure the reliability and scalability of Red Hat's enterprise Kubernetes distribution. As an SRE, you'll tackle complex challenges unique to managed cloud services, working with cutting-edge technologies and contributing to large-scale distributed systems.

The position offers opportunities to work with a global team in an open, transparent environment that values diverse perspectives. Red Hat emphasizes a blameless culture focused on continuous improvement, where individual contributions have high visibility and impact. The role involves both hands-on coding and operational responsibilities, including participating in on-call rotations and incident response.

Key responsibilities include developing automation solutions, improving monitoring systems, enabling customer self-service, and contributing to the scalability and reliability of OpenShift services. You'll work with modern technologies including Kubernetes, various programming languages, and major cloud platforms.

The ideal candidate brings a strong background in Linux systems administration, cloud platforms, and software development. Experience with monitoring systems, configuration management, and delivering hosted services is essential. Red Hat values both technical expertise and soft skills, emphasizing collaboration, problem-solving, and effective communication with customers and team members.

This hybrid role offers the flexibility of remote work while maintaining connection with the team in Bangalore. It's an excellent opportunity for experienced engineers looking to work with enterprise-grade open source technologies while contributing to one of the leading companies in the cloud-native space.

Last updated 2 days ago

Responsibilities For Senior Site Reliability Engineer

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review
  • Help and develop peers' capabilities through knowledge sharing and mentoring
  • Participate in regular on-call schedule
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from Global Support team
  • Work within a small agile team to develop and improve SRE software

Requirements For Senior Site Reliability Engineer

Python
Go
Java
Kubernetes
Linux
  • Bachelor's degree in Computer Science or related technical field
  • 5+ years experience managing Linux servers on cloud providers
  • 3+ years experience with enterprise systems monitoring
  • 3+ years experience with configuration management software
  • 2+ years programming experience with object-oriented languages
  • 2+ years experience delivering hosted services
  • Solid understanding of TCP/IP networking and common protocols
  • Strong communication skills and customer presentation experience
  • Experience with Kubernetes and docker containers is a plus

Related Jobs

Senior Software Engineer (Site Reliability Engineer)

Senior Site Reliability Engineer position at Maersk focusing on infrastructure automation, observability, and reliability engineering using Golang, Python, and modern DevOps tools.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Microsoft's Windows Cloud division in Hyderabad, focusing on Windows 365 and Azure Virtual Desktop platform reliability and automation.

Senior Site Reliability Engineer, DGX Cloud

Senior SRE position at NVIDIA focusing on managing and optimizing DGX Cloud clusters for AI workloads across major cloud providers.

Site Reliability Engineer - Career

Senior Site Reliability Engineer position at Equifax in Pune, focusing on cloud infrastructure, automation, and system reliability with 5+ years of experience required.

Senior Site Reliability Engineer, AI Infrastructure

Senior Site Reliability Engineer position at NVIDIA, focusing on maintaining and optimizing AI infrastructure systems across global cloud platforms.