Operations Site Reliability Engineer

Broadcom

A global technology leader that designs, develops and supplies semiconductor and infrastructure software solutions.

Bristol, UK

Site Reliability

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Enterprise SaaS

Description For Operations Site Reliability Engineer

Broadcom, a leading global technology company specializing in semiconductor and infrastructure software solutions, is seeking a Senior Site Reliability Engineer for their Bristol, UK location. This role is crucial for maintaining the reliability and performance of production services across their global infrastructure.

The position requires a seasoned professional with 5+ years of Linux systems administration experience and strong expertise in cloud platforms like AWS or GCP. The ideal candidate will be responsible for monitoring system availability, implementing automation solutions, and ensuring optimal performance of critical applications.

This is an excellent opportunity for an experienced SRE who enjoys working with modern technologies including Kubernetes, Docker, and various automation tools. The role involves both technical depth in systems administration and cross-functional collaboration with various stakeholders. You'll be part of a team that handles critical infrastructure and contributes to the continuous improvement of systems and processes.

The compensation package is highly competitive, including an equity package, comprehensive medical insurance, and additional benefits like ESPP and life assurance. The role requires participation in on-call rotations for weekend and holiday support, making it suitable for dedicated professionals who thrive in a high-responsibility environment.

Working at Broadcom means joining a global technology leader with a strong focus on innovation and excellence. The company offers a collaborative environment where you can make significant impacts on large-scale systems while working with cutting-edge technologies. This role is perfect for someone who combines strong technical skills with excellent communication abilities and enjoys solving complex infrastructure challenges.

Last updated 11 minutes ago

Responsibilities For Operations Site Reliability Engineer

Monitor availability and performance of production services
Respond to stakeholder requests within agreed SLOs
Drive automation to reduce failures and manual tasks
Perform systems administration activities across multiple platforms
Coordinate and communicate with stakeholders during incidents
Perform daily shift handovers across multiple geographies
Support maintenance activities and critical systems
Create and maintain troubleshooting documentation
Contribute to application/infrastructure releases and changes
Manage patching and upgrades of existing applications
Provide feedback and coaching to upstream teams

Requirements For Operations Site Reliability Engineer

Linux

Kubernetes

Degree in Systems Engineering, Computer Science or related fields
5+ years of experience administering Linux systems
Strong hands-on experience with Linux distributions
2+ years operational experience with AWS or GCP
Experience with automation platforms
Familiarity with deployment tools like Ansible Tower and Jenkins
Experience in global infrastructure deployments
Proficiency with Ansible and Terraform
Strong networking knowledge
Knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker
Experience in high-availability environments
Proficiency in scripting (Perl, shell, Ruby or Python)
Experience with monitoring systems optimization

Benefits For Operations Site Reliability Engineer

Medical Insurance

Equity

Highly competitive salary
Generous bonus scheme
Equity package
Competitive company pension
Employee stock purchase plan (ESPP)
Private Medical Insurance (Individual or family)
Life Assurance scheme (up to 4x salary)
Ample on-site parking

Broadcom

A global technology leader that designs, develops and supplies semiconductor and infrastructure software solutions.

Bristol, UK

Site Reliability

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Enterprise SaaS

Interested in this job?

Jobs Related To Broadcom Operations Site Reliability Engineer

Sr. Site Reliability Engineer

Broadcom

Senior Site Reliability Engineer position at Broadcom focusing on cloud infrastructure and SaaS platform operations.

Senior Site Reliability Engineer

Oracle

Senior Site Reliability Engineer position at Oracle, focusing on cloud infrastructure services and automation with 3-5+ years experience required.

Site Reliability Engineer - Database

Oracle

Senior Site Reliability Engineer position at Oracle focusing on Database Autonomous Recovery Service, requiring TS/SCI clearance and extensive cloud infrastructure experience.

Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Netflix

Senior Site Reliability Engineer position at Netflix focusing on live streaming platform reliability, cloud infrastructure, and scalability solutions.

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Bank of America

Senior Site Reliability Engineer position at Bank of America in Sydney, focusing on cloud platform reliability, automation, and DevOps practices.