Operations Site Reliability Engineer

A global technology leader that designs, develops and supplies semiconductor and infrastructure software solutions.
Bristol, UK
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS

Description For Operations Site Reliability Engineer

Broadcom, a leading global technology company specializing in semiconductor and infrastructure software solutions, is seeking a Senior Site Reliability Engineer for their Bristol, UK location. This role is crucial for maintaining the reliability and performance of production services across their global infrastructure.

The position requires a seasoned professional with 5+ years of Linux systems administration experience and strong expertise in cloud platforms like AWS or GCP. The ideal candidate will be responsible for monitoring system availability, implementing automation solutions, and ensuring optimal performance of critical applications.

This is an excellent opportunity for an experienced SRE who enjoys working with modern technologies including Kubernetes, Docker, and various automation tools. The role involves both technical depth in systems administration and cross-functional collaboration with various stakeholders. You'll be part of a team that handles critical infrastructure and contributes to the continuous improvement of systems and processes.

The compensation package is highly competitive, including an equity package, comprehensive medical insurance, and additional benefits like ESPP and life assurance. The role requires participation in on-call rotations for weekend and holiday support, making it suitable for dedicated professionals who thrive in a high-responsibility environment.

Working at Broadcom means joining a global technology leader with a strong focus on innovation and excellence. The company offers a collaborative environment where you can make significant impacts on large-scale systems while working with cutting-edge technologies. This role is perfect for someone who combines strong technical skills with excellent communication abilities and enjoys solving complex infrastructure challenges.

Last updated 11 minutes ago

Responsibilities For Operations Site Reliability Engineer

  • Monitor availability and performance of production services
  • Respond to stakeholder requests within agreed SLOs
  • Drive automation to reduce failures and manual tasks
  • Perform systems administration activities across multiple platforms
  • Coordinate and communicate with stakeholders during incidents
  • Perform daily shift handovers across multiple geographies
  • Support maintenance activities and critical systems
  • Create and maintain troubleshooting documentation
  • Contribute to application/infrastructure releases and changes
  • Manage patching and upgrades of existing applications
  • Provide feedback and coaching to upstream teams

Requirements For Operations Site Reliability Engineer

Linux
Kubernetes
  • Degree in Systems Engineering, Computer Science or related fields
  • 5+ years of experience administering Linux systems
  • Strong hands-on experience with Linux distributions
  • 2+ years operational experience with AWS or GCP
  • Experience with automation platforms
  • Familiarity with deployment tools like Ansible Tower and Jenkins
  • Experience in global infrastructure deployments
  • Proficiency with Ansible and Terraform
  • Strong networking knowledge
  • Knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker
  • Experience in high-availability environments
  • Proficiency in scripting (Perl, shell, Ruby or Python)
  • Experience with monitoring systems optimization

Benefits For Operations Site Reliability Engineer

Medical Insurance
Equity
  • Highly competitive salary
  • Generous bonus scheme
  • Equity package
  • Competitive company pension
  • Employee stock purchase plan (ESPP)
  • Private Medical Insurance (Individual or family)
  • Life Assurance scheme (up to 4x salary)
  • Ample on-site parking

Interested in this job?

Jobs Related To Broadcom Operations Site Reliability Engineer

Sr. Site Reliability Engineer

Senior Site Reliability Engineer position at Broadcom focusing on cloud infrastructure and SaaS platform operations.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Oracle, focusing on cloud infrastructure services and automation with 3-5+ years experience required.

Site Reliability Engineer - Database

Senior Site Reliability Engineer position at Oracle focusing on Database Autonomous Recovery Service, requiring TS/SCI clearance and extensive cloud infrastructure experience.

Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Senior Site Reliability Engineer position at Netflix focusing on live streaming platform reliability, cloud infrastructure, and scalability solutions.

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Senior Site Reliability Engineer position at Bank of America in Sydney, focusing on cloud platform reliability, automation, and DevOps practices.