Site Reliability Engineer 3

Granicus provides cloud-based solutions for government communications, website design, meeting management, and digital services, serving over 5,500 government agencies globally.
Puerto Rico
Site Reliability
Staff Software Engineer
Hybrid
1,000 - 5,000 Employees
5+ years of experience
Enterprise SaaS · Government

Job Description

Granicus, a leading GovTech company serving over 5,500 government agencies worldwide, is seeking a Senior Site Reliability Engineer (SRE) to join its Platform Engineering & DevOps team. The role combines technical expertise with operational excellence and requires 5+ years of experience in SRE, DevOps, or Software Engineering. It involves managing large-scale systems, implementing automation, and ensuring high availability of services.

The ideal candidate will have strong knowledge of Linux/Unix systems, cloud platforms (AWS, Azure, or Google Cloud), and modern DevOps tools including Kubernetes, Docker, and monitoring solutions such as Prometheus, Grafana, and Splunk. They will be responsible for production support, system maintenance, automation, and implementing best practices for reliability and security.

What makes this role unique is its impact on government digital transformation, working with a company that powers communications and digital services for hundreds of millions of citizens. The position offers flexibility with remote work options, though it requires participation in 24/7 on-call rotations.

The company culture emphasizes diversity, inclusion, and work-life balance, featuring regular engagement with leadership through CEO coffee sessions and various employee resource groups. This is an excellent opportunity for an experienced SRE to make a meaningful impact while working with modern technologies in a mission-driven organization.

Last updated 19 days ago

Responsibilities For Site Reliability Engineer 3

  • Provide production support as part of an on-call rotation
  • Monitor and maintain system health and performance
  • Develop and maintain automation scripts and tools
  • Perform incident management and root cause analysis
  • Implement system improvements for reliability and scalability
  • Collaborate with software engineers on application requirements
  • Create and maintain technical documentation
  • Assist in capacity planning
  • Implement security best practices

Requirements For Site Reliability Engineer 3

Python · Ruby · Go · Java · Linux · Kubernetes

  • 5+ years of experience in SRE, DevOps, or Software Engineering
  • Bachelor's or Master's degree in Computer Science, IT, or related field
  • Strong knowledge of Linux/Unix systems and cloud services (AWS, Azure, or Google Cloud)
  • Proficiency in scripting languages (Python, Bash, Ruby)
  • Experience with monitoring tools (Prometheus, Grafana, Splunk)
  • Knowledge of containerization (Docker, Kubernetes)
  • Experience with database management (SQL, NoSQL)
  • Strong analytical and problem-solving skills
  • Excellent verbal and written communication skills
  • Ability to work in a 24/7 on-call rotation, including weekends

Benefits For Site Reliability Engineer 3

  • Remote-first company with globally distributed workforce
  • Employee Resource Groups
  • Coffee sessions with CEO
  • Microsoft Teams communities focused on wellness, art, and more
