Site Reliability Engineer (Expert-level)

France
Site Reliability
Staff Software Engineer
Remote
Enterprise SaaS

Description For Site Reliability Engineer (Expert-level)

Last updated 11 days ago

Responsibilities For Site Reliability Engineer (Expert-level)

  • Managing global infrastructure
  • Monitoring KPIs
  • Automating processes
  • Planning for scalability
  • Working with distributed data stores
  • Managing and monitoring system infrastructure

Requirements For Site Reliability Engineer (Expert-level)

PostgreSQL
Cassandra
Kafka
Kubernetes
  • Experience with cloud provider GCP
  • Experience with configuration management tools Terraform and Ansible
  • Practical experience with distributed data stores (PostgreSQL, Cassandra, and Kafka)
  • Hands-on proficiency with modern monitoring tools (Prometheus and Grafana)
  • Ability to manage global infrastructure
  • Experience with monitoring KPIs
  • Experience with process automation
  • Experience with scalability planning

Interested in this job?

Jobs Related To Sinch Site Reliability Engineer (Expert-level)

Site Reliability Engineer (SRE) - Object Storage

Senior SRE position at Apple focusing on distributed storage systems, offering competitive compensation and the opportunity to impact millions of users.

Site Reliability Engineer

Remote Staff Site Reliability Engineer position at Axon, building and maintaining mission-critical cloud infrastructure for public safety solutions.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Lead Site Reliability Engineering team at Google Cloud, managing distributed systems and service reliability while driving technical excellence and team development.