Taro Logo

Site Reliability Engineer

A Fortune 500 company providing payments solutions for credit, debit, prepaid and merchant services, serving over 3 million companies worldwide.
Lindon, UT, USASalt Lake City, UT, USAAtlanta, GA, USA
Site Reliability
Mid-Level Software Engineer
Hybrid
5,000+ Employees
3+ years of experience
Finance

Job Description

Global Payments, a Fortune 500 company, is seeking a Site Reliability Engineer to join their API Operations team. This role is crucial for maintaining the stability and performance of their payment processing infrastructure that serves millions of transactions worldwide. The position offers a unique opportunity to work with cutting-edge payment technology while ensuring the reliability of systems that process transactions for over 3 million companies and 600 million cardholders.

The ideal candidate will be responsible for monitoring, diagnosing, and resolving production incidents across Apigee API Implementations. This role combines traditional SRE practices with specialized knowledge in payment systems, making it an excellent opportunity for engineers interested in both technical operations and fintech. You'll work with various teams including API engineering, Developer Services, and Product Management to maintain and improve system reliability.

The position requires strong technical skills in Python, cloud infrastructure (AWS/GCP), and monitoring tools, combined with excellent problem-solving abilities. You'll be part of a dynamic team that values learning and innovation, working on implementing cutting-edge solutions while maintaining high availability standards. The hybrid work environment offers flexibility while maintaining collaborative opportunities across multiple US locations.

This role is ideal for someone who thrives in a fast-paced environment, enjoys solving complex technical challenges, and wants to make a significant impact in the global payments technology landscape. The position offers exposure to enterprise-scale systems and the opportunity to work with modern technologies while contributing to the stability of critical financial infrastructure.

Last updated 2 days ago

Responsibilities For Site Reliability Engineer

  • Serve as the first line of defense for production incidents
  • Monitor system health and performance of deployed APIs
  • Track and investigate issues related to latency, failures, or broken integrations
  • Collaborate with platform engineers to implement observability, logging, and alerting
  • Build diagnostic tools, runbooks, and automated workflows
  • Maintain knowledge bases and playbooks
  • Partner with governance and compliance teams
  • Contribute to retrospectives and continuous improvement efforts

Requirements For Site Reliability Engineer

Python
Kubernetes
Kafka
  • 3+ years of experience in production support, SRE, or DevOps
  • Strong understanding of cloud infrastructure (AWS, GCP)
  • Proficiency in Python or shell scripting
  • Strong analytical, communication, and incident management skills
  • Bachelor's degree in Computer Science, Engineering, or related field (preferred)
  • Experience with CI/CD tools and Alerts/Monitoring automation
  • Familiarity with API Integrations

Related Jobs

Site Reliability Engineer

Site Reliability Engineer position at Global Payments, focusing on API operations and infrastructure management with hybrid work options in multiple US locations.

Site Reliability Engineer II - CTJ - Top Secret

Site Reliability Engineer II position at Microsoft working on Defender security products for government clouds, requiring Top Secret clearance and offering competitive compensation with comprehensive benefits.

Site Reliability Engineer (SRE) – Infrastructure and Observability

Site Reliability Engineer role at Worldpay focusing on infrastructure and observability, requiring 3+ years of experience in SRE/DevOps and expertise in monitoring tools and automation.

Site Reliability Engineer II - CTJ - Poly

Microsoft is hiring a Site Reliability Engineer II for their Identity team to support Azure Government Secret and Top-Secret Clouds, offering hybrid work and comprehensive benefits.

Software Developer II, Site Reliability

Site Reliability Developer position at Google focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.