Staff Site Reliability Engineer

Global leader in data-first contract lifecycle management (CLM) software, providing flexible Data-first Agreement Platform for managing contract processes.
United States
$150,000 - $220,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
8+ years of experience
Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Site Reliability Engineer (Storage)

Senior Site Reliability Engineer position at LinkedIn focusing on storage infrastructure, requiring 5+ years of experience in web operations and expertise in storage systems.

Senior Site Reliability Engineer (Storage)

Senior Site Reliability Engineer position at LinkedIn focusing on storage infrastructure, requiring 5+ years of experience in large-scale web operations and expertise in SDNAS and GPFS.

Senior Site Reliability Engineer (Storage)

Senior Site Reliability Engineer position at LinkedIn focusing on storage systems, requiring expertise in UNIX operations, Python/Java programming, and experience with Software-Defined NAS and GPFS.

Senior Site Reliability Engineer - ASE iCloud Cross Functional

Senior Site Reliability Engineer role at Apple working on iCloud services, focusing on system reliability, cross-team collaboration, and service improvement initiatives.

Senior Site Reliability Engineer - Apple Services Engineering (ASE) / iCloud

Senior Site Reliability Engineer role at Apple Services Engineering, focusing on building and maintaining highly available systems that power Apple's customer-facing services.

Description For Staff Site Reliability Engineer

Agiloft, the trusted leader in contract lifecycle management (CLM) software, is seeking a Staff Site Reliability Engineer to join their team. As a pioneer in data-first contract management, Agiloft has earned recognition from top analysts like Gartner, Forrester, and IDC. The company boasts an impressive customer satisfaction rate with nearly 100% of new customers satisfied with initial implementations and a 97% annual renewal rate.

The Staff SRE role offers an opportunity to work with cutting-edge technology in a company that values diversity, inclusion, and work-life balance. You'll be responsible for developing and implementing highly reliable and scalable systems, working closely with cross-functional teams. The position requires expertise in cloud operations, monitoring tools, and security practices, with opportunities to lead complex projects and mentor team members.

Agiloft's culture emphasizes the philosophy that "EX = CX" - excellent employee experience leads to excellent customer experience. The company supports multiple Employee Resource Groups and offers benefits like floating holidays and quarterly wellness days. They're committed to building a diverse workplace where individuals from all backgrounds can thrive and bring their authentic selves to work.

This role is perfect for an experienced SRE professional who wants to make a significant impact in a growing, successful company that's at the forefront of the CLM market. You'll have the opportunity to shape the reliability and scalability of systems that are becoming increasingly critical for organizations worldwide.

Last updated 3 months ago

Responsibilities For Staff Site Reliability Engineer

  • Define and enforce SRE best practices and standards
  • Architect and implement highly reliable and scalable systems
  • Lead complex post-incident reviews and implement systemic improvements
  • Collaborate with product and engineering teams to set reliability targets
  • Manage high-impact incidents and coordinate incident response
  • Contribute to budget planning and resource allocation
  • Lead efforts to establish disaster recovery strategies
  • Provide technical leadership and mentorship to the SRE team
  • Continuously track and improve metrics to optimize software delivery and operational performance
  • Participate in on-call rotation

Requirements For Staff Site Reliability Engineer

Python
Linux
Kubernetes
  • 8-10 years of experience in similar or related role
  • Bachelor's degree in Computer Science, Information Technology, or related field
  • In-depth knowledge of Cloud Ops technologies including AWS and Terraform
  • Advanced knowledge in Linux operating systems
  • Expertise in setting up and managing monitoring tools
  • In-depth understanding of monitoring and alerting systems, networking principles
  • Strong understanding of incident management
  • Advanced experience with security measures and practices
  • Strong analytical and problem-solving skills
  • Strong understanding of programming/scripting languages
  • Excellent communication and teamwork skills

Benefits For Staff Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
  • Floating holidays
  • Quarterly wellness day
  • Employee Resource Groups (ERGs)
  • Healthy work/life balance

Interested in this job?