Taro Logo

Site Reliability Engineer

A technology company focused on AI solutions for manufacturing and the Aerospace and Defense Industrial Base.
$100,000 - $150,000
Site Reliability
Senior Software Engineer
Hybrid
4+ years of experience
AI · Enterprise SaaS · Aerospace

Description For Site Reliability Engineer

CADDi is seeking a Site Reliability Engineer to join their team in Chicago, focusing on building and securing infrastructure for their AI platform. This role is crucial for safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. As an SRE, you'll have significant ownership of US operations while working with a global team of 150+ engineers in a dynamic startup environment.

The position requires expertise in cloud infrastructure (primarily GCP), Infrastructure as Code with Terraform, and strong DevSecOps practices. You'll be responsible for designing and implementing highly available, scalable systems, managing CI/CD pipelines, and ensuring robust security measures. The role involves working with cutting-edge technologies including Kubernetes, containerization, and modern monitoring tools like Prometheus and Grafana.

This is an excellent opportunity for an experienced SRE who wants to make a significant impact in a growing company. The role offers competitive compensation ($100,000-$150,000), comprehensive benefits, and the chance to work on challenging problems in a regulated industry. The position requires 4+ years of relevant experience and a strong security-first mindset.

CADDi provides an inclusive work environment with excellent benefits, including fully covered health insurance, equity opportunities, 401k matching, and generous time off. The hybrid work arrangement offers flexibility while maintaining collaborative opportunities with the team. This role is perfect for someone who wants to contribute to the foundation of a growing technology company while working with advanced AI systems and manufacturing technologies.

Last updated a day ago

Responsibilities For Site Reliability Engineer

  • Design, implement, and operate highly available, scalable infrastructure on GCP and multi-cloud deployments
  • Lead Terraform-based infrastructure development with security best practices
  • Build robust CI/CD pipelines supporting developers and AI engineers
  • Implement monitoring strategies using Prometheus, Grafana, and ELK
  • Navigate regulatory requirements for U.S. Aerospace and Defense
  • Reduce operational toil through automation
  • Manage disaster recovery and ensure cost-effectiveness

Requirements For Site Reliability Engineer

Python
Go
Kubernetes
Linux
  • Bachelor's degree in Computer Science, Engineering, or equivalent experience
  • 4+ years in Site Reliability Engineering, DevOps, or Systems Engineering
  • Deep Terraform and Infrastructure as Code expertise
  • Proficiency in Python and other scripting languages
  • Modern CI/CD experience
  • Strong cloud platform experience, preferably GCP
  • Experience with containers and Kubernetes
  • Monitoring tools experience
  • Regulated industry experience
  • Security-first development mindset
  • Strong problem-solving and communication skills

Benefits For Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
Commuter Benefits
  • 100% company-covered employee comprehensive health insurance
  • Stock options plan
  • 401k plan with 4% company match
  • 15 days paid time off
  • 5 sick days
  • 10 company holidays
  • Company lunches and events
  • Professional development opportunities
  • Commuter and parking benefits
  • Referral bonuses

Interested in this job?

Jobs Related To CADDi Site Reliability Engineer