Kontakt.io is revolutionizing healthcare operations with their innovative platform that leverages AI, RTLS, and EHR data to optimize care delivery. As a Senior Site Reliability Engineer, you'll play a crucial role in ensuring the reliability and performance of their cloud-based, real-time platform that serves healthcare facilities with a commitment to 99.99% uptime.
The position offers an opportunity to work on mission-critical systems that directly impact healthcare delivery efficiency. You'll be responsible for designing and implementing self-healing, fault-tolerant systems, managing containerized environments, and developing robust monitoring solutions using cutting-edge technologies like Prometheus, Grafana, and OpenTelemetry.
The role combines technical challenges with meaningful impact - you'll be working on systems that help reduce waste, optimize resources, and improve patient care while delivering 10X ROI to healthcare facilities. You'll join a high-performing team of engineers, AI experts, and healthcare innovators solving real-world challenges.
Key technical aspects include working with AWS cloud infrastructure, Kubernetes orchestration, infrastructure as code using Terraform, and implementing comprehensive observability solutions. The position requires expertise in distributed systems, security compliance (HIPAA, SOC 2), and automated deployment processes.
This remote position offers the chance to work on the East Coast/New York City, collaborating with cross-functional teams to align SRE initiatives with business goals. The role requires 5+ years of experience in SRE or Cloud Infrastructure, with a strong background in scaling high-traffic, mission-critical platforms.
If you're passionate about using technology to improve healthcare operations and want to work with cutting-edge automation and observability tools while ensuring critical healthcare services remain available 24/7, this role offers an excellent opportunity to make a significant impact in the healthcare technology sector.