Taro Logo

Site Reliability Engineer

AI and machine learning platform that digitizes and contextualizes unstructured trade documents to unlock real-time shipment visibility and drive smart analytics for global trade.
DevOps
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Enterprise SaaS · Logistics

Description For Site Reliability Engineer

KlearNow.AI is revolutionizing global trade with its AI-powered platform that streamlines customs clearance, freight visibility, and document automation. As a Site Reliability Engineer, you'll join a fast-growing SaaS company with operations across the U.S., Canada, U.K., Spain, and the Netherlands. The role focuses on designing and maintaining scalable infrastructure, implementing monitoring solutions, and ensuring system reliability for global operations.

You'll work with cutting-edge technologies including DataDog, Grafana, Prometheus, and X-Ray, while being responsible for critical infrastructure decisions that impact the company's global user base. The position requires strong expertise in Linux administration, automation, and cloud technologies, with a focus on building robust, scalable systems.

The company offers a people-first culture that values personal growth and well-being, with opportunities to make a significant impact on the future of global trade. You'll be part of a diverse, inclusive workplace that encourages bold thinking and problem-solving. The role combines technical challenges with business impact, as you'll work closely with various teams to ensure system reliability and performance.

This is an excellent opportunity for an experienced SRE professional who wants to contribute to transforming the logistics industry through technology. The hybrid work environment offers flexibility while maintaining collaborative opportunities with a dynamic team. If you're passionate about using technology to solve complex global trade challenges and want to be part of a rapidly expanding organization, this role offers the perfect blend of technical depth and business impact.

Last updated 15 days ago

Responsibilities For Site Reliability Engineer

  • Design and maintain infrastructure for scaling to large global user groups
  • Build automation and robust monitoring/alerting for production systems
  • Debug production issues across services and tech stack
  • Serve as escalation point and provide 24x7 on-call support
  • Participate in builds, integration, deployment, and automation for cloud environments
  • Define metrics and build alerting systems
  • Create and execute load testing plans
  • Support internal and customer external performance indicators and SLAs
  • Provide reporting to various teams
  • Work closely with DevOps team

Requirements For Site Reliability Engineer

Linux
Python
Kubernetes
  • 3-5 years of SRE experience with real-time, concurrent global user traffic
  • Excellent Linux system administration and automation skills (Python, Bash)
  • Experience setting up monitoring tools within AWS or DataDog/Prometheus+Grafana
  • Passion for implementing best practices
  • Ability to solve mission critical services issues
  • Excellent problem solving and troubleshooting skills
  • Excellent written and oral communication skills
  • Bachelor's degree in computer science strongly preferred

Jobs Related To KlearNow.AI Site Reliability Engineer