Senior Site Reliability Engineer II (Kafka)

Leading customer engagement platform that empowers brands to be absolutely engaging, helping marketers collect and act on customer data in real-time across channels.
Ontario, Canada
DevOps
Senior Software Engineer
Remote
1,000 - 5,000 Employees
5+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer II (Kafka)

Braze, a leading customer engagement platform, is seeking a Senior Site Reliability Engineer II specializing in Kafka to join its remote team. This role combines software engineering and systems administration to ensure site reliability and infrastructure scalability. The position involves managing Kafka clusters that operate at massive scale, processing hundreds of billions of data points monthly for over 3.3 billion monthly active users.

The ideal candidate will work at the intersection of infrastructure and engineering, focusing on Kafka streaming applications, performance tuning, and automation. You'll be responsible for creating infrastructure as code, developing deployment pipelines, and ensuring high availability of systems. The role requires deep expertise in Kafka, along with experience in modern DevOps tools like Kubernetes, Terraform, and Docker.

Braze offers a collaborative, transparent culture recognized as a Great Place to Work® across multiple regions. The company provides comprehensive benefits, including equity participation, flexible PTO, and strong professional development support. You'll be part of a global team working on cutting-edge customer engagement technology, with offices across major tech hubs worldwide.

This position offers the opportunity to work remotely while tackling challenging technical problems at scale. The role combines hands-on technical work with strategic infrastructure planning, making it ideal for engineers who enjoy both deep technical work and broader system design. If you're passionate about reliability engineering and want to work with modern technologies, this role is a chance to make a significant impact.

Last updated 4 days ago

Responsibilities For Senior Site Reliability Engineer II (Kafka)

  • Partner with engineering teams on architecture and debugging
  • Create infrastructure as code using Chef, Terraform, and Kubernetes
  • Develop deployment pipelines using Docker and Kubernetes
  • Manage incidents and participate in the PagerDuty on-call rotation
  • Monitor and troubleshoot Kafka streaming applications
  • Set up alerting and dashboards for high-availability pipelines
  • Scale Kafka clusters and manage schema evolution

Requirements For Senior Site Reliability Engineer II (Kafka)

Kafka
Kubernetes
MongoDB
Redis
Ruby
  • 5+ years of experience as a Software, DevOps, or Site Reliability Engineer
  • 3+ years of Data Streaming Reliability Engineering
  • 3+ years of Kafka performance tuning & automation
  • Strong programming skills (Ruby and/or Go preferred)
  • Experience with Docker, Kubernetes, Terraform
  • Experience with MongoDB, Redis, Kafka, Postgres
  • Knowledge of Linux and Unix Shell
  • Strong collaboration and documentation skills

Benefits For Senior Site Reliability Engineer II (Kafka)

401k
Dental Insurance
Education Budget
Equity
Medical Insurance
Vision Insurance
Parental Leave
  • Competitive compensation with equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive medical, dental, vision benefits
  • Family services including fertility benefits and parental leave
  • Professional development and learning stipend
  • Employee Resource Groups
  • Hybrid work environment
