Senior Site Reliability Engineer II (Kafka)

Braze

Leading customer engagement platform that empowers brands to be absolutely engaging, helping marketers collect and act on customer data in real time across channels.

Ontario, Canada

DevOps

Senior Software Engineer

Remote

1,000 - 5,000 Employees

5+ years of experience

Enterprise SaaS

Description For Senior Site Reliability Engineer II (Kafka)

Braze, a leading customer engagement platform, is seeking a Senior Site Reliability Engineer II specializing in Kafka to join its remote team. This role combines software engineering and systems administration to ensure site reliability and infrastructure scalability. The position involves managing Kafka clusters that operate at massive scale, processing hundreds of billions of data points monthly for over 3.3 billion monthly active users.

The ideal candidate will work at the intersection of infrastructure and engineering, focusing on Kafka streaming applications, performance tuning, and automation. You'll be responsible for creating infrastructure as code, developing deployment pipelines, and ensuring high availability of systems. The role requires deep expertise in Kafka, along with experience in modern DevOps tools such as Kubernetes, Terraform, and Docker.

Braze offers a collaborative, transparent culture recognized as a Great Place to Work® across multiple regions. The company provides comprehensive benefits, including equity participation, flexible PTO, and strong professional development support. You'll be part of a global team working on cutting-edge customer engagement technology, with offices across major tech hubs worldwide.

This position offers the opportunity to work remotely while tackling challenging technical problems at scale. The role combines hands-on technical work with strategic infrastructure planning, making it ideal for engineers who enjoy both deep technical work and broader system design. If you're passionate about reliability engineering and want to work with modern technologies at scale, this role offers an excellent opportunity to make a significant impact.

Last updated 4 days ago

Responsibilities For Senior Site Reliability Engineer II (Kafka)

Partner with engineering teams on architecture and debugging
Create infrastructure as code using Chef, Terraform, and Kubernetes
Develop deployment pipelines using Docker and Kubernetes
Manage incidents and participate in the PagerDuty on-call rotation
Monitor and troubleshoot Kafka streaming applications
Set up alerting and dashboards for high-availability pipelines
Scale Kafka clusters and manage schema evolution

Requirements For Senior Site Reliability Engineer II (Kafka)

Kafka

Kubernetes

MongoDB

Redis

Ruby

5+ years of experience as a Software, DevOps, or Site Reliability Engineer
3+ years of Data Streaming Reliability Engineering
3+ years of Kafka performance tuning & automation
Strong programming skills (Ruby and/or Go preferred)
Experience with Docker, Kubernetes, Terraform
Experience with MongoDB, Redis, Kafka, Postgres
Knowledge of Linux and Unix Shell
Strong collaboration and documentation skills

Benefits For Senior Site Reliability Engineer II (Kafka)

401k

Dental Insurance

Education Budget

Equity

Medical Insurance

Vision Insurance

Parental Leave

Competitive compensation with equity
Retirement and Employee Stock Purchase Plans
Flexible paid time off
Comprehensive medical, dental, vision benefits
Family services including fertility benefits and parental leave
Professional development and learning stipend
Employee Resource Groups
Hybrid work environment
