Braze, a leading customer engagement platform, is seeking a Senior Site Reliability Engineer II specializing in Kafka to join their remote team. This role combines software engineering and systems administration to ensure site reliability and infrastructure scalability. The position involves managing Kafka clusters that handle massive scale - processing hundreds of billions of data points monthly for over 3.3 billion monthly active users.
The ideal candidate will work at the intersection of infrastructure and engineering, focusing on Kafka streaming applications, performance tuning, and automation. You'll be responsible for creating infrastructure as code, developing deployment pipelines, and ensuring high availability of systems. The role requires deep expertise in Kafka, along with experience in modern DevOps tools like Kubernetes, Terraform, and Docker.
Braze offers a collaborative, transparent culture recognized as a Great Place to Work® across multiple regions. The company provides comprehensive benefits, including equity participation, flexible PTO, and strong professional development support. You'll be part of a global team working on cutting-edge customer engagement technology, with offices across major tech hubs worldwide.
This position offers the opportunity to work remotely while tackling challenging technical problems at scale. The role combines hands-on technical work with strategic infrastructure planning, making it ideal for engineers who enjoy both deep technical work and broader system design. If you're passionate about reliability engineering and want to work with modern technologies at scale, this role offers an excellent opportunity to make a significant impact.