Robinhood Markets is seeking a Staff Software Engineer to join their Reliability Engineering team, focusing on observability reliability. This role combines software engineering and systems operations to ensure the reliability, scalability, performance, and security of systems serving millions of users. The position involves working with applications in brokerage, crypto, and money, managing SLAs and SLOs, and improving incident metrics. As a staff engineer, you'll help build the roadmap and collaborate with cross-functional partners, with specific ownership of reducing incident metrics like MTTD and MTTR. This is an opportunity to join a newly formed team at a company that's democratizing finance for all.
The role requires expertise in large-scale distributed systems, with bonus points for experience with EKS on AWS and large infrastructure components. You'll be responsible for designing and maintaining critical systems, mentoring teammates, and driving innovation in infrastructure optimization. The position offers competitive compensation, comprehensive benefits, and the chance to work in a dynamic fintech environment.
The ideal candidate brings 8+ years of experience in building large-scale systems, proficiency in languages like Python/Go/C++, and strong expertise in Linux/Unix systems. You'll be based in Menlo Park, CA, working in an in-office environment with a team dedicated to operational excellence and system resilience. This role combines technical leadership with hands-on engineering, making it perfect for someone who wants to impact financial technology while working with cutting-edge infrastructure and observability tools.