Salesforce, the world's leading CRM platform, is seeking a Site Reliability Engineer / DevOps Engineer to join its infrastructure team. This role is central to maintaining and evolving Salesforce's massive cloud infrastructure, which supports thousands of internal developers and tens of thousands of customers worldwide. The position focuses on building and operating the next-generation Microservices Platform, leveraging cutting-edge technologies like Service Mesh and Ingress Gateway load balancing.
The role offers an exciting opportunity to work with a large-scale distributed system, managing more than 1,000 clusters running technologies including Kubernetes, Docker, and service mesh. You'll be at the forefront of cloud-native and AI-driven operational practices, working to build highly reliable, self-healing, and scalable services. The position combines hands-on technical work with strategic thinking about infrastructure automation and optimization.
As a Site Reliability Engineer, you'll be responsible for maintaining high availability of critical microservices, implementing monitoring solutions, driving automation efforts, and improving CI/CD pipelines. You'll work with technologies like Prometheus, Grafana, Python, Golang, and various AWS services. The role requires strong technical skills in container orchestration, Linux systems administration, and network technologies.
The position offers the chance to work with a highly innovative team of developers and architects, collaborating across various infrastructure teams at Salesforce. You'll be involved in evaluating and implementing new technologies, driving AIOps automation, and contributing to the evolution of Salesforce's cloud infrastructure. This is an excellent opportunity for someone passionate about large-scale systems, automation, and cloud-native technologies to make a significant impact at a leading technology company.
The ideal candidate will bring 3+ years of SRE/DevOps experience, strong technical skills, and excellent problem-solving abilities. You'll be joining a company known for its innovative culture and commitment to customer success, working on systems that power the world's largest business automation cloud platform.