Taro Logo

Software Engineering SMTS

Global leader in CRM software providing cloud-based business solutions and enterprise software applications.
$172,000 - $236,500
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud

Job Description

Salesforce, the world's leading CRM platform, is seeking a Site Reliability Engineer / DevOps Engineer to join their infrastructure team. This role is central to maintaining and evolving Salesforce's massive cloud infrastructure that supports thousands of internal developers and tens of thousands of customers worldwide. The position focuses on building and operating the next-generation Microservices Platform, leveraging cutting-edge technologies like Service Mesh and Ingress Gateway load balancing.

The role offers an exciting opportunity to work with a large-scale distributed system, managing over 1000+ clusters running various technologies including Kubernetes, Docker, and service mesh. You'll be at the forefront of cloud-native and AI-driven operational practices, working to build highly reliable, self-healing, and scalable services. The position combines hands-on technical work with strategic thinking about infrastructure automation and optimization.

As a Site Reliability Engineer, you'll be responsible for maintaining high availability of critical microservices, implementing monitoring solutions, driving automation efforts, and improving CI/CD pipelines. You'll work with technologies like Prometheus, Grafana, Python, Golang, and various AWS services. The role requires strong technical skills in container orchestration, Linux systems administration, and network technologies.

The position offers the chance to work with a highly innovative team of developers and architects, collaborating across various infrastructure teams at Salesforce. You'll be involved in evaluating and implementing new technologies, driving AIOps automation, and contributing to the evolution of Salesforce's cloud infrastructure. This is an excellent opportunity for someone passionate about large-scale systems, automation, and cloud-native technologies to make a significant impact at a leading technology company.

The ideal candidate will bring 3+ years of SRE/DevOps experience, strong technical skills, and excellent problem-solving abilities. You'll be joining a company known for its innovative culture and commitment to customer success, working on systems that power the world's largest business automation cloud platform.

Last updated 3 days ago

Responsibilities For Software Engineering SMTS

  • Responsible for high availability of microservices supporting service mesh and ingress gateway on 1000+ clusters
  • Contribute code to drive service availability improvement
  • Implement monitoring and metrics with Prometheus, Grafana and other frameworks
  • Drive automation efforts in Python/Golang/Puppet/Jenkins
  • Improve CI/CD pipelines built on Terraform, Spinnaker and Argo
  • Implement AIOps automation, monitoring and self-healing mechanisms
  • Collaborate with various Infrastructure teams across Salesforce
  • Evaluate new technologies to solve problems

Requirements For Software Engineering SMTS

Kubernetes
Go
Python
Redis
PostgreSQL
  • 3+ years of experience in SRE/Devops/Systems Engineering roles
  • Experience operating large scale cluster management systems
  • Strong working experience with Kubernetes, Docker, Container Orchestration, Service Mesh, Ingress Gateway
  • Good knowledge with network technologies (TCP/IP, DNS, TLS termination, HTTP proxies, Load Balancers)
  • Excellent troubleshooting skills
  • Strong Experience in Observability tools like Prometheus, Grafana, Splunk, ElasticSearch
  • Strong working experience with Linux Systems Administration
  • Good experience in scripting/programming languages: Python, GoLang
  • Experience with AWS, Terraform, Spinnaker, ArgoCD
  • Excellent problem-solving, analytical and communication skills