Salesforce is seeking an Engineering Leader to join the Site Reliability organization. This customer-facing role requires on-site presence in Northern Virginia and is not remote. The Senior Manager, Systems Engineering will lead efforts for detecting and resolving incidents within minutes, working closely with Infrastructure and R&D organizations.
Key responsibilities include:
- Maintaining the health of customer-facing services
- Incident management, including acting in key support roles during major incidents
- Problem management, including participating in Root Cause Analysis (RCA)
- Ensuring alignment with company's internal compliance policies
- Leading team members in staying on top of industry innovations
- Identifying work opportunities and preparing technical proposals
- Automating detection and resolution of recurring issues
Requirements:
- Active TS/SCI clearance with polygraph
- 4-year technical degree
- 4+ years of experience managing Engineers and Site Reliability Engineers
- Systems engineering experience in enterprise-scale internet service
- Expertise in TCP/IP technologies and Unix variants
- Strong understanding of monitoring implementations
- Experience with AWS, incident management, and ITIL service operations
- Experience working within the Intelligence Community (IC)
Preferred qualifications include experience with scripting languages, automated deployment, database support, Java applications, and Docker orchestration.
This role offers the opportunity to work in a fast-paced environment, solving sophisticated issues and optimizing multiple priorities in a 24/7/365 operations center.