
Manager, Site Reliability Engineering

ALO
Beverly Hills, CA Full Time
POSTED ON 11/3/2025
AVAILABLE BEFORE 12/2/2025
WHY JOIN ALO?

Mindful movement. It’s at the core of why we do what we do at ALO—it’s our calling. Because mindful movement in the studio leads to better living. It changes who yogis are off the mat, making their lives and their communities better. That’s the real meaning of studio-to-street: taking the consciousness from practice on the mat and putting it into practice in life.

Site Reliability Engineering (SRE) Manager

We are seeking a Site Reliability Engineering (SRE) Manager to support our fast-growing organization. This manager will draw on their SRE experience to address both short-term and long-term demands, providing solutions that enhance the reliability, scalability, and efficiency of our e-commerce and internal systems. They will work closely with our onshore and offshore teams to build and scale the SRE function to meet our goals and expectations.

Responsibilities:

  • Work with Digital Technology Leadership: Spend time with each leader on the digital technology team to understand their current portfolio. Steer SRE teams to proactively identify and address issues before they occur and triage issues independently, reducing the need to escalate to engineering teams.
  • Incident Management & Response: Own the end-to-end incident response process for our SRE Level 3 roles, from on-call preparedness to post-incident reviews. Ensure clear severity definitions, escalation paths, and timely communication during incidents to minimize downtime. Co-lead blameless post-mortems and implement process improvements for faster, more accurate incident resolution.
  • Monitoring & Observability: Drive enhancements in monitoring and observability across all products and services. Expand meaningful alerting and dashboards using our tools (e.g., New Relic) to proactively detect issues and reduce alert noise. Continuously refine alert thresholds and ensure only high-priority alerts wake up the on-call team, routing lower-priority issues to the ticketing system. Champion the use of observability scorecards to measure coverage and address gaps.
  • Automation & Tooling: Identify opportunities to automate repetitive tasks and reduce operational toil. Oversee integration between our incident management platform (PagerDuty) and ITSM system. Leverage infrastructure-as-code and other automation tools.
  • Cross-Team Collaboration: Partner with software engineering and DevOps teams to ensure new features and services are production-ready. Establish production readiness checklists and work closely with QA, product, and change management teams to embed SRE practices into the SDLC.
  • Vendor & Partner Reliability: Maintain strong relationships with critical technology vendors. Develop clear vendor support and escalation plans. Collaborate on joint drills or reviews to ensure uptime and recovery objectives are met.
  • Reliability Strategy: Define and track reliability goals aligned with business needs. Report on SRE KPIs and continuously refine the SRE roadmap.

Qualifications:

  • Experience: 5 years in SRE/DevOps/Infrastructure roles, including 2 years in leadership. Proven experience with mission-critical systems is essential.
  • Technical Depth: Strong knowledge of system administration, networking, and cloud infrastructure (e.g., AWS). Hands-on experience with New Relic, PagerDuty, Freshservice, logging, APM, and tracing tools is required.
  • Engineering Expertise: Significant experience in engineering, with a deep understanding of software development, system architecture, and infrastructure management.
  • Automation Skills: Skilled in scripting (Python, Shell) and infrastructure-as-code (Terraform). Ability to build CI/CD and self-healing mechanisms.
  • Incident & Problem Solving: Deep understanding of incident response, ITIL/ITSM, and root cause analysis. Experience with alert tuning and communication plans is necessary.
  • Leadership & Communication: Strong management and communication skills. Ability to align cross-functional teams and translate reliability issues into business terms.
  • Growth Mindset: Committed to continuous improvement and learning. Ability to assess SRE maturity and drive iterative improvements.

For CA residents, Job Applicant Privacy Policy HERE.

Salary.com Estimation for Manager, Site Reliability Engineering in Beverly Hills, CA
$127,844 to $161,387



