Platform Site Reliability Engineer

Cypress HCM
Lehi, UT Contractor
POSTED ON 4/23/2026
AVAILABLE BEFORE 4/29/2027
Job Details

  • Platform Site Reliability Engineer (Contract)
  • Location: Lehi, UT 84043 (Hybrid)
  • Duration: 1 Year
  • Team: Productivity and Insights US

Overview

  • We are seeking a Platform Site Reliability Engineer to ensure the reliability, availability, and operational health of a portfolio of SaaS-based solutions, including both in-house (customer-zero) platforms and vendor-managed services. This role applies Site Reliability Engineering principles in environments where full-stack control is not always possible, requiring strong observability practices, operational judgment, and effective collaboration across teams and vendors.
  • You will play a key role in on-call operations and incident response, establishing meaningful signals from systems you don’t fully own and driving practical improvements that reduce risk and customer impact. Success in this role depends on strong communication, creativity in instrumentation and monitoring, and the ability to influence reliability outcomes across organizational and vendor boundaries, leveraging modern, AI-assisted tooling to improve detection, diagnosis, and learning.

Key Responsibilities

  • Ensure the reliability, availability, and operational health of a portfolio of SaaS-based solutions, including vendor-managed services and in-house (customer-zero) platforms.
  • Participate in on-call rotations and incident response, leading investigation, mitigation, coordination, and post-incident follow-up.
  • Establish and maintain effective observability for systems that are not fully owned, identifying practical ways to obtain actionable metrics, logs, and signals from vendor and partner solutions.
  • Use operational data and incident learnings to identify reliability risks and drive targeted improvements that reduce customer impact.
  • Apply appropriate change controls at owned or influenced layers of the stack, balancing reliability, velocity, and business needs.
  • Partner with internal teams and external vendors to communicate expectations, coordinate response and remediation, and influence reliability outcomes.
  • Produce clear incident communications and post-incident analyses that inform stakeholders and drive lasting improvements.
  • Leverage automation and AI-assisted tooling to improve detection, triage, and operational efficiency.

Required Skills & Qualifications

  • Strong foundation in Site Reliability Engineering practices, including observability, incident response, and reliability measurement.
  • Hands-on experience operating SaaS or third-party systems where full-stack ownership is limited.
  • Deep understanding of monitoring, logging, and alerting, with the ability to design signals that are actionable rather than noisy.
  • Proven incident response experience, including on-call participation and cross-team coordination during high-impact events.
  • Ability to think creatively and pragmatically when instrumenting and improving systems with constrained control.
  • Excellent written and verbal communication skills, especially in high-pressure incident and vendor-coordination scenarios.
  • Experience working across organizational and vendor boundaries to resolve complex operational issues.
  • Sound engineering judgment when assessing risk, prioritizing work, and making reliability tradeoffs in production environments.

Education & Experience

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent practical experience.
  • 3–5 years of experience in Site Reliability Engineering, production operations, or a closely related role.
  • Experience supporting production systems with on-call responsibilities and incident response expectations.
  • Strong experience working with observability data (metrics, logs, alerts) to diagnose issues and drive improvements.
  • Comfort using automation and AI-assisted tools as part of everyday operational workflows.

Preferred

  • Experience supporting enterprise-scale SaaS platforms or shared services.
  • Prior experience working directly with vendors to resolve reliability or operational issues.
  • Familiarity with cloud-based and distributed system architectures.

Compensation

  • $51.00 to $56.34 per hour.

#37167695
