Site Reliability Engineer

Matlen Silver
Chandler, AZ Contractor
POSTED ON 6/23/2026
AVAILABLE BEFORE 1/1/2027

The Site Reliability Engineer (SRE) Lead is responsible for ensuring the reliability, scalability, performance, and security of enterprise Linux-based systems. This role combines deep technical expertise in Linux administration with leadership in automation, observability, incident management, and infrastructure engineering. The SRE Lead drives operational excellence by implementing best practices, improving system resilience, and mentoring engineering teams.

Platform Engineering and Operations

Lead the administration, monitoring, and performance tuning of Oracle Enterprise Linux (OEL) environments in a large-scale enterprise ecosystem.

Oversee the design, build, and lifecycle management of Linux servers, including storage, virtualization, and associated infrastructure.

Manage high availability (HA) configurations, clustering, and load-balanced environments to ensure minimal downtime.

Drive capacity planning, performance optimization, and system scalability initiatives.

Reliability and Automation (SRE Practices)

Define and implement SRE principles, including SLIs, SLOs, and error budgets.

Lead initiatives for infrastructure automation (provisioning, configuration, patching) using tools such as Ansible.

Build and maintain self-healing systems, reducing manual intervention and improving system resilience.

Develop automation for system installation, configuration, and deployment pipelines.

System Administration and Infrastructure Management

Install, configure, and maintain Oracle Enterprise Linux (OEL) operating systems and related software stacks.

Manage Logical Volume Manager (LVM) configurations, including volume groups and filesystem expansion.

Administer distributed file systems, NFS servers/clients, and automount configurations.

Maintain network services such as DNS, NTP, LDAP/Kerberos, SMTP (sendmail/postfix), and OpenSSH.

Troubleshoot and support network protocols (TCP/IP, HTTP, HTTPS, RPC).

Monitoring, Incident Management, and Support

Implement and enhance monitoring, alerting, and observability frameworks for proactive issue detection.

Lead incident response, root cause analysis (RCA), and postmortem reviews.

Drive continuous improvement by identifying systemic issues and implementing preventive solutions.

Oversee break/fix operations, ensuring timely resolution and minimal business impact.

Security and Compliance

Ensure systems are secure, hardened, and compliant with enterprise security standards.

Manage patching, vulnerability remediation, and OS upgrades.

Partner with security teams to implement best practices for access control, auditing, and encryption.

Leadership and Collaboration

Provide technical leadership and mentorship to SRE and infrastructure teams.

Collaborate with application, DevOps, and platform teams to improve system reliability and deployment processes.

Define and enforce operational standards, runbooks, and best practices.

Drive cross-functional initiatives to enhance platform stability and efficiency.

Documentation and Governance

Maintain comprehensive documentation for architecture, processes, and operational procedures.

Ensure adherence to change management and incident governance frameworks.

Standardize operational workflows across environments.


Required Qualifications

5 years of experience in Linux system administration in enterprise environments.

Strong expertise in Oracle Enterprise Linux (OEL) systems.

Proven experience in high availability systems, virtualization, and storage management.

Hands-on experience with automation and configuration management tools (Ansible preferred).

Proficiency in at least one scripting/programming language (Bash, Python preferred).

Strong experience with infrastructure troubleshooting, performance tuning, and incident management.

Solid understanding of enterprise infrastructure (compute, storage, network).

Excellent analytical, problem-solving, and organizational skills.

Strong communication and collaboration skills in a global team environment.

Hourly Wage Estimation for Site Reliability Engineer in Chandler, AZ
$41.00 to $53.00