Demo

Site Reliability Engineer

Interon IT Solutions
Malvern, PA Full Time
POSTED ON 5/7/2026
AVAILABLE BEFORE 6/4/2026
#W2 Role

Senior Reliability Engineer

Job Description

As a Senior Reliability Engineer, you will play a critical role in solving impactful operational problems. You are curious and take a proactive approach to identifying problems and making improvements. You balance innovative thinking with pragmatism and understand the long-term impacts of technical decisions. You communicate complex ideas clearly and collaborate effectively to deliver scalable solutions.

Core Responsibilities

Team is focused on automating incident response and infrastructure management. While Java and Python receive a stronger emphasis, candidates with solid programming fundamentals in any language and the ability to adapt will be considered. Experience with AWS and event-driven architectures is also valuable.

From a technical standpoint, familiarity with observability concepts (e.g., distributed tracing) and tools like Prometheus or Grafana is beneficial, though not mandatory. More important is an understanding of the underlying principles, such as instrumentation and monitoring strategies.

  • Improve resiliency engineering practices across platforms and applications, including resilient application design patterns, system observability and deployment strategies
  • Incident detection, troubleshooting, and resolution.
  • Develop automation for incident response and infrastructure management
  • Develop and support OpenTelemetry integrations for multiple application platforms (browser, ECS, lambda, etc) and languages (JavaScript, Java)
  • Contribute to architectural decisions and support implementation of solutions.

Skills And Qualifications

  • Deep knowledge of Java or Javascript. Practical experience developing and operating software in distributed systems environments.
  • Problem-solving and analytical thinking: ability to diagnose complex issues and propose efficient solutions. Strong debugging and optimization skills for performance and scalability.
  • Cloud platforms: Hands-on experience with AWS services and cloud infrastructure
  • System architecture and design: ability to design scalable, secure, and maintainable systems.
  • Working knowledge of Python (or similar scripting language).
  • Strong knowledge of resiliency engineering techniques for both platforms and applications.
  • Experience troubleshooting complex production issues and implementing effective mitigations.
  • Familiarity with OpenTelemetry specification and core APIs.

From a Screening Perspective, We Recommend Focusing On

  • How candidates approach software releases and validate functionality
  • Their understanding of system dependencies and fault tolerance
  • Experience with diagnosing and resolving production issues
  • Their ability to reflect on past incidents and identify improvements
  • Evidence of systems thinking and architectural awareness

Salary.com Estimation for Site Reliability Engineer in Malvern, PA
$88,841 to $116,035
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Interon IT Solutions

  • Interon IT Solutions Chantilly, VA
  • #W2 Role Role: Oracle EPM Location: Remote JD; We need a strong resource proficient in any one of the Oracle EPM applications (ARCS/FCCS/EDM/NR) and nice t... more
  • 11 Days Ago

  • Interon IT Solutions Malvern, PA
  • Job Title: Technical Lead Location: Malvern, PA or Dallas, TX Work Type: Onsite / Hybrid Experience: 10 Years Job Description Our client is seeking a hands... more
  • 11 Days Ago

  • Interon IT Solutions Malvern, PA
  • Job Title- Delivery Engineering Lead Location: Malvern, PA or Dallas, TX Work Type: Onsite / Hybrid Experience: 10 Years Job Description Our client is seek... more
  • 11 Days Ago

  • Interon IT Solutions Herndon, VA
  • #W2 Role Role: AWS Developer Location: Reston, VA - HYBRID JD: Full stack developer, with focus on Dev/SecOps (AWS experience, strong programming skills wi... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Malvern, PA area that may be a better fit.

  • Liberty Personnel Services, Inc. Philadelphia, PA
  • Job Details: Lead Site Reliability Engineer The Lead Site Reliability Engineer is a senior technical leader responsible for the reliability, availability, ... more
  • 9 Days Ago

  • Comcast Mount Laurel, NJ
  • Make your mark at Comcast -- a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experienc... more
  • 10 Days Ago

AI Assistant is available now!

Feel free to start your new journey!