Site Reliability Engineer

Rapsys Technologies
Atlanta, GA Contractor
POSTED ON 5/1/2026
AVAILABLE BEFORE 5/31/2026

Proven experience in high-availability, high-transaction environments (preferably payments or financial services).
Strong background in production resiliency and recovery (recovery execution, runbooks/playbooks, RCA mindset).
Incident pattern analysis: MTTR baselines (P2 Major/Minor) and a recurring-failure taxonomy (by rail/service).
Senior-level observability expertise: dashboards, monitors, and alerts (Datadog preferred; similar tools considered).
Tooling: Splunk, Datadog, SQLS, JQL (Jira Query Language), GitLab.
Experience with CI/CD metrics and generating executive reports on code quality, changes, and test automation from GitLab.
Understanding of story quality, metrics, and monitoring, with the ability to gather data that showcases deficiencies.
Senior CI/CD experience: pipeline design/operation, release safety patterns, and rollback readiness.
Experience using metrics and monitoring data to identify and communicate deficiencies.
Automation skills: Python and/or PowerShell (or equivalent) for building repeatable recovery workflows and operational tooling.
Kubernetes/container platform production troubleshooting (deployments, pods, config drift, safe restarts, and "why did this change break prod" investigations).
Experience with identity/credentials/certificate & secret-rotation resilience (preventing outages during password rotations, certificate upgrades, and secret propagation; implementing guardrails and monitoring for these events).
Batch/scheduler/job-execution reliability (detecting/preventing silent job failures, validating multi-D scenarios, and building controls to ensure scheduled processing does not impact customers).
Distributed integration failure-handling (timeouts, retries, backpressure, idempotency, duplicate prevention, and reconciliation, especially across vendor/downstream dependencies).
Nice-to-have (differentiators)
Experience with SRE-style reliability practices (SLO/SLI thinking, error budgets, operational metrics).
Experience with failover / DC flip / active-active or active-passive recovery concepts and scenario-based runbooks.
Cloud engineering (Azure, AWS) and DevOps tools expertise (Jenkins, Terraform, SonarQube, Helm charts).
Network & traffic-management incident triage (load balancers/firewalls/VLAN changes, DC traffic flips, and rapid isolation of "app vs infra vs network" to stabilize service).

Salary: $50 - $55

Job openings at Rapsys Technologies

  • Rapsys Technologies Phoenix, AZ
  • Develop and implement high-quality software solutions using Spring Cloud, Spring Boot, and Java to meet business requirements in Commercial Lending and Ass... more
  • 5 Days Ago

  • Rapsys Technologies Phoenix, AZ
  • Application Claims Architect – Claims IVR & Amazon Connect Role Summary The Application Claims Architect is responsible for designing and governing the end... more
  • 5 Days Ago

  • Rapsys Technologies Malvern, PA
  • * Please include the education dates for all degrees on the resume. Submissions without this information will not be considered. * Please note that this is... more
  • 5 Days Ago

  • Rapsys Technologies Atlanta, GA
  • The SAP BTP Developer will be responsible for designing, developing, integrating, customizing, and deploying SAP Forms used within SAP Ariba Guided Buying ... more
  • 2 Days Ago


Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Atlanta, GA area that may be a better fit.

  • InfoVision, Inc. Atlanta, GA
  • Job Title: Senior Site Reliability Engineer (VMware Infrastructure) Location: Atlanta, GA Duration: Long-term Required Experience: · 10 years of experience... more
  • 5 Days Ago

  • TEKsystems Alpharetta, GA
  • Site Reliability Engineer (Onsite – Alpharetta, GA | 3 Days/Week) Overview We are seeking an experienced Site Reliability Engineer (SRE) to join our team a... more
  • 12 Days Ago
