Demo

Site Reliability Engineer (SRE) Alpharetta - GA - Georgia

Jobs via Dice
Alpharetta, GA Full Time
POSTED ON 6/10/2026
AVAILABLE BEFORE 7/9/2026
ROLE_DESCRIPTION -

Skill Set - Expertise in UNIX LINUX Administration AWS AZURE Cloud monitoring Terraform Ansible Prometheus Grafana observability experience).

Work Location - Alpharetta

Experience required for role - 6 years

Production experience in SRE Infrastructure ops for large-scale systems

Strong programmingscripting skills (Python, Go, Java, or equivalent)

Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)

Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)

Familiarity with GPU AI compute clusters, high-performance data storage, and distributed architectures

Experience with monitoring observability logging alerting tools (Prometheus, Grafana, ELK EFK, Datadog, etc.)

Networking & systems engineering knowledge (TCPIP, DNS, routing, load balancing, distributed storage)

Solid experience in capacity planning, performance tuning, scaling, and incident response

Demonstrated ability to lead RCAs, deploy fixes, and drive reliability improvements

Experience in regulated environments (financial services, compliance, audit, security) is a strong plus

Excellent communication, documentation, and cross-team collaboration skills

Proven track record of reducing operational toil via automation

Experience: 6 years of experience as a Site Reliability Engineer or in a similar role, with hands-on experience in supporting IaaS platforms with networking and system engineering knowledge.

Operate, monitor, and maintain the infrastructure supporting GenAI applications (training, inference, feature store, data ingestion, model serving)

Design and build automation for core platform capabilities, reducing manual toil

Develop and maintain infrastructure-as-code (IaC) for provisioning and managing compute, storage, network, GPU clusters, Kubernetes container orchestration, etc.

Establish, monitor, and enforce SLOsSLIsSLAs, error budgets, alerting, and dashboards

Lead incident response, root cause analysis (RCA), postmortems, and systemic remediation

Perform capacity planning, scaling strategies, workload scheduling, and resource forecasting

Optimize cost vs. performance tradeoffs in large-scale compute environments

Harden systems for security, compliance, auditability, and data governance

Collaborate across teams (cloud engineers, data engineers, infrastructure, security) to ensure safe deployment, rollout, rollback, and integration of new systems

Define disaster recovery (DR) strategies, backuprestore practices, fault tolerance mechanisms

Maintain runbooks, operational playbooks, documentation, and training materials

Participate in on-call rotations and respond to production incidents 247 as needed

Continuously evaluate and integrate new tools, frameworks, or technologies to enhance platform reliability

Skills: Digital : PythonDigital : DockerDigital : KubernetesDigital : Site Reliability Engineering (SRE)

Experience Required: 6-8

Salary.com Estimation for Site Reliability Engineer (SRE) Alpharetta - GA - Georgia in Alpharetta, GA
$106,877 to $125,363
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer (SRE) Alpharetta - GA - Georgia?

Sign up to receive alerts about other jobs on the Site Reliability Engineer (SRE) Alpharetta - GA - Georgia career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Jobs via Dice

  • Jobs via Dice Douglas, WY
  • Energy Transfer , recognized by Forbes as one of America's best large employers , is dedicated to responsibly and safely delivering America's energy . We a... more
  • 1 Day Ago

  • Jobs via Dice Smithfield, RI
  • job summary: Focus on customer: Demonstrate understanding of customer's business domain. Ensuring the technology team is building the right software soluti... more
  • 1 Day Ago

  • Jobs via Dice Middletown, RI
  • Job ID: 2612055 Location: Middletown, RI, US Date Posted: 2026-05-03 Category: Quality Assurance Subcategory: Qual Assurance Technician Schedule: Full-Time... more
  • 1 Day Ago

  • Jobs via Dice Cranston, RI
  • Dice is the leading career destination for tech experts at every stage of their careers. Our client, Talent Groups, is seeking the following. Apply via Dic... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Site Reliability Engineer (SRE) Alpharetta - GA - Georgia jobs in the Alpharetta, GA area that may be a better fit.

  • City of Alpharetta, GA Alpharetta, GA
  • Starting Pay: $81,350-$95,177 Under administrative direction of the Development and Planning Manager, plans, organizes, directs and coordinates the activit... more
  • 2 Days Ago

  • Sierra Business Solution LLC Alpharetta, GA
  • Alpharetta GA Job Description: >> The Privileged Access Engineer (PAM) with AI Capabilities is responsible for securing privileged access across enterprise... more
  • 3 Days Ago

AI Assistant is available now!

Feel free to start your new journey!