What are the responsibilities and job description for the Site Reliability Engineer - AWS, GCP- CST or EST ONLY position at Addison Group?
Title: Site Reliability Engineer (SRE)
Location: CST OR EST ONLY - Please include your city and state on your resume
Salary: Base: $110,000 – $150,000 / Annually
No sponsorship / Must be U.S. Citizen or Permanent Resident (Green Card)
Benefits: Health, Medical, Dental, Vision, 401(k) with match, Stock Options, PTO, and additional perks
Overview
We are hiring a Site Reliability Engineer to design, secure, and operate highly available infrastructure supporting an AI-driven platform serving U.S. customers, including organizations within regulated industries. This role owns U.S.-based platform operations while collaborating with a global engineering organization in a fast-paced, high-growth environment. GCP and Snowflake experience is required.
What You’ll Do
- Design, implement, and operate scalable, fault-tolerant infrastructure primarily on GCP with future multi-cloud expansion
- Lead Infrastructure-as-Code initiatives using Terraform with strong security and governance practices
- Build and maintain CI/CD and DevSecOps pipelines supporting large-scale engineering and AI workloads
- Implement observability and monitoring using Prometheus, Grafana, ELK, and similar tools
- Define SLOs/SLIs, manage error budgets, and lead incident response with blameless postmortems
- Support compliance requirements within regulated U.S. industries
- Automate operational workflows using Python, Go, or Bash
- Collaborate with global teams while owning U.S. platform operations and incidents
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or equivalent experience
- 2 years of experience in SRE, DevOps, or Systems Engineering
- Strong Terraform and Infrastructure-as-Code experience
- Proficiency with Python and scripting languages
- Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.)
- Cloud experience (GCP preferred; AWS/Azure a plus)
- Kubernetes and Docker experience
- Experience in regulated environments (Aerospace & Defense, Finance, Healthcare preferred)
- Strong communication skills and security-first mindset
- Hyper-growth startup experience
- AI safety, MLOps, or AI/ML infrastructure security experience
IND 005-009