What are the responsibilities and job description for the SRE / Observability Architect position at Galactic Minds INC?
SRE / Observability Architect
Location: Fort Mill, SC (Onsite)
Key Responsibilities
- Act as a Solutions Architect and SRE advocate, providing strategic guidance on reliability engineering practices.
- Engage across the full lifecycle of application & cloud services—from design to deployment, operations, and continuous improvement.
- Define and implement SRE principles, including CUJs, SLOs, SLIs, and error budgets, based on NFRs.
- Lead development of:
  - SRE dashboards
  - Error Budget tracking
  - Root cause analysis
  - Automation & TOIL reduction
- Collaborate closely with development teams to ensure well-designed, planned, and monitored releases.
- Proactively identify anomalies, automation opportunities, and reliability improvements.
- Assess current SRE landscape and define future SRE approach & roadmap.
- Design & implement observability solutions for real-time monitoring and troubleshooting.
Technical Skill Requirements
Core Technologies
- .NET, SQL, React
- AWS Cloud
- Dynatrace, Splunk, Elastic Stack
- SolarWinds DPA
- Python, Shell, or other scripting languages
- Ansible Tower, Terraform (IaC)
- CI/CD: Git, GitHub Actions, GitHub Workflows (Jenkins is a plus)
SRE Expertise
- Strong understanding of CUJs, SLOs, SLIs, Error Budgets
- Proven experience reducing TOIL through automation and process improvement
- Hands-on with monitoring, observability, alerting, and logging frameworks
- Ability to design and implement scalable, automated reliability solutions
- Experience in AIOps, anomaly detection, and performance optimization
Cloud & DevOps
- Strong experience in AWS
- Good understanding of container orchestration (EKS/Kubernetes preferred)
- Solid experience with Infrastructure as Code (Terraform)