What are the responsibilities and job description for the Site Reliability Engineer position at A-Line Staffing Solutions?
Site Reliability Engineer (SRE)
Location: San Diego, CA (Hybrid) LOCAL CANDIDATES ONLY
Rate: $70–80/hr on W-2 (No C2C)
Overview
We are seeking an experienced Site Reliability Engineer (SRE) to join our cross-functional team supporting cloud-based systems in a regulated healthcare environment. This role is ideal for an engineer who thrives on automation, scalability, observability, and ensuring the reliability and performance of enterprise cloud infrastructure. You’ll work closely with development, platform, and operations teams to optimize our AWS and Azure environments.
Key Responsibilities
Cloud Infrastructure Management
- Design, deploy, and maintain scalable, secure, and highly available infrastructure on AWS and Azure.
- Develop Infrastructure-as-Code using Terraform, AWS CDK, or CloudFormation.
- Script and automate workflows using TypeScript, PowerShell, or Go.
- Ensure compliance with SOC II, ePHI, and healthcare data security standards.
Observability & Monitoring
- Implement and optimize Datadog for comprehensive application and infrastructure monitoring.
- Build alerting mechanisms for key performance indicators (latency, system health, error rates).
- Create and maintain real-time performance dashboards and incident response runbooks.
Performance Optimization & Troubleshooting
- Identify and resolve system bottlenecks; ensure reliability and scalability of production systems.
- Conduct root cause analysis and participate in on-call rotations.
- Continuously improve architecture, security posture, and disaster recovery strategies.
Collaboration & DevOps Enablement
- Partner with development teams to enhance CI/CD pipelines (Jenkins, GitHub Actions, or Azure DevOps).
- Champion infrastructure as code and automation across the organization.
- Collaborate with security and compliance teams to uphold all regulatory standards.
Security & Compliance
- Maintain security posture for healthcare data systems in alignment with SOC II and HIPAA/ePHI.
- Implement IAM best practices, encryption policies, and regular audit processes.
Qualifications
- Bachelor’s in Computer Science, Engineering, or related field (or equivalent experience).
- 3 years as an SRE managing cloud environments on AWS and/or Azure.
- Hands-on experience with observability tools (Datadog, Prometheus, Grafana, etc.).
- Expertise in Terraform, CloudFormation, or AWS CDK.
- Strong background in Kubernetes and Docker.
- Experience with Ansible, Puppet, or Chef for automation.
- Proficiency with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps).
- Healthcare compliance experience (SOC II, ePHI) strongly preferred.
Nice to Have
- Experience in regulated industries (Healthcare, Medical Devices).
- Certifications: AWS Solutions Architect, Azure Administrator, CKA.
- Exposure to AI/ML models for predictive performance and maintenance.
- Familiarity with serverless technologies (AWS Lambda, Azure Functions).
Additional Attributes
- Strong analytical and decision-making skills.
- Collaborative and effective communicator with cross-functional teams.
- Action-oriented and solutions-focused mindset.
- Proven ability to influence without direct authority.
- Excellent written skills for documenting processes and technical plans.
Salary : $70 - $80