What are the responsibilities and job description for the SRE Lead position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Techno Talent Inc., is seeking the following. Apply via Dice today!
Key Responsibilities
- Drive incident management, root cause analysis (RCA), and post-incident reviews
- Build and maintain highly available and resilient systems across AWS cloud environments
- Develop and automate CI/CD pipelines using tools such as Jenkins or GitLab
- Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
- Establish monitoring, alerting, and capacity planning strategies
Qualifications
- 10 years of experience in SRE / DevOps / Production Engineering
- Strong experience with AWS cloud services (EC2, S3, RDS, Lambda, EKS, etc.)
- Expertise in monitoring and observability tools (Dynatrace, Splunk, Prometheus, Grafana)
- Hands-on experience with automation and scripting (Python, Bash, or Groovy)
- Experience with CI/CD tools (Jenkins, GitLab CI/CD, or similar)
- Strong understanding of distributed systems and microservices architecture
- Experience with containerization (Docker, Kubernetes)
- Knowledge of incident management frameworks and ITIL practices
- Strong troubleshooting and performance tuning skills
- Excellent communication and leadership abilities