What are the responsibilities and job description for the Senior SRE AIOps Engineer (AWS) position at Calsoft Pvt Ltd?
Role Title: Senior SRE AIOps Engineer (AWS)
Experience Level: 5–8 Years
Location: Irvine, CA
Executive Summary
- Responsible for reliability, performance, and scalability of AWS-based microservices environments.
- Drive modernization of SRE operations through AIOps capabilities using Machine Learning and automation.
- Implement intelligent monitoring, anomaly detection, alert correlation, and self-healing infrastructure solutions.
Core Responsibilities
- Define and manage SLOs, SLIs, and Error Budgets for mission-critical AWS services.
- Manage, optimize, and scale Amazon EKS clusters ensuring high availability and cost efficiency.
- Implement and tune AIOps solutions using AWS DevOps Guru and Amazon Lookout for Metrics.
- Identify performance bottlenecks proactively before impacting end users.
- Lead incident response activities and perform root-cause analysis (RCA).
- Develop automated remediation workflows using AWS Lambda and AWS Step Functions.
- Reduce operational toil by building automation tools and utilities in Python or Go.
Must-Have Skills
- Strong expertise in AWS services including:
- Amazon EKS
- AWS Lambda
- Amazon S3
- VPC Networking
- Hands-on experience with observability and monitoring tools:
- Amazon CloudWatch (Logs Insights, ServiceLens)
- Managed Grafana
- Prometheus
- Expert-level proficiency in Infrastructure as Code:
- Terraform or AWS CDK
- Experience with AI-driven monitoring and AIOps platforms:
- AWS DevOps Guru
- Datadog Watchdog
- Dynatrace Davis
- Strong Python programming skills with AWS SDKs (Boto3).
- Advanced knowledge of:
- Docker
- Kubernetes manifests
- Helm charts
Good-to-Have Skills
- Exposure to Generative AI services:
- Amazon Bedrock
- Amazon Q
- Basic understanding of ML models for:
- Predictive scaling
- Forecasting
- Security monitoring experience with:
- AWS GuardDuty
- AWS Security Hub
- Relevant certifications preferred:
- AWS Certified DevOps Engineer – Professional
- Certified Kubernetes Administrator (CKA)
Thank You, we look forward to your response!