What are the responsibilities and job description for the Manager, AWS Cloud Operations position at American Recruiters?
Manager, AWS Cloud Operations
Position Summary
The Manager, AWS Cloud Operations is responsible for overseeing, optimizing, and safeguarding all cloud infrastructure running on Amazon Web Services. This role leads the day-to-day operations of cloud environments, ensuring reliability, security, scalability, and cost efficiency. The manager will direct a team of cloud operations engineers, drive automation initiatives, strengthen cloud governance, and partner across technical and business units to support enterprise-wide objectives.
Key focus areas include infrastructure performance, monitoring, DevOps automation, disaster recovery, compliance frameworks, cost optimization, and continuous improvement of cloud operational practices.
Essential Responsibilities
Cloud Infrastructure & Operations
Oversee the design, deployment, and management of AWS services such as EC2, S3, RDS, Lambda, and VPC networking.
Ensure availability, resilience, and scalability of all cloud environments.
Perform capacity planning and implement autoscaling strategies to support workload changes.
Monitor cloud resources for performance, reliability, and efficient utilization.
Monitoring & Incident Response
Configure and maintain monitoring and logging tools (e.g., CloudWatch, CloudTrail).
Troubleshoot and resolve AWS infrastructure issues with minimal downtime.
Lead incident response activities and ensure root-cause analysis is completed.
Security & Compliance
Enforce AWS security best practices, policies, and guardrails.
Maintain compliance with regulatory and organizational security requirements.
Partner with security teams to assess risks, remediate vulnerabilities, and improve cloud posture.
Cost Management
Evaluate and optimize AWS spend through rightsizing, Reserved Instances, Spot usage, and architectural improvements.
Utilize tools such as AWS Cost Explorer and Trusted Advisor to identify inefficiencies.
Provide cost reporting and recommendations for ongoing savings.
Team Leadership & Cross-Functional Collaboration
Lead, mentor, and develop a team of cloud operations engineers.
Collaborate with infrastructure, network, security, integration, and application teams to ensure alignment on cloud strategy.
Provide expert guidance and foster a culture of operational excellence.
Automation & DevOps Enablement
Drive Infrastructure as Code (IaC) adoption using CloudFormation or similar tools.
Automate deployment, scaling, and configuration processes to reduce manual work.
Support CI/CD pipelines and cloud-native release processes.
Backups & Business Continuity
Ensure backups and snapshots for critical cloud services meet RTO/RPO benchmarks.
Support disaster recovery planning and testing.
Performance Optimization
Continuously evaluate and tune cloud services for optimal performance.
Support mission-critical enterprise applications and workloads.
Documentation & Reporting
Maintain detailed architecture diagrams, procedures, and runbooks.
Produce reports on operational health, incidents, metrics, and improvement initiatives.
Migration Initiatives
Work with technical teams to migrate enterprise applications from on-premises environments into AWS.
Support modernization efforts and cloud-first strategies.
Cloud Operations Standards & Governance
Develop training programs, certification paths, and operational best practices for the team.
Participate in change management and post-incident review processes.
Identify emerging trends, tools, and technologies to enhance cloud operations.
Additional Responsibilities
Participate in IT projects and cross-functional initiatives.
Provide on-call support as needed, including nights, weekends, and holidays.
Perform other duties as required.
Job Requirements
Education
Bachelor’s degree in Computer Science, Engineering, or a related field.
Certifications
AWS certifications required; preference for AWS Solutions Architect and AWS SysOps Administrator.
Experience
Minimum five years leading a cloud operations team.
At least eight years in AWS operations or engineering roles.
Hands-on experience with monitoring, automation, and DevOps tools such as CloudWatch, CloudFormation, Terraform, Jenkins, or similar.
Knowledge, Skills & Abilities
Deep expertise in AWS services, architecture, and cloud best practices.
Strong understanding of cloud security, compliance, and governance.
Excellent troubleshooting, problem-solving, and incident management skills.
Ability to collaborate across engineering, operations, and business functions.