
ECS Site Reliability Engineer

Alibaba Cloud
Seattle, WA Full Time
POSTED ON 12/24/2025
AVAILABLE BEFORE 1/30/2026

Elastic Compute Service (ECS) is a core product of Alibaba Cloud. The Elastic Compute team is dedicated to building world-leading cloud computing infrastructure. As a key component of Alibaba Cloud's self-developed Apsara operating system, ECS provides full-stack computing resources covering virtual machine instances, container services, and heterogeneous computing clusters.


Through technological innovation and product optimization, the Alibaba Cloud Elastic Compute team continuously drives advancements in cloud computing technologies, delivering high-quality computing services to users worldwide. Our goal is not only to support enterprises in achieving elastic scalability but also to deeply empower infrastructure innovation in the new era. Our mission is to build an intelligent foundation of "Computing as a Service," enabling developers and businesses to concentrate on breakthroughs without worrying about the complex engineering implementations from chips to clusters.



SRE Team:

The Alibaba Cloud Elastic Compute Service (ECS) SRE (Site Reliability Engineering) team is a critical force in ensuring system stability and reliability. The SRE team focuses on guaranteeing the high availability, high performance, and robust stability of ECS products through technical expertise and innovation.


The Alibaba Cloud ECS SRE team is not only a core technical safeguard but also a driver of technological innovation and continuous optimization. By leveraging technical capabilities and collaborative teamwork, we ensure the stability and reliability of ECS products, safeguarding global customers' businesses. Additionally, we are committed to advancing cloud computing technologies through knowledge sharing and industry collaboration.


Joining the Alibaba Cloud ECS SRE team offers the opportunity to engage in the development and optimization of world-leading cloud computing technologies, while growing alongside a passionate and creative team.



This is an SRE or DevOps position focused on the entire Elastic Computing product line. The responsibilities of this role include:

1. Stability, Performance Optimization, Monitoring, and Operations: Oversee the stability, performance optimization, monitoring, and operational work for multiple core products of Alibaba Cloud (such as ECS, ACK, ACS, heterogeneous computing clusters, OOS, and Compute Nest), taking responsibility for the online stability of these products.

2. Operation System and Online System Development: Engage in the development of operation systems and some online systems. Through tools, process optimization, and system improvements, ensure the stability and performance of Alibaba Cloud's Elastic Computing-related products.

3. Customer and Team Collaboration: Work closely with other teams (such as R&D, after-sales support, etc.) to ensure efficient technical support and problem resolution.

Candidates can choose to take responsibility for one or more core duties based on their expertise. Meanwhile, we are looking for experts who possess cross-team collaboration skills and system-level thinking abilities.



Minimum qualifications:

- Professional Knowledge and Experience

● Bachelor’s degree or higher in Computer Science, Information Technology, or a related field.

● At least 3 years of experience in system operations or SRE, with familiarity with cloud computing services and core products (e.g., ECS, K8s, heterogeneous computing).

● Familiarity with the design and optimization of cloud resource provisioning and delivery systems; experience in serving overseas customers is preferred.

● In-depth understanding of the overall architecture and operational mechanisms of the elastic computing product line, with the ability to quickly identify and resolve complex issues.



Preferred qualifications:

● Possession of cloud-related certifications (e.g., ACP, ACE, or other major cloud vendor certifications).

● Participation in the architectural design or performance optimization projects of large cloud platforms.

● Outstanding contributions in system stability assurance, automation tool development, or cloud-native domains are highly valued.


Position Highlights

As an SRE for elastic computing, you will have the opportunity to:

● Deeply engage in the core operations of Alibaba Cloud's elastic computing product line, ensuring service stability for global users.

● Explore cutting-edge technologies in virtualization, containerization, and cloud-native computing, driving technological innovation.

● Grow within an open and innovative team, collaborating with top engineers to solve complex technical challenges.

If you are passionate about technology, strive for excellence, and wish to leverage your expertise in the cloud computing domain, we welcome you to join us!




The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.



If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.


Alibaba U.S. based full time regular employees have access to medical, dental, and vision insurance, a 401(k) plan and basic life insurance, and wellbeing benefits like FSA, subject to the terms and conditions of the applicable plans then in effect. U.S. based employees are also eligible to receive up to 12 paid holidays, accrue up to 15 paid vacation days for this position, and receive up to 72 hours paid sick time (front-loaded) per calendar year.

Salary: $133,200 - $219,600



