Senior Network Engineer

Alibaba Cloud
Sunnyvale, CA Full Time
POSTED ON 1/7/2026
AVAILABLE BEFORE 2/8/2026

Job Description

1. Observability Link Construction for Operations and Maintenance

a. Maintain a global perspective on stability, and develop and implement stability solutions.

b. Pre-event: Establish and continually optimize monitoring mechanisms for application operations and maintenance; develop and maintain corresponding monitoring platforms/tools.

c. During the event: Establish and continuously optimize alerting mechanisms for application operations and maintenance, ensuring that faults can be quickly discovered, located, and addressed.

d. Post-event: Quickly analyze, diagnose, and locate problems, and collaborate with relevant personnel to resolve them. Establish and improve rapid service recovery mechanisms to reduce business impact, and ensure stable business operations by identifying and eliminating potential risks through stability governance projects and architectural optimizations.

2. Stability Operations and Maintenance Platform Construction

a. Design, develop, and maintain reliable operations and maintenance platforms and tools, such as inspection systems, water level systems, delivery systems, and cost management systems, to address delivery, performance, stability, and cost issues encountered by production systems, ensuring business availability and improving performance and efficiency.

b. Responsible for data-driven analysis of operations and maintenance quality; analyze and study daily operations and maintenance metrics, issues, and risks to establish models and provide optimization suggestions for operations and maintenance.

3. Application Operations and Maintenance Standard Construction

a. Establish operations and maintenance process specifications and standards (such as change standards, protection plans, and cloud product configuration standards) to ensure that operations and maintenance are consistent and standardized, thereby enhancing stability.

b. Develop and implement emergency response specifications and standards for application operations and maintenance faults.

c. Develop and implement alarm handling specifications and standards for application operations and maintenance, as well as Service Level Agreements (SLA).

4. Resource Optimization

a. Based on business requirements, handle budget preparation, capacity planning, and resource readiness, and coordinate with development teams to forecast and estimate consumption of resources such as storage and computing.

b. Analyze business demands and, while ensuring stability, take water levels, specifications, and billing rules into account; review the resource estimates in technical solutions for reasonableness, and collaborate with development teams to reduce resource costs.

5. Security Assurance Construction

a. Provide 24/7 emergency response, handle daily monitoring alerts and emergencies, and continuously identify and rectify existing issues.

b. Responsible for operations and maintenance support during major events (such as National Day, Spring Festival, New Year's Day, and significant activities).

c. Develop and drill emergency plans, respond to emergencies, and handle faults.

d. Establish a problem/fault record repository, conduct targeted analysis of the repository, and enhance and optimize the emergency plan repository and standard process repository.

6. Architecture Upgrade

a. Responsible for system architecture upgrades, such as kernel upgrades, inter-room (cross-data-center) service migration, and containerization.

b. Responsible for the design and implementation of disaster recovery architecture, such as local disaster recovery and multi-active geographically distributed setups.


Job Requirements

1. Fluent in Chinese, with the ability to clearly articulate technical issues and solutions.

2. Over 3 years of experience in operations and maintenance in related fields such as applications, networks, and containerization.

3. Solid foundational skills in architecture design, performance optimization, and stability optimization.

4. Capable of applying intelligent and automated operations and maintenance platforms and tools, designing and utilizing complex workflows and daily operational templates, quickly identifying, locating, and resolving relatively complex faults, thereby improving operational efficiency.

5. Able to summarize and consolidate issues discovered in daily operations and maintenance into operational experience, and apply this knowledge to enhance capabilities within the operations and maintenance platform.

6. Proficient in protocols such as TCP/IP, DNS, and HTTP, with the ability to perform preliminary analysis of network traffic and troubleshoot network issues.

7. Familiar with at least one cloud service platform (such as AWS, Alibaba Cloud, or Azure) and its related mainstream products (such as Flink, MaxCompute, Log Service, RDS, and Redis), and able to perform initial troubleshooting and resolve basic issues with the corresponding cloud products.

8. Bonus Points: Familiarity with DPDK (Data Plane Development Kit) and experience in enhancing network processing performance.

9. Bonus Points: Some development ability, to advance the automation of operations and maintenance.

10. Bonus Points: Strong business understanding, capable of independently handling complex issues with real case examples.

11. Bonus Points: Possessing personal judgment regarding business issues, able to skillfully utilize processes and tools to identify risks and formulate solutions.

12. Bonus Points: Having a certain level of influence within the business line and able to gain recognition from surrounding teams.




The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.


If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Salary: $133,200 - $219,600
