Site Reliability Engineer / Platform Engineer

Agile Fuel | World-class Dedicated Engineering Teams
Mountain View, CA Full Time
POSTED ON 11/27/2025
AVAILABLE BEFORE 12/27/2025
Our client is a fast-growing AI-driven technology company focused on building intelligent, automated solutions that transform how modern engineering teams work. They are committed to creating a development culture where speed, reliability, and data-driven decision-making are at the core. Their product leverages advanced analytics and AI to help organizations improve productivity, enhance visibility, and deliver software more efficiently.

They are seeking a hybrid Site Reliability Engineer / Platform Engineer with strong DevOps expertise and solid Python engineering skills. This person will design, build, and operate the next generation of their cloud infrastructure and internal developer platforms. The ideal candidate is passionate about automation, observability, reliability, and scalable system design. In this role, you will drive improvements across cloud architecture, CI/CD workflows, development tooling, and operational excellence, enabling the engineering organization to ship faster and more reliably.

If you thrive in a fast-moving, AI-native environment and enjoy building intelligent, highly automated platforms, this role is an excellent fit.



Responsibilities

  • Design, build, and maintain highly reliable, scalable Azure infrastructure using Container Apps, ACR, managed databases, serverless components, and other PaaS services;
  • Own and enhance CI/CD pipelines, deployment workflows, platform automation, and the full observability stack;
  • Develop Python-based tooling and infrastructure to support a scalable, reliable AI-driven platform;
  • Architect and maintain secure, fault-tolerant integrations with external systems (GitHub, Jira, Azure, Redis, Sentry, etc.);
  • Build and operate monitoring, logging, alerting, and SLO/SLA frameworks to ensure reliability and performance;
  • Partner with backend and data engineering teams to design a scalable infrastructure foundation for high-growth AI products;
  • Continuously optimize cost efficiency, reliability, and deployment velocity;
  • Scale AI infrastructure and support the transition to an AI-native engineering organization;
  • Drive an AI-native culture by leveraging LLM-powered workflows and automation for speed and efficiency.

Requirements

  • 5 years of experience in DevOps, SRE, Platform Engineering, or similar roles;
  • Expert-level understanding of cloud infrastructure, ideally Azure, including container services, serverless patterns, networking, and identity;
  • Strong Python software engineering ability — building platform tools, automation frameworks, or backend services;
  • Hands-on experience with containerization, Docker, and cloud-native operational patterns;
  • Strong understanding of external system integrations, how to design around them, and how to build reliable abstractions when they fail;
  • Experience designing and operating production-grade pipelines, monitoring, alerting, and observability tools;
  • Practical understanding of resilience engineering: retries, backoff, idempotency, state management, and failure modes;
  • A bias toward automation: if something can be automated, you automate it;
  • A startup mindset: ownership, speed, pragmatic decision-making, and willingness to wear multiple hats;
  • Interest in and excitement about AI-native development workflows using tools like ChatGPT, GitHub Copilot, and automated pipeline orchestration;
  • Upper-intermediate level of English.

Bonus points for

  • Experience with Bicep, Terraform or other IaC tools;
  • Background supporting Python/Django or data pipelines;
  • Familiarity with Celery, distributed queues, or event-driven systems;
  • Experience working in SOC2-compliant or enterprise-grade environments;
  • Experience building internal developer platforms (IDPs) or self-service infrastructure.

We offer excellent benefits, including but not limited to

  • People-oriented management without bureaucracy;
  • Flexible schedule (≈ 3 hours overlap with ET);
  • 15 working days of annual paid vacation;
  • Paid sick leave;
  • Friendly and engaging professional team;
  • Opportunities for self-realization, career, and professional growth.

Salary.com Estimation for Site Reliability Engineer / Platform Engineer in Mountain View, CA
$130,475 to $156,544


