Demo

Senior Site Reliability Engineer

Spotify
York, NY Full Time
POSTED ON 3/13/2026
AVAILABLE BEFORE 5/13/2026

Are you excited by the idea of building fast, reliable, and intelligent infrastructure for a product used by engineering teams around the world? We’re looking for a Senior Site Reliability Engineer to join the Backstage team at Spotify. We’re building the next generation of our developer platform — one that doesn’t just manage software, but actively helps create and maintain it through AI-native workflows.


In 2026, SRE isn’t just about uptime; it’s about symbiosis. As part of our growing engineering team, you’ll design, build, and operate the cloud infrastructure behind our external developer portal product and our internal fleet of background coding agents. You’ll collaborate closely with experienced engineers (both human and AI-assisted) while operating at real-world scale, with deep observability, strong safety boundaries, and the unique reliability challenges of agentic production systems.


Backstage is more than just a platform — it’s a foundational force in the developer community. Born out of Spotify’s quest for better developer tooling, Backstage now powers developer portals across the globe. But we didn’t stop at catalogs and templates. Today, Backstage is becoming the command center for AI-native engineering. From enterprises orchestrating large-scale migrations to fast-moving teams using AI to improve velocity and quality, our solutions are redefining what great developer experience looks like.


As part of the Backstage team, you’ll shape developer experience for companies large and small, for our thriving open-source community, and for Spotify itself. You’ll help define how reliable, secure infrastructure enables the next wave of agentic developer tooling.

\n


What You'll Do
  • Own fleet reliability. Lead the reliability, security, and scalability strategy for Portal’s SaaS infrastructure, including the runtime environments that power our platform and LLM-driven agent workflows. Define SLOs, drive capacity planning, and ensure our systems meet the demands of a rapidly growing product.
  • Architect for the agentic era. Design and evolve infrastructure on GCP and AWS using Terraform and infrastructure-from-code patterns. Shape how we structure environments for non-deterministic AI workloads — including sandboxing, resource isolation, cost governance, and security boundaries.
  • Drive operational excellence. Evolve our incident management, on-call, and postmortem practices. Leverage AI assistants to accelerate root cause analysis and build increasingly self-healing capabilities into our production systems.
  • Lead fullstack reliability. Operate across a modern web stack (TypeScript, React, Python). While not frontend-heavy, you’ll diagnose and resolve issues across the stack and drive reliability improvements end-to-end.
  • Mentor and multiply. Raise the reliability IQ of the broader engineering team. Establish SRE best practices, conduct production-readiness reviews, and mentor engineers on operational thinking.
  • Shape the roadmap. Partner with engineering and product leadership to evolve our infrastructure in step with generative AI features. Translate operational insights into strategic input on the product roadmap.


Who You Are
  • You have 5 years of hands-on experience operating cloud infrastructure (GCP and/or AWS), using Terraform and Kubernetes to run production systems at scale.
  • You have practical experience — or a strong demonstrated interest — in operating LLM-based systems, RAG pipelines, or agentic workloads, and understand the reliability challenges of non-deterministic systems.
  • You think in distributed systems first principles — consistency, availability, partition tolerance — and translate that thinking into pragmatic infrastructure decisions.
  • You are proficient in at least one modern language (TypeScript, Java, Go, or Python) and comfortable navigating large, heterogeneous codebases, including environments where AI-generated PRs are common.
  • You build automation and improve systems so that whole categories of operational issues disappear over time.
  • You communicate complex infrastructure trade-offs clearly to both technical and non-technical stakeholders, and you write postmortems that lead to meaningful change.


Where You'll Be
  • This role is based in New York, NY.
  • We offer you the flexibility to work where you work best! There will be some in person meetings, but still allows for flexibility to work from home.


\n

The United States base range for this position is $164,448–$234,926 USD, plus equity. The benefits available for this position include health insurance, six-month paid parental leave, 401(k) retirement plan, monthly meal allowance, 23 paid days off, paid flexible holidays, and paid sick leave. These ranges may be modified in the future.


Spotify is an equal opportunity employer. You are welcome at Spotify for who you are, no matter where you come from, what you look like, or what’s playing in your headphones. Our platform is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all thrive, contribute, and be forward-thinking! So bring us your personal experience, your perspectives, and your background. It’s in our differences that we will find the power to keep revolutionizing the way the world listens.


At Spotify, we are passionate about inclusivity and making sure our entire recruitment process is accessible to everyone. We have ways to request reasonable accommodations during the interview process and help assist in what you need. If you need accommodations at any stage of the application or interview process, please let us know - we’re here to support you in any way we can.


Spotify transformed music listening forever when we launched in 2008. Our mission is to unlock the potential of human creativity by giving a million creative artists the opportunity to live off their art and billions of fans the chance to enjoy and be passionate about these creators. Everything we do is driven by our love for music and podcasting. Today, we are the world’s most popular audio streaming subscription service.

Salary.com Estimation for Senior Site Reliability Engineer in York, NY
$116,765 to $136,984
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Senior Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Spotify

  • Spotify York, NY
  • Our mission on the Advertising Product & Technology team is to build a next generation advertising platform that aligns with our unique value proposition f... more
  • 9 Days Ago

  • Spotify York, NY
  • Spotify Trust & Safety is dedicated to creating a platform that’s safe for users and creators and true to our values. We’re looking for a Manager to lead o... more
  • 9 Days Ago

  • Spotify York, NY
  • The Platform team creates the technology that enables Spotify to learn quickly and scale easily, enabling rapid growth in our users and our business around... more
  • 9 Days Ago

  • Spotify York, NY
  • Sell what you love. For us and millions of users across the globe, that’s Spotify. Join the Sales team and you’ll build the relationships that help grow ou... more
  • 9 Days Ago


Not the job you're looking for? Here are some other Senior Site Reliability Engineer jobs in the York, NY area that may be a better fit.

  • Stellar Development Foundation York, NY
  • Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven te... more
  • 15 Days Ago

  • Ro York, NY
  • Ro is a direct-to-patient healthcare company with a mission of helping patients achieve their health goals by delivering the easiest, most effective care p... more
  • 1 Day Ago

AI Assistant is available now!

Feel free to start your new journey!