Demo

Senior Site Reliability Engineer

Northwood
Torrance, CA Full Time
POSTED ON 11/18/2025
AVAILABLE BEFORE 12/17/2025
Role:

Northwood is looking for a Senior Site Reliability Engineer to architect and lead the monitoring and reliability systems that keep satellites connected to Earth. As we rapidly scale our ground station network across multiple continents, you'll design and build the observability infrastructure that ensures our space communications systems operate 24/7 for customers ranging from commercial satellite operators to national security missions.

This is a high-impact leadership role where you'll architect global-scale reliability platforms while mentoring junior engineers and establishing SRE practices across the organization. You'll work directly with our founding engineering team and department heads to define the monitoring, alerting, and deployment strategies that will scale with us from startup to enterprise. If you're excited about space technology and want to architect infrastructure that directly supports mission-critical satellite operations while building and leading technical teams, this role offers that opportunity.

Responsibilities:

  • Architect and maintain enterprise observability stack (Grafana, Prometheus, Loki, Vector, VictoriaMetrics) monitoring ground stations, satellite communications, and multi-region AWS infrastructure
  • Design SRE practices, error budgets, and SLO/SLI frameworks for mission-critical satellite systems with 99.9% uptime requirements
  • Build advanced AWS infrastructure with Terraform, implementing multi-region reliability, automated scaling, and disaster recovery for ground station operations
  • Lead CI/CD pipeline architecture using GitLab and ArgoCD with advanced deployment strategies for mission-critical software releases
  • Mentor junior engineers and establish reliability standards across the growing engineering organization
  • Design comprehensive Kubernetes deployments with Helm, focusing on high availability and zero-downtime operations
  • Lead incident response, conduct post-mortems, and drive systematic reliability improvements

Basic Qualifications

  • 5-8 years of production infrastructure and SRE experience with demonstrated leadership in reliability improvements and team mentorship
  • Expert-level experience with Kubernetes, Docker, and container orchestration in large-scale production environments
  • Strong background in infrastructure as code (Terraform) and advanced CI/CD practices with experience mentoring others on these technologies
  • Advanced AWS experience including multi-region architectures, networking, security, and cost optimization, with demonstrated ability to architect complex cloud solutions
  • Proven track record of leading technical projects from conception to production in fast-moving, high-growth environments
  • Deep understanding of SRE principles, error budgets, SLOs/SLIs, and experience implementing reliability frameworks across engineering organizations

Preferred Qualifications

  • Production experience architecting and scaling observability tools (Vector, Loki, Grafana, Prometheus, VictoriaMetrics) in high-throughput environments
  • Advanced experience with HashiCorp Vault, Okta, and enterprise identity/secrets management systems including policy design and implementation
  • Previous experience scaling infrastructure and leading technical teams at high-growth companies (startup to 500 employees)
  • AWS Professional certification or equivalent demonstrated expertise with advanced cloud networking, security, and compliance frameworks
  • Strong Linux system administration and networking expertise with experience troubleshooting complex distributed systems
  • Background in aerospace, telecommunications, defense contracting, or other mission-critical, highly regulated industries
  • Experience with ITAR, NIST 800-171, or other defense/aerospace compliance requirements

Salary.com Estimation for Senior Site Reliability Engineer in Torrance, CA
$121,134 to $141,586
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Senior Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$151,875 - $212,356
Income Estimation: 
$169,957 - $202,398
Income Estimation: 
$154,184 - $199,940
Income Estimation: 
$189,563 - $242,917
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Northwood

Northwood
Hired Organization Address Northwood, OH Full Time
Make an Impact. Be Inspired. Join Northwood! Are you looking for a career where you can make a real impact? At Northwood...
Northwood
Hired Organization Address Torrance, CA Full Time
About Northwood Space: Northwood is on a mission to transform connectivity between earth and space and bring the benefit...
Northwood
Hired Organization Address Torrance, CA Full Time
About Northwood : Northwood is on a mission to transform connectivity between earth and space and bring the benefits of ...
Northwood
Hired Organization Address Torrance, CA Full Time
About Northwood Space: Northwood is on a mission to transform connectivity between earth and space and bring the benefit...

Not the job you're looking for? Here are some other Senior Site Reliability Engineer jobs in the Torrance, CA area that may be a better fit.

Senior Vehicle Reliability Engineer

czinger, Torrance, CA

AI Assistant is available now!

Feel free to start your new journey!