Demo

Site Reliability Engineer (Space Communications)

Northwood
Torrance, CA Full Time
POSTED ON 12/8/2025
AVAILABLE BEFORE 2/8/2026
About Northwood:

Northwood is on a mission to transform connectivity between earth and space and bring the benefits of space to the masses through innovations in space communications technologies. If you like building quickly and seeing your work deployed in locations around the globe with real impact, we want you at Northwood.

Role:

Northwood is looking for an Site Reliability Engineer to help build the monitoring and reliability systems that keep satellites connected to Earth. As we rapidly scale our ground station network across multiple continents, you'll build the observability infrastructure that ensures our space communications systems operate 24/7 for customers ranging from commercial satellite operators to national security missions.

This is a high-growth role where you'll evolve from building core monitoring systems to potentially leading infrastructure teams and architecting global-scale reliability platforms. You'll work directly with our founding engineering team to establish the monitoring, alerting, and deployment practices that will scale with us from startup to enterprise. If you're excited about space technology and want to build infrastructure that directly supports mission-critical satellite operations, this role offers that opportunity.

Responsibilities:

  • Build and maintain observability stack (Grafana, Prometheus, Loki, Vector, VictoriaMetrics) that monitors ground stations, satellite communication systems, and cloud infrastructure across multiple AWS regions
  • Support CI/CD pipelines using GitLab and ArgoCD, partnering with development teams to ensure reliable deployments of mission-critical software
  • Develop and maintain AWS infrastructure using Terraform, with focus on multi-region reliability and automated scaling for ground station operations
  • Deploy and manage Kubernetes applications with Helm, ensuring both developer productivity and system uptime for satellite communication services
  • Establish monitoring strategies, alerting frameworks, and incident response procedures for infrastructure supporting real-time satellite communications
  • Participate in on-call rotation and lead post-incident reviews to continuously improve system reliability

Basic Qualifications

  • 2-5 years of production infrastructure and monitoring experience with measurable reliability improvements
  • Strong experience with Kubernetes, Docker, and container orchestration in production environments
  • Hands-on experience with CI/CD tools and infrastructure as code (Terraform preferred)
  • AWS experience with multi-service deployments and Python programming skills for automation
  • Self-directed work style with ability to own projects from conception to production in fast-moving environments
  • Understanding of SRE principles, SLOs/SLIs, and systematic approaches to system reliability

Preferred Qualifications

  • Experience with observability tools (Vector, Loki, Grafana, Prometheus) in production environments
  • Familiarity with HashiCorp Vault, Okta, or similar identity/secrets management systems
  • Previous experience scaling infrastructure at high-growth companies (startup to 100 employees)
  • AWS certification or demonstrated expertise with advanced cloud networking and security
  • Linux system administration experience and networking fundamentals
  • Interest in aerospace, telecommunications, or mission-critical systems

Additional Information:

To conform to U.S. Government space technology export regulations, including the International Traffic in Arms Regulations (ITAR) you must be a U.S. citizen, lawful permanent resident of the U.S., protected individual as defined by 8 U.S.C. 1324b(a)(3), or eligible to obtain the required authorizations from the U.S. Department of State.

Northwood is an Equal Opportunity Employer; employment with Northwood is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Salary.com Estimation for Site Reliability Engineer (Space Communications) in Torrance, CA
$103,294 to $121,427
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer (Space Communications)?

Sign up to receive alerts about other jobs on the Site Reliability Engineer (Space Communications) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$76,670 - $90,826
Income Estimation: 
$91,609 - $118,978
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$169,957 - $202,398
Income Estimation: 
$151,875 - $212,356
Income Estimation: 
$120,143 - $165,703
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Northwood

  • Northwood Torrance, CA
  • About Northwood : Northwood is on a mission to transform connectivity between earth and space and bring the benefits of space to the masses through innovat... more
  • 12 Days Ago

  • Northwood Torrance, CA
  • About Northwood Space: Northwood is on a mission to transform connectivity between earth and space and bring the benefits of space to the masses through in... more
  • 12 Days Ago

  • Northwood Torrance, CA
  • About Northwood: Northwood is a modern space infrastructure company focused on connecting space and Earth. The world runs on space. Space will run on North... more
  • 12 Days Ago

  • Northwood Torrance, CA
  • Role: Northwood is looking for a Senior Site Reliability Engineer to architect and lead the monitoring and reliability systems that keep satellites connect... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Site Reliability Engineer (Space Communications) jobs in the Torrance, CA area that may be a better fit.

  • Northwood Torrance, CA
  • Role: Northwood is looking for a Senior Site Reliability Engineer to architect and lead the monitoring and reliability systems that keep satellites connect... more
  • 12 Days Ago

  • divergent Torrance, CA
  • Divergent is a technology company that has architected, invented, built, and commercialized an end-to-end factory system called the Divergent Adaptive Prod... more
  • 20 Days Ago

AI Assistant is available now!

Feel free to start your new journey!