Demo

Site Reliability Engineer

Archetype AI
Palo Alto, CA Full Time
POSTED ON 1/10/2026
AVAILABLE BEFORE 3/14/2026
About Archetype AI

Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team from Google, Archetype AI is building a foundation model for the physical world, a real-time multimodal LLM for real life, transforming real-world data into valuable insights and knowledge that people will be able to interact with naturally. It will help people in their real lives, not just online, because it understands the real-time physical environment and everything that happens in it.

Supported by deep tech venture funds in Silicon Valley, Archetype AI is currently pre-Series A, progressing rapidly to develop technology for their next stage. This presents a unique and once-in-a-lifetime opportunity to be part of an exciting AI team at the beginning of their journey, located in the heart of Silicon Valley.

Our team is headquartered in Palo Alto, California, with team members throughout the US and Europe.

We are actively growing, so if you are an exceptional candidate excited to work on the cutting edge of physical AI and don’t see a role that exactly fits you below you can contact us directly with your resume via jobsarchetypeaiio.

About The Role

As a Site Reliability Engineer (SRE) at Archetype AI, you will be responsible for designing, scaling, and maintaining the infrastructure that powers our AI-driven products. You will collaborate with backend engineers and ML researchers to ensure that our distributed platforms are fault-tolerant, performant, and highly available.

Core Responsibilities

  • Design, build, and operate highly available distributed systems.
  • Collaborate with engineering and ML teams to ensure reliable deployment of backend services (in Rust, C or similar).
  • Implement monitoring, alerting, and observability solutions across infrastructure.
  • Automate deployments, scaling, and infrastructure provisioning using infrastructure-as-code.
  • Diagnose and resolve performance bottlenecks, system outages, and production incidents.
  • Support AI/ML infrastructure for training and serving models at scale, including GPU clusters, pipelines, and inference services.
  • Contribute to infrastructure architecture, standards, and operational best practices.

Minimum Qualifications

  • 5 years of experience as SRE, DevOps, or Systems Engineer.
  • Strong expertise in distributed systems, fault-tolerant architectures, and large-scale production environments.
  • Proficiency in Rust, C , or other backend languages with willingness to learn.
  • Solid experience with Kubernetes, containers, and cloud platforms (AWS, GCP, Azure).
  • Hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Experience with data pipelines, messaging systems, and streaming technologies (Kafka, Pulsar, etc.).
  • Familiarity with AI/ML infrastructure (training pipelines, GPU clusters, inference systems).
  • Strong debugging, problem-solving, and automation mindset (Terraform, Ansible, Pulumi, scripting).
  • Excellent communication and collaboration skills.

Preferred Qualifications

  • Experience with real-time or low-latency systems.
  • Open-source contributions to distributed systems or infrastructure projects.
  • Knowledge of security best practices for distributed environments.
  • Experience with edge or embedded systems and sensor-based infrastructure.
  • Background in multimodal data fusion or physical-world perception systems.

What We Value

  • Ownership – You take initiative, follow through, and care deeply about quality and outcomes.
  • Motivation – You’re driven to solve complex problems and continuously raise the bar for yourself and your team.
  • Excellence – You bring discipline, clarity, and rigor to your craft—and help others do the same.
  • Collaboration – You work well with others, mentor generously, and contribute to a high-trust, high-performance culture.

Salary.com Estimation for Site Reliability Engineer in Palo Alto, CA
$114,571 to $145,057
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$158,960 - $205,707
Income Estimation: 
$154,509 - $200,187
Income Estimation: 
$71,493 - $96,419
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Archetype AI

  • Archetype AI Palo Alto, CA
  • About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team f... more
  • 3 Days Ago

  • Archetype AI Palo Alto, CA
  • About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team f... more
  • 3 Days Ago

  • Archetype AI Palo Alto, CA
  • About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team f... more
  • 3 Days Ago

  • Archetype AI Palo Alto, CA
  • About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team f... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Palo Alto, CA area that may be a better fit.

  • Candidate Experience site Santa Clara, CA
  • At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members... more
  • 22 Days Ago

  • ExecutivePlacements.com Sunnyvale, CA
  • Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. ... more
  • 13 Days Ago

AI Assistant is available now!

Feel free to start your new journey!