
Founding Software Engineer, Data Infrastructure

Airweave (YC X25)
San Francisco, CA Full Time
POSTED ON 12/25/2025
AVAILABLE BEFORE 1/25/2026
We're looking for a founding engineer to own Airweave's data and infrastructure layer, the systems that make our distributed search and data pipelines scalable, reliable and observable.

At Airweave, you'll build and operate the platform that thousands of AI agents depend on. That means distributed sync pipelines pulling data from dozens of sources, vector databases powering LLM search, and the orchestration layer that keeps it all running. You'll work closely with the product team, but your focus is on the foundation: making sure data flows reliably at scale, LLM inference stays fast, and the whole system holds up under real production load.

This is early-stage infrastructure work. The architecture is still being shaped, and your decisions will define how we scale.

What you'll work on

  • Design and scale distributed data pipelines that sync hundreds of millions of documents from dozens of sources into advanced search indexes
  • Build and improve Temporal workflows for parallel sync orchestration: retries, backpressure, and failure recovery across workers
  • Own our Kubernetes deployments with Helm charts: autoscaling and resource management for bursty search, sync, and LLM workloads
  • Scale PostgreSQL for high throughput: connection pooling, read replicas, partitioning (we ask a lot from this database)
  • Manage vector database (Vespa) infrastructure: sharding, replication, backup strategies for large-scale agentic search
  • Orchestrate and optimize LLM inference pipelines: batching, caching, provider failover
  • Build monitoring and alerting with Prometheus, Grafana, and custom instrumentation for cluster health
  • Manage the base infrastructure as code with Terraform

You might be a fit if

  • You've built or operated data pipelines at scale: ETL, event processing, streaming, or sync infrastructure
  • You're comfortable with Kubernetes, Terraform, and infrastructure as code
  • You've scaled databases and understand the tradeoffs (pooling, replication, sharding)
  • You have experience with distributed systems: workflow orchestration, message queues, eventual consistency
  • You're interested in LLM infrastructure: embeddings, vector search, inference optimization
  • You like building reliable systems and have opinions about observability
  • You're drawn to early-stage environments where you own the whole problem

Bonus points

  • Experience with Temporal, Airflow, or similar workflow engines
  • Background in scaling search (Elastic, Qdrant, Pinecone, Weaviate)
  • Familiarity with LLM inference

What we offer

  • Customers including one of the world's leading AI labs
  • Competitive salary ($120K–$160K) with meaningful equity (0.25%–1.00%)
  • Health, dental, and vision coverage
  • Work in person in San Francisco with a highly skilled, technical team
  • Direct impact on architecture and infrastructure decisions from the first week

Salary: $120,000 - $160,000



