Demo

Member of Technical Staff, Synthetic Data

Recruiting from Scratch
San Francisco, CA Full Time
POSTED ON 4/21/2026
AVAILABLE BEFORE 5/20/2026
Who is Recruiting from Scratch:

Recruiting from Scratch is a specialized talent firm dedicated to helping companies build exceptional teams. We partner closely with our clients to deeply understand their needs, then connect them with top-tier candidates who are not only highly skilled but also the right fit for the company’s culture and vision. Our mission is simple: place the best people in the right roles to drive long-term success for both clients and candidates.

https://www.recruitingfromscratch.com/

Title of Role: Member of Technical Staff - Synthetic Data

Location: San Francisco, CA or Manhattan, NY

Company Stage of Funding: Seed-stage, venture-backed

Office Type: Onsite (5 days per week)

Salary: $150,000 - $350,000 base equity

Company Description

Our client is a fast-growing, venture-backed AI startup building next-generation infrastructure to enable more capable and reliable AI agents in enterprise environments. Backed by top-tier investors and operating at the frontier of AI, the team focuses on creating highly realistic synthetic environments and data that allow agents to train, evaluate, and operate in complex, real-world scenarios.

This is a small, deeply technical team with a strong culture of ownership, speed, and precision. They are scaling quickly and tackling problems that sit at the intersection of data engineering, simulation, and applied AI research.

What You Will Do

As a Member of Technical Staff focused on Synthetic Data, you will play a critical role in building and owning the company’s synthetic data and simulation pipelines. This role is data engineering-first, with meaningful exposure to applied research.

You will:

  • Design, build, and maintain end-to-end data engineering pipelines, including ingestion, transformation, storage, and schema evolution
  • Develop large-scale synthetic data pipelines that simulate realistic enterprise environments, including logs, usage data, and operational metrics
  • Work with massive datasets (tens of millions of rows) and optimize systems for scalability, performance, and cost
  • Create realistic, noisy, and imperfect data that mirrors real enterprise complexity and inconsistency
  • Ensure cross-system consistency so AI agents can navigate multi-application workflows
  • Read, interpret, and apply research related to simulation and synthetic data generation
  • Collaborate closely with a highly technical team to iterate quickly and ship impactful infrastructure

Ideal Candidate Background

We’re looking for candidates who are excited by early-stage environments and high-ownership roles.

You may be a strong fit if you have:

  • 2 years of experience in data engineering, or are a recent graduate with substantial internships or research experience
  • Strong programming skills in Python and SQL
  • Experience building or maintaining data pipelines in production environments
  • Comfort working with ambiguity and rapidly evolving requirements
  • A strong sense of ownership and the ability to move quickly without sacrificing quality

Preferred

While not required, the following are strong pluses:

  • Experience with synthetic data, simulations, or related research
  • Open-source contributions or published research
  • Experience working at early-stage or fast-growing startups
  • Familiarity with AI systems, agent workflows, or applied ML infrastructure
  • Experience working with large-scale datasets and distributed systems

Compensation, Benefits, and Other Details

  • Base Salary: $150,000 - $350,000 (depending on experience)
  • Equity: Competitive equity package
  • Employment Type: Full-time
  • Work Authorization: Visa sponsorship may be available for the right candidate
  • Location Requirement: Onsite in San Francisco or Manhattan
  • Hiring Goal: 2 hires, with high urgency

Why This Role Is Exciting

This is an opportunity to work directly on core infrastructure that enables the next generation of AI agents. You’ll operate at the intersection of data engineering and AI research, have real ownership over critical systems, and collaborate with a small team moving at exceptional speed.

If you’re motivated by impact, technical depth, and building from the ground up in a high-growth startup environment, this role offers both challenge and upside.

Salary : $150,000 - $350,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Recruiting from Scratch

  • Recruiting from Scratch San Francisco, CA
  • Who is Recruiting from Scratch: Recruiting from Scratch is a specialized talent firm dedicated to helping companies build exceptional teams. We partner clo... more
  • 13 Days Ago

  • Recruiting from Scratch San Francisco, CA
  • Title: Fullstack Software Engineer Location: San Francisco, CA (Hybrid) Company Stage of Funding: Private Equity–Backed (Profitable Portfolio) Office Type:... more
  • 14 Days Ago

  • Recruiting from Scratch Phoenix, AZ
  • Title of the Role: Front End Engineer Location: Phoenix, AZ Company Stage of Funding: Early-Stage / Defense Technology Startup Office Type: Onsite, Full-Ti... more
  • 1 Day Ago

  • Recruiting from Scratch San Francisco, CA
  • Who is Recruiting from Scratch : Recruiting from Scratch is a premier talent firm that focuses on placing the best product managers, software, and hardware... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Member of Technical Staff, Synthetic Data jobs in the San Francisco, CA area that may be a better fit.

  • Cohere San Francisco, CA
  • Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are bui... more
  • 16 Days Ago

  • fireworksai San Mateo, CA
  • About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and mo... more
  • 8 Days Ago

AI Assistant is available now!

Feel free to start your new journey!