
Research Engineer, Data Infrastructure

Mistral AI
San Francisco, CA Full Time
POSTED ON 4/30/2026
AVAILABLE BEFORE 6/30/2026

About Mistral 

 

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

 

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.

 

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

 

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.


Role Summary 

 

This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems built for high performance.
You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.

 

 

What you will do

 

Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems.

Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments.
Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by o

 

About you

 

Have 4 years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
Have experience or a strong interest in supporting foundational compute and storage platforms.
Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.
Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.
Take pride in building and operating scalable, reliable, and secure systems from the ground up.
Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.

 

 



What we offer
  • 💰 Competitive salary and equity.
  • 🚑 Healthcare: Medical/Dental/Vision covered for you and your family.
  • 👴🏻 Pension: 401(k) (6% matching)
  • 🏝️ PTO: 18 days
  • 🚗 Transportation: reimbursement of office parking charges, or $120/month for public transport
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger)
  • 🌎 Visa sponsorship 
  • 🤝 Coaching: we offer BetterUp coaching on a voluntary basis
 
By applying, you agree to our Applicant Privacy Policy.



Salary.com Estimation for Research Engineer, Data Infrastructure in San Francisco, CA
$148,640 to $182,393




