Demo

Applied Scientist / Research Engineer - Multimodal (Come to Singapore)

Mistral AI
Palo Alto, CA Full Time
POSTED ON 4/3/2026
AVAILABLE BEFORE 5/1/2026
About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

About The Job

Mistral AI is seeking world class Applied Scientists and Research Engineers who wish to relocate to Singapore.

You will be focused on multimodal learning (text, image, audio, video) to drive innovative research and collaborate with clients on complex projects.

You will design, train, and deploy SOTA multimodal models (e.g., Omni-models, VLMs, Audio, Image generation, Robotics and much more) and apply them to diverse use cases: enterprise search, agents grounded in images and documents, video understanding, and speech interfaces. You’ll work cross‑functionally with internal and external science, engineering, and product teams to deliver high‑impact AI solutions.

What You Will Do

  • Run pre-training, post-training and deploy state of the art models on clusters with thousands of GPUs. You don’t panic when you see OOM errors or when NCCL feels like not wanting to talk
  • Generate and curate multimodal datasets (web‑scale image‑text, document‑image, audio‑text, video‑text), and build robust evaluators/benchmarks for perception, grounding, OCR, and captioning
  • Develop the necessary tools and frameworks to facilitate data generation, model training, evaluation and deployment
  • Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines
  • Manage research projects and communications with client research teams


About You

  • You are fluent in English, and have excellent communication skills. You are at ease explaining complex technical concepts to both technical and non-technical audiences
  • You’re an expert with PyTorch or JAX
  • You’re not afraid of contributing to a big codebase and can find yourself around independently with little guidance
  • You have experience in one of the following: VLMs, diffusion for image/video, audio processing (ASR/TTS), image processing, robotics
  • You write clean, readable, high-performance, fault-tolerant Python code
  • You don’t need roadmaps: you just do. You don’t need a manager: you just ship
  • Low-ego, collaborative and eager to learn
  • You have a track record of success through personal projects, professional projects or in academia


It would be great if you

  • Hold a PhD / master in a relevant field (e.g., Mathematics, Physics, Machine Learning), but if you’re an exceptional candidate from a different background, you should apply
  • Can bring a variety of research experience (agents, multi-modality, robotics, diffusion, time-series)
  • Have contributed to a large codebase used by many (open source or in the industry)
  • Have a track record of publications in top academic journals or conferences
  • Love improving existing code by fixing typing issues, adding tests and improving CI pipelines


Benefits

Singapore

💰 Competitive cash salary and equity

🚑 Health Insurance

🥎 Sport : $90 for gym membership allowance

🥕 Food : $200 monthly allowance for meals (solution might evolve as we grow bigger)

🚴 Transportation : $120/month for public transport or Parking charges reimbursed

Salary : $120 - $200

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Applied Scientist / Research Engineer - Multimodal (Come to Singapore)?

Sign up to receive alerts about other jobs on the Applied Scientist / Research Engineer - Multimodal (Come to Singapore) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$100,407 - $125,193
Income Estimation: 
$120,989 - $162,093
Income Estimation: 
$74,806 - $91,633
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Mistral AI

  • Mistral AI Munich, ND
  • About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to... more
  • 9 Days Ago

  • Mistral AI York, NY
  • About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to... more
  • 10 Days Ago

  • Mistral AI Palo Alto, CA
  • About Mistral At Mistral we are on a mission to democratize AI, producing frontier intelligence for everyone, developed in the open, and built by engineers... more
  • 11 Days Ago

  • Mistral AI Palo Alto, CA
  • About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to... more
  • 11 Days Ago


Not the job you're looking for? Here are some other Applied Scientist / Research Engineer - Multimodal (Come to Singapore) jobs in the Palo Alto, CA area that may be a better fit.

  • Apple Sunnyvale, CA
  • Summary The Video Computer Vision (VCV) organization is a centralized applied research and engineering team developing real-time, on-device Computer Vision... more
  • 10 Days Ago

  • Luma Palo Alto, CA
  • About Luma AI Luma’s mission is to build multimodal AGI. Through our research on video, 3D, and now multimodal models at Luma, we believe that AI needs to ... more
  • 16 Days Ago

AI Assistant is available now!

Feel free to start your new journey!