Research Scientist / Engineer — Multimodal Agent

Luma
Palo Alto, CA Full Time
POSTED ON 4/2/2026
AVAILABLE BEFORE 4/30/2026
About Luma AI

Luma’s mission is to build multimodal AGI. Our research on video, 3D, and now multimodal models has led us to believe that AI needs to be jointly trained over all signal modalities (text, video, audio, images), analogous to the human brain.

To advance our mission, we build and operate the full stack end-to-end, spanning foundation models, inference systems, and products. This integrated approach powers technologies like Ray3, which is seeing rapidly growing adoption among Fortune 500 companies across media, entertainment, and advertising. Backed by a recent $900M Series C and our partnership with Humain to build a 2 GW compute supercluster (Project Halo), our models and the Dream Machine platform are now enabling creatives worldwide to tell some of the most impactful stories of our time.

Where You Come In

This is a rare and foundational opportunity to define the future of multimodal AI. You will be at the forefront of building and training large-scale multimodal models, directly impacting how users interact with pixels. This role offers the chance to bridge cutting-edge research with magical, shipped products, working end-to-end on novel problems with no existing playbook.

What You'll Do

This role involves both the “science” and the “engineering” sides of research, which are equally important.

This is a multi-stack opportunity where you will work at the intersection of modeling, data, systems, and evaluation.

  • Modeling: Architect large-scale multimodal agentic models that use reasoning, planning, coding, and tool calling to achieve complex, multi-step multimodal work.
  • Data: Hill-climb on existing tasks and formulate new tasks through data. Design, implement, and run robust data pipelines for constructing, enriching, and filtering massive pixel datasets.
  • Systems: Train large-scale multimodal models on massive datasets and GPU clusters.
  • Evaluation: Define and build novel evaluation frameworks to measure the capabilities of multimodal agents.

Who You Are

  • Strong foundation in machine learning, foundation models, and agentic systems.
  • Deep understanding of agentic systems and of approaches to LLM/VLM reasoning, coding models, and LLM/VLM tool calling.
  • Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets).

What Sets You Apart (Bonus Points)

Experience with data, modeling, or evaluation for any of the following:

  • State-of-the-art foundation models in reasoning
  • State-of-the-art foundation models in coding
  • State-of-the-art foundation models in tool calling
  • State-of-the-art multimodal agents

Your application is reviewed by real people.

Salary.com Estimation for Research Scientist / Engineer — Multimodal Agent in Palo Alto, CA
$110,524 to $139,925




