Demo

Multimodal Speech Engineer, AI Companion

1X Technologies AS
Palo Alto, CA Full Time
POSTED ON 1/9/2026
AVAILABLE BEFORE 2/9/2026

About 1X
We’re an AI and robotics company based in Palo Alto, California, on a mission to build a truly abundant society through general‑purpose robots capable of performing any kind of work autonomously.
We believe that to truly understand the world and grow in intelligence, humanoid robots must live and learn alongside us. That’s why we’re focused on developing friendly home robots designed to integrate seamlessly into everyday life.
We’re looking for curious, driven, and passionate people who want to help shape the future of robotics and AI. If this mission excites you, we’d be thrilled to hear from you and explore how you might contribute to our journey.

Role Overview

The AI Companion team creates the speech interface for NEO, as well as the physical awareness behaviors that evokes trust, warmth, and competence when NEO interacts with people. 

As a Multimodal Speech Engineer on the AI Companion Team, you will lead the effort to create a conversational speech model, from design to data collection to deployment. You will develop real-time architectures that enable NEO to not only converse with users, but also incorporate other modalities like vision, spatial audio, and body language.

You will work closely with the design team to reflect NEO’s personality and 1X’s brand values in the way NEO speaks and responds to users, and the autonomy team to ensure that NEO’s speech models are aware of its own physical capabilities.

Responsibilities

  • Design and implement data pipelines for large scale speech interactions from NEO data and external datasets

  • Train speech2speech models to be aware of NEO’s embodiment

  • Design appropriate responses for a variety of user queries

  • Synchronize speech with body language

  • Customize NEO with different personalities



  • 3 years of experience in speech and audio modeling domains

  • Experience in multi-modal conversational models (language, audio, vision) is a strong plus

  • Ability to take open-ended problems in conversation models, come up with creative solutions, implement proof-of-concepts, and translate those to production.

Benefits & Compensation

  • Salary Range: $150,000 - $250,000

  • Health, dental, and vision insurance

  • 401(k) with company match

  • Paid time off and holidays

Equal Opportunity Employer
1X is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, ancestry, citizenship, age, marital status, medical condition, genetic information, disability, military or veteran status, or any other characteristic protected under applicable federal, state, or local law.

Salary : $150,000 - $250,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Multimodal Speech Engineer, AI Companion?

Sign up to receive alerts about other jobs on the Multimodal Speech Engineer, AI Companion career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$99,929 - $128,438
Income Estimation: 
$121,609 - $163,363
Income Estimation: 
$128,327 - $171,691
Income Estimation: 
$73,784 - $86,677
Income Estimation: 
$90,372 - $103,622
Income Estimation: 
$61,825 - $80,560
Income Estimation: 
$90,032 - $105,965
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$111,859 - $131,446
Income Estimation: 
$110,457 - $133,106
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$122,763 - $145,698
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$136,611 - $163,397
Income Estimation: 
$135,163 - $163,519
Income Estimation: 
$131,953 - $159,624
Income Estimation: 
$150,859 - $181,127
Income Estimation: 
$162,237 - $199,353
Income Estimation: 
$222,110 - $256,974
Income Estimation: 
$224,976 - $270,947
Income Estimation: 
$205,834 - $254,869
Income Estimation: 
$242,530 - $287,120
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at 1X Technologies AS

  • 1X Technologies AS Hayward, CA
  • Materials Handler, Operations / Logistics Location: Hayward, CA (on‑site) Schedule: Swing Shift (2:00pm -10:30pm) About 1X We build humanoid robots that wo... more
  • 14 Days Ago

  • 1X Technologies AS Palo Alto, CA
  • 1X builds humanoid robots that work alongside people. Every demo—internal or external—must run perfectly on the first try. Own planning and execution of al... more
  • 14 Days Ago

  • 1X Technologies AS Palo Alto, CA
  • About 1X We build humanoid robots that work alongside people to solve labor shortages and create abundance. Role Overview We are seeking a Mechanical Engin... more
  • 15 Days Ago

  • 1X Technologies AS Hayward, CA
  • Production Lead, Manufacturing Location: Hayward, CA (on‑site) About 1X We build humanoid robots that work alongside people to solve labor shortages and cr... more
  • 16 Days Ago


Not the job you're looking for? Here are some other Multimodal Speech Engineer, AI Companion jobs in the Palo Alto, CA area that may be a better fit.

  • Luma AI Palo Alto, CA
  • The Opportunity Luma AI is building the next era of AI with Omni models that can see, hear, and understand the world. As a full-stack company, we train our... more
  • 11 Days Ago

  • Luma AI Palo Alto, CA
  • About Luma AI Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intellig... more
  • 9 Days Ago

AI Assistant is available now!

Feel free to start your new journey!