Demo

Member of Technical Staff, Multimodal Understanding (Visual / Audio)

xAI
Palo Alto, CA Full Time
POSTED ON 1/1/2026
AVAILABLE BEFORE 1/29/2026
About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

Focus
  • Creating and driving engineering agenda to toward superhuman multimodal capabilities, which include both multimodal understanding and multimodal generation, across different modalities including image, video and audio.
  • Improving data quality, developing data filtering/generation techniques, and performing data study, on pretraining scale.
  • Creating evaluation frameworks and internal benchmarks.
  • Designing and implementing effective and efficient algorithms for achieving state-of-the-art model performance.
Ideal Experience
  • Hands-on experience on visual, audio or multimodal pretraining.
  • Track record in leading engineering that significantly improves the capability and performance of neural networks, whether better data or better modeling.
  • Experience in data-driven experiment designs and systematic analysis for iterative model debugging.
  • Experience in developing or working with large-scale distributed machine learning systems.
  • Ability to do whatever is necessary to deliver the best end-to-end user experience.
Tech Stack
  • Python
  • Jax
  • Rust
Interview Process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“intro call”) during which a member of our team will ask some basic questions. If you clear the initial , you will enter the main process, which consists of four technical interviews:

  • One-on-one engineering discussion & coding interviews (three meetings total)
  • Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.

Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.

Location

The role is based in the SF Bay Area. Candidates are expected to be located near the Bay Area or open to relocation.

Annual Salary Range

$180,000 - $440,000 USD

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Salary : $180,000 - $440,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Member of Technical Staff, Multimodal Understanding (Visual / Audio)?

Sign up to receive alerts about other jobs on the Member of Technical Staff, Multimodal Understanding (Visual / Audio) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$36,436 - $44,219
Income Estimation: 
$50,145 - $86,059
Income Estimation: 
$48,515 - $60,705
Income Estimation: 
$90,032 - $105,965
Income Estimation: 
$111,859 - $131,446
Income Estimation: 
$110,457 - $133,106
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$122,763 - $145,698
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at xAI

  • xAI Memphis, TN
  • About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small... more
  • 14 Days Ago

  • xAI Memphis, TN
  • About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small... more
  • 14 Days Ago

  • xAI Seattle, WA
  • About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small... more
  • 14 Days Ago

  • xAI Seattle, WA
  • About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small... more
  • 14 Days Ago


Not the job you're looking for? Here are some other Member of Technical Staff, Multimodal Understanding (Visual / Audio) jobs in the Palo Alto, CA area that may be a better fit.

  • Boson AI Santa Clara, CA
  • Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola,Mu Li), and a team of Deep Learning, Optimiz... more
  • 12 Days Ago

  • xAI Palo Alto, CA
  • About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small... more
  • 5 Days Ago

AI Assistant is available now!

Feel free to start your new journey!