Demo

AI Research Scientist, Audio-Visual Understanding, FAIR

Meta
San Francisco, CA Full Time
POSTED ON 5/30/2026
AVAILABLE BEFORE 7/31/2026
Meta is seeking a Research Scientist to join Fundamental AI Research (FAIR), a research organization focused on making significant advances in AI. Our organization is driven by advancing the science of intelligence and developing technology toward achieving superintelligence. We are seeking researchers with experience in computer vision, speech and multimodal learning to join our team and help build the perceptual foundations for real-time embodied conversational agents. This role offers the opportunity to collaborate with a highly interdisciplinary team of scientists, engineers, and cross-functional partners, with access to cutting-edge technology, resources, and research facilities.

AI Research Scientist, Audio-Visual Understanding, FAIR Responsibilities:

  • Develop joint audio-visual understanding systems that integrate visual and auditory signals for advanced perception
  • Build and evaluate audiovisual language models for social interactions and understanding, including predicting social intent, semantic function, and reasoning from human-centric inputs
  • Contribute to benchmarks and evaluation frameworks for visual social understanding and interactions
  • Train and optimize state-of-the-art machine learning and neural network methodologies
  • Conduct and collaborate on research projects within a globally-based team

Minimum Qualifications:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • A PhD in AI, computer science, data science, or related technical fields
  • Experience holding an industry, postdoctoral, faculty, or government researcher position
  • Research background in machine learning, artificial intelligence, computational statistics, or applied mathematics, or related areas
  • Research publications reflecting experience in theoretical or empirical research
  • Experience in developing and debugging in Python or similar programming languages
  • Experience in analyzing and collecting data from various sources
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment

Preferred Qualifications:

  • Demonstrated research and software engineering experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub)
  • Experience with audio-visual learning or multimodal fusion techniques
  • Familiarity with human action recognition, social signal processing, or human-centric video understanding
  • Experience with long-form video understanding, video-language models, or streaming perception systems
  • Experience with vision-language models (VLMs) such as LLaVA, GPT-4V, Gemini, or similar architectures
  • Experience with temporal modeling, video transformers, or recurrent architectures for sequential data

About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@meta.com.

$154,000/year to $217,000/year bonus equity benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Salary : $154,000 - $217,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a AI Research Scientist, Audio-Visual Understanding, FAIR?

Sign up to receive alerts about other jobs on the AI Research Scientist, Audio-Visual Understanding, FAIR career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$108,245 - $136,486
Income Estimation: 
$136,683 - $171,343
Income Estimation: 
$108,245 - $136,486
Income Estimation: 
$136,683 - $171,343
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Meta

  • Meta Lebanon, IN
  • Meta is seeking an Environmental, Health & Safety (EHS) Lead to join our Data Center Facility Operations team. Our data centers serve as the foundation upo... more
  • 3 Days Ago

  • Meta Redmond, WA
  • As a Hardware Electrical Engineer at Meta, you will design, develop, and integrate electrical systems for a wide range of applications, from next-generatio... more
  • 3 Days Ago

  • Meta York, NY
  • At Meta, we’re shaping innovative experiences in service of giving people the power to build community and bring the world closer together. Our multidiscip... more
  • 3 Days Ago

  • Meta Miami, FL
  • Meta is seeking a Regional Quality Manager to join our Data Center Design, Engineering & Construction Team. Our team's mission is to optimize the delivery ... more
  • 3 Days Ago


Not the job you're looking for? Here are some other AI Research Scientist, Audio-Visual Understanding, FAIR jobs in the San Francisco, CA area that may be a better fit.

  • Canva San Francisco, CA
  • Company Description Join the team redefining how the world experiences design. Hey, g'day, mabuhay, kia ora,你好, hallo, vítejte! Thanks for stopping by. We ... more
  • 25 Days Ago

  • Skild AI San Francisco, CA
  • Company Overview At Skild AI, we are building the world's first general purpose robotic intelligence that is robust and adapts to unseen scenarios without ... more
  • 5 Days Ago

AI Assistant is available now!

Feel free to start your new journey!