Research Scientist Intern, Multimodal Audio Generation (PhD)

Meta
Menlo Park, CA Intern
POSTED ON 10/30/2025
AVAILABLE BEFORE 11/30/2025
Meta was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Meta offers countless ways to make an impact in a fast-growing organization.

Meta’s Core AI team is seeking a Research Scientist Intern with a focus on audio generation, especially music and song generation from multimodal input. Our team is pioneering AI research across text, audio, and video domains, with a mission to develop AI-driven foundational models and their applications. We are committed to advancing state-of-the-art algorithms, promoting open research, and fostering scientific innovation in all aspects of AI for language, including language modeling, natural language understanding and generation, audiovisual learning, on-device/personalized LM, and multimodal applications.

As a Research Scientist Intern, you will play a crucial role in developing cutting-edge models and algorithms in AI Research. We are seeking a candidate with expertise in multimodal learning and audio generation. The ideal candidate will have a strong background in deep learning and general machine learning, coupled with a deep passion for computer vision and audio processing. In this position, you will work with domain experts to understand the challenges and build state-of-the-art models to tackle them. Our internships are twelve (12) to twenty-four (24) weeks long and we have various start dates throughout the year.

Research Scientist Intern, Multimodal Audio Generation (PhD) Responsibilities:

  • Lead and contribute to cutting-edge audio (music and song) generation model research that leads to publications at top-tier conferences
  • Perform research to tackle unsolved real-world problems and push the state of the art
  • Independently design and implement algorithms, train advanced foundational models on large datasets, and evaluate their performance
  • Define, plan and execute cutting-edge deep learning research to advance product experiences using the audio generation features
  • Communicate experimental results and recommendations clearly, both within the group and to cross-functional groups

Minimum Qualifications:

  • Currently in the process of obtaining a PhD in Artificial Intelligence or a related field
  • Research experience in one or more of these areas: machine learning, deep learning, generative AI, audio processing or related fields
  • Knowledge of state-of-the-art deep learning methods and neural networks
  • Experience working with machine learning libraries such as PyTorch, JAX, etc.
  • Experience with scripting languages such as Python and shell scripts
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Preferred Qualifications:

  • Intent to return to degree program after the completion of the internship
  • Experience with developing scalable machine learning models in at least one of the following areas: large language models, natural language understanding or generation, efficient training and inference, multimodal models, or related areas
  • Experience with large-scale model training, implementing algorithms, and evaluating language systems
  • Proven track record of achieving significant results as demonstrated by publications at leading conferences/journals such as NeurIPS, ICLR, ICML, CVPR, ICCV, ICASSP, Interspeech, AAAI, IEEE TASLP or similar
  • Experience working and communicating cross-functionally in a team environment
  • Experience solving complex problems and comparing alternative solutions, trade-offs, and diverse points of view to determine a path forward

About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

$7,650/month to $12,134/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
