What are the responsibilities and job description for the AI Researcher (Multimodal Audio/Video Generation) position at Tavus?

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for an AI Researcher to join our core AI team and push forward the science of audio-visual avatar generation. If you thrive in high-speed startup environments, enjoy experimenting with generative models, and love seeing your research ship into production then you’ll feel right at home.

Your Mission 🚀

Research and develop audio-visual generation models for conversational agents (e.g. Neural Avatars, Talking-Heads).
Focus on models that are tightly coupled with conversation flow, ensuring verbal and non-verbal signals work seamlessly together.
Experiment with diffusion models (DDPMs, LDMs, etc.), long-video generation, and audio generation.
Collaborate with the Applied ML team to bring your research into real-world production.
Stay ahead of the latest advancements in multimodal generation — and help shape the next wave.

You’ll Be Great At This If You Have:

A PhD (or near completion) in a relevant field, or equivalent hands-on research experience.
Experience applying image/video generation models in practice.
Strong foundations in generative modeling and rapid prototyping.
Deep familiarity with diffusion models, including recent advances in efficiency.
Good understanding of video-language models and multimodal generation.
Proficiency in PyTorch and GPU-based inference.

Nice-to-Haves

Experience with long-video or audio generation.
Skills in 3D graphics, Gaussian splatting, or large-scale training setups.
Broader exposure to generative models and rendering.
Familiarity with software engineering best practices.
Publications in top-tier or respected venues (CVPR, NeurIPS, BMVC, ICASSP, etc.).

Location

Preferred: San Francisco (hybrid) or London (office opening soon). Remote within U.S. or Europe available for exceptional candidates.

Apply for this job

Receive alerts for other AI Researcher (Multimodal Audio/Video Generation) job openings

What is the career path for a AI Researcher (Multimodal Audio/Video Generation)?

Sign up to receive alerts about other jobs on the AI Researcher (Multimodal Audio/Video Generation) career path by checking the boxes next to the positions that interest you.

AI Engineer II

Income Estimation:

$101,387 - $124,118

AI Engineer III

Income Estimation:

$119,030 - $151,900

Machinist II

Income Estimation:

$51,669 - $66,452

Injection Molding Machine Operator III

Income Estimation:

$53,120 - $69,174

Machine Operator III

Income Estimation:

$50,113 - $64,377

Machinist III

Income Estimation:

$61,656 - $78,069

Computer Numeric Controlled Machine Operator II

Income Estimation:

$59,875 - $77,824

Job openings at Tavus

AI Researcher (Large Language Models)

Apply

Tavus York, NY
About Us Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, fre... more
16 Days Ago

Developer Experience Engineer

Apply

Tavus San Francisco, CA
About Us Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, fre... more
16 Days Ago

Not the job you're looking for? Here are some other AI Researcher (Multimodal Audio/Video Generation) jobs in the York, NY area that may be a better fit.

AI Researcher

Apply

Vatic Labs York, NY
As an AI Researcher at Vatic Labs, you will research and develop innovative AI-driven quantitative trading strategies. You will explore vast amounts of mar... more
5 Days Ago

AI Researcher

Apply

Mercor York, NY
About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benc... more
6 Days Ago

AI Researcher (Multimodal Audio/Video Generation)

What are the responsibilities and job description for the AI Researcher (Multimodal Audio/Video Generation) position at Tavus?

What is the career path for a AI Researcher (Multimodal Audio/Video Generation)?

Job openings at Tavus

Not the job you're looking for? Here are some other AI Researcher (Multimodal Audio/Video Generation) jobs in the York, NY area that may be a better fit.

We don't have any other AI Researcher (Multimodal Audio/Video Generation) jobs in the York, NY area right now.

AI Assistant is available now!