Demo

AI Researcher (Computer Vision/Multimodal/Generative AI)

SPREEAI
San Francisco, CA Full Time
POSTED ON 4/18/2026
AVAILABLE BEFORE 5/26/2026

About The Role

We are hiring ML Researchers to develop novel approaches that advance the frontier of multimodal vision AI and create product-defining capabilities for SpreeAI. This role exists because current generative and vision models are not designed for photorealistic human representation, controllable try-on, or real-world deployment constraints. You will explore new architectures, algorithms, and training strategies that improve realism, controllability, efficiency, and multimodal understanding — with a direct path from research to production.


You Will Work On Research Problems Across

  • photorealistic virtual try-on
  • human-centric visual representation learning
  • video-based modeling and temporal consistency
  • multimodal reasoning and generative pipelines
  • compute-efficient diffusion and generative architectures


This is a research role with product impact: successful work leads to platform capabilities, white papers, patents and most importantly, industry differentiation.


Why This Role Exists

Modern Multimodal AI Systems Struggle With Identity Preservation, Pose Consistency, Physical Realism, And Controllability Under Production Constraints. We Are Building New Approaches Where

  • diffusion models must produce consistent outputs across poses, viewpoints, and garments,
  • generative models must learn human and garment interactions realistically,
  • research innovations must scale to real-world deployment environments.


This role is for researchers who want to see novel ideas become shipped systems used by real customers.


What You'll Do

  • Develop novel architectures and training approaches for vision and multimodal AI.
  • Advance generative modeling techniques including controllable diffusion and video generation.
  • Design experiments improving realism, temporal consistency, and human representation.
  • Collaborate with applied engineering teams to translate research into production systems.
  • Publish white papers or research outputs aligned with product differentiation.
  • Evaluate new model paradigms for scalability and efficiency.


Core Research Areas & Model Architectures

Candidates Should Have Familiarity With Or Interest In Advancing

  • Diffusion models and latent diffusion architectures.
  • Transformer-based vision models (ViT, multimodal transformers).
  • Image-to-image and video generation pipelines.
  • Control mechanisms for generative models (conditioning, adapters, LoRA).
  • Representation learning for human pose, geometry, or identity consistency.
  • Multimodal architectures combining vision, text, and structured inputs.


Qualifications

  • PhD in Computer Science, Artificial Intelligence, Robotics, Computer Vision, or related field.
  • Strong research background in computer vision, generative modeling, or multimodal AI.
  • Strong programming skills in Python and familiarity with object-oriented languages.
  • Experience with deep learning frameworks (PyTorch preferred).
  • Strong foundations in machine learning theory and experimental design.


Preferred Qualifications

  • Publications at top conferences (CVPR, ICCV, NeurIPS, ICLR, SIGGRAPH, etc.).
  • Experience with diffusion-based generative models.
  • Video modeling or temporal learning experience.
  • Experience bridging research into production systems.
  • Interest in compute efficiency, distillation, or scalable generative pipelines.


SPREEAI is a fast-growing, innovative AI company at the forefront of fashion and e-commerce, revolutionizing how consumers engage with fashion through lifelike photorealistic try-on technology and hyper-personalized shopping experiences. Our mission is to redefine the retail landscape with cutting-edge AI solutions that blend high fashion and technology. We thrive in a dynamic, fast-paced environment where creativity meets technology to drive real impact. If you are passionate about innovation and shaping the future of fashion, SPREEAI offers a platform to make your mark.

Salary.com Estimation for AI Researcher (Computer Vision/Multimodal/Generative AI) in San Francisco, CA
$135,057 to $164,515
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a AI Researcher (Computer Vision/Multimodal/Generative AI)?

Sign up to receive alerts about other jobs on the AI Researcher (Computer Vision/Multimodal/Generative AI) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$89,966 - $112,616
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at SPREEAI

  • SPREEAI San Francisco, CA
  • About The Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at... more
  • 2 Days Ago

  • SPREEAI San Francisco, CA
  • About The Role We are hiring Engineers focused on AI Model Evaluation to build the systems that ensure multimodal AI behaves reliably, consistently, and pr... more
  • 5 Days Ago

  • SPREEAI York, NY
  • About The Role Ready to launch your social media career at the intersection of fashion and AI? SPREEAI – a fast-growing, innovative startup blending high f... more
  • 9 Days Ago

  • SPREEAI San Francisco, CA
  • About The Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at... more
  • 9 Days Ago


Not the job you're looking for? Here are some other AI Researcher (Computer Vision/Multimodal/Generative AI) jobs in the San Francisco, CA area that may be a better fit.

  • Fireworks AI San Mateo, CA
  • About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and mo... more
  • 1 Day Ago

  • Archetype AI San Mateo, CA
  • About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team f... more
  • 2 Days Ago

AI Assistant is available now!

Feel free to start your new journey!