Machine Learning Infra Engineer

Nuance Labs
Seattle, WA (Full Time)
POSTED ON 4/3/2026
AVAILABLE BEFORE 5/2/2026
About The Role

Nuance Labs is building the next generation of emotionally expressive, real-time AI.

In this critical role, you will build the infrastructure that powers our AI platform. You will own the systems that serve models at scale, orchestrate complex data workflows, and ensure our real-time video AI runs reliably and with low latency for users worldwide.

Responsibilities

  • Own Inference Infrastructure: Build and maintain the serving stack for multimodal AI workloads. Optimize for latency, throughput, and cost using batching strategies, autoscaling, and intelligent resource allocation.
  • Real-Time Video Streaming: Architect systems to handle long-lived WebRTC connections with unpredictable client behavior, ensuring smooth video and audio delivery at scale.
  • Orchestrate Data Workflows: Build robust pipelines for offline processing, evaluation, and training using orchestration frameworks like Dagster or Ray. Manage petabyte-scale video storage and network requirements.
  • GPU Cluster Management: Configure and maintain GPU clusters using Kubernetes and Terraform. Implement monitoring, autoscaling based on custom metrics, and cost optimization strategies.
  • Developer Tooling: Build CI/CD, evaluation, and versioning systems that enable safe, zero-downtime model deployments and rapid iteration cycles.

Requirements

  • Infrastructure Expertise: Strong practical experience with Kubernetes, Terraform, and cloud platforms. You can design secure, scalable systems and debug complex distributed issues.
  • Systems Programming: Proficiency in Python and experience with systems languages (Rust or Go). Comfortable profiling workloads and resolving compute, memory, or network bottlenecks.
  • Orchestration & Pipelines: Experience managing large-scale offline workflows using tools like Dagster, Ray, Airflow, or similar frameworks.
  • Production Operations: Deep understanding of production reliability, monitoring, incident response, and capacity planning for high-traffic services.

Preferred Experience

  • Experience with WebRTC or real-time media pipelines in production
  • Experience running GPU-backed inference services at scale (vLLM, Triton Inference Server, TensorRT)
  • Knowledge of performance optimization and low-level systems debugging
  • Familiarity with video/audio processing and storage systems

Nuance Labs Key Facts

  • $10M seed round backed by Accel, South Park Commons, Lightspeed, and top angels including Synthesia’s former CPO.
  • A world-class team of PhDs from MIT, UW, and Oxford with decades of industry experience at Apple and Meta, advancing real-time avatars from cutting-edge research to products used by millions.
  • In-person collaboration, 5 days a week at our Seattle HQ.

Salary.com Estimation for Machine Learning Infra Engineer in Seattle, WA
$113,515 to $146,177