Applied ML Engineer

Knowtex
San Francisco, CA | Full Time
POSTED ON 6/15/2026
AVAILABLE BEFORE 6/26/2026
About Knowtex

Knowtex is building the future of voice AI operating systems for clinicians, transforming how healthcare documentation happens at the point of care. Founded by Stanford AI scientists with deep clinical experience, we are growing rapidly across both commercial health systems and federal healthcare, and our ambient documentation platform is scaling to thousands of clinicians across hundreds of specialties. We're at an inflection point where cutting-edge AI meets real clinical impact, giving clinicians hours back each day to focus on what matters most: their patients.

Position Overview

We are seeking an Applied ML Engineer to productionize and scale the machine learning systems powering our voice AI platform. This role bridges research and engineering, turning models into reliable, low-latency, production-grade systems deployed across enterprise healthcare environments.

You will work closely with ML Scientists, Backend Engineers, and Platform teams to optimize inference performance, build evaluation pipelines, and ensure robust model deployment in regulated environments.

Key Responsibilities

  • Productionize ML models for real-time clinical applications
  • Optimize inference pipelines for low latency and high throughput
  • Deploy and scale models using AWS-based infrastructure
  • Build automated evaluation and regression testing frameworks for LLM outputs
  • Implement monitoring systems for model performance and drift detection
  • Collaborate with Backend teams to integrate ML services into APIs and workflows
  • Improve model efficiency through quantization, batching, caching, and optimization techniques
  • Support specialty-level model evaluation and performance analysis
  • Contribute to CI/CD workflows for ML deployment

Required Qualifications

  • 3–7 years of experience in machine learning engineering or applied ML roles
  • Strong proficiency in Python and PyTorch (or TensorFlow)
  • Experience deploying ML models in production environments
  • Familiarity with transformer architectures and large language models
  • Experience with model optimization techniques (quantization, distillation, pruning)
  • Experience working with cloud infrastructure (AWS preferred)
  • Strong software engineering fundamentals and debugging skills

Preferred Qualifications

  • Experience with speech recognition systems or NLP pipelines
  • Experience with Triton Inference Server or similar deployment frameworks
  • Familiarity with healthcare data or clinical documentation workflows
  • Experience working in regulated environments (HIPAA, GovCloud, etc.)
  • Knowledge of medical coding systems (ICD-10, CPT)

Technical Environment

  • Python, PyTorch / TensorFlow
  • Transformer-based LLM architectures
  • AWS (SageMaker, ECS, Lambda, S3)
  • Triton Inference Server
  • CI/CD pipelines for ML deployment
  • Observability tools for performance and drift monitoring

Compensation & Benefits

  • Meaningful equity compensation
  • Unlimited PTO
  • Premium health, dental, and vision coverage
  • 401(k) plan

Salary: $110,000 - $120,000
