Senior Machine Learning Engineer

TetraMem - Accelerate The World
San Jose, CA Full Time
POSTED ON 4/15/2026
AVAILABLE BEFORE 5/14/2026
Responsibilities

  • Develop, optimize, and deploy lightweight machine learning models for edge AI applications, particularly for audio processing.
  • Implement and optimize ML models on embedded platforms, including FPGA and custom ASIC solutions.
  • Work closely with hardware and software teams to integrate ML models into production systems.
  • Research and implement state-of-the-art ML techniques to improve model efficiency and reduce latency and power consumption in embedded AI applications.
  • Improve inference efficiency and model compression techniques, including quantization, pruning, and knowledge distillation.
  • Collaborate with cross-functional teams to drive innovation and contribute to the overall system architecture.
  • Provide technical leadership and mentorship to junior engineers.
  • Publish research findings, present at conferences, and contribute to open-source projects when applicable.

Requirements

  • 5 years of relevant industry experience (or a PhD) in Computer Science, Electrical Engineering, Machine Learning, or related fields.
  • Strong hands-on experience in machine learning, with a focus on edge AI, on-device inference, and deploying lightweight models on resource-constrained devices.
  • Expertise in modern ML frameworks such as PyTorch, TensorFlow (including TensorFlow Lite), and JAX.
  • Proficiency in Python and C/C++, with practical experience in ML model optimization and production deployment.
  • Deep experience with model quantization (PTQ/QAT), pruning, knowledge distillation, sparsity, and other compression techniques for efficient edge inference.
  • Hands-on experience developing for or integrating with AI chip SDKs, neural accelerators (NPUs/DSPs), or hardware-specific toolchains (e.g., NVIDIA TensorRT, Qualcomm Neural Processing SDK, ARM Ethos, or similar).
  • Familiarity with edge inference runtimes (ONNX Runtime, ExecuTorch, TVM) and optimizing models for hardware constraints (latency, memory footprint, power consumption).

Experience in one or more of the following areas is considered a strong plus:

  • Understanding of ML compiler and runtime design.
  • Experience working with tools such as Optimum, ONNX, TensorRT, TFLite/LiteRT, ncnn, or CoreML.
  • Familiarity with hardware acceleration techniques.
  • Experience in embedded system development.

Salary Range: $200,000 - $280,000 / year
