What are the responsibilities and job description for the ML Runtime Optimization Engineer, Mid-Level position at Jobright.ai?
Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.
Job Summary:
Applied Intuition is a vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines. They are seeking an ML Runtime Optimization Engineer to optimize ML models and deploy them on production-grade embedded runtime environments, focusing on performance optimization for ADAS/AD stacks across various embedded compute platforms.
Responsibilities:
• Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms
• Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
• Work on model pruning and quantization, and support deployment on memory constrained platforms
• Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions
• Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration
Qualifications:
Required:
• Bachelors in Electrical Engineering or Computer Science, OR B.Sc. in Computer Science, Mathematics, Physics or a related field
• 3 years of experience with ML accelerators, GPU, CPU, SoC architecture and micro-architecture
• Strong software development skills with the focus on embedded programming
• Experience profiling and optimizing model performance on embedded compute platforms
• Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.)
Preferred:
• M.Sc or PhD in a ML related area
• Built an ML optimization framework from scratch before
• Deployed ML solutions to embedded chips for real time robotics applications
Company:
Applied Intuition provides software infrastructure to safely develop, test, and deploy autonomous vehicles at scale. Founded in 2017, the company is headquartered in Mountain View, California, USA, with a team of 501-1000 employees. The company is currently Late Stage. Applied Intuition has a track record of offering H1B sponsorships.