
Senior Software Engineer - AI/ML, AWS Neuron Inference

Amazon Web Services (AWS)
Seattle, WA Full Time
POSTED ON 5/19/2026
AVAILABLE BEFORE 6/17/2026
Description

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer on the Machine Learning Inference Applications team, responsible for the development and performance optimization of the core building blocks of LLM inference: Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.

The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models.

Key job responsibilities

Responsibilities of this role include adapting the latest research in LLM optimization to Neuron chips to extract the best performance from both open-source and internally developed models. Working across teams and organizations is key.

About The Team

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Basic Qualifications

  • 5 years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent
  • 5 years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
  • Knowledge of the fundamentals of machine learning models, their architecture, and their training and inference lifecycles, along with work experience on optimizations that improve model performance

Preferred Qualifications

  • Master's degree in computer science or equivalent
  • Hands-on experience with PyTorch or JAX, preferably involving developing and deploying LLMs in production on GPUs, Neuron, TPUs, or other AI acceleration hardware

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually


Company - Annapurna Labs (U.S.) Inc.

Job ID: A10422684

Salary.com Estimation for Senior Software Engineer - AI/ML, AWS Neuron Inference in Seattle, WA
$101,368 to $126,046


