Demo

Principal AI Architect

Jobs via Dice
Mountain View, CA Full Time
POSTED ON 11/3/2025
AVAILABLE BEFORE 12/1/2025
Do you want to be at the forefront of innovating the latest hardware designs to propel Microsoft's cloud growth? Are you seeking a unique career opportunity that combines technical capabilities, cross-team collaboration with business insight and strategy?

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to achieve our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Join the Systems Planning and Architecture (SPARC) team within Microsoft's Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft's expanding Cloud Infrastructure and for powering Microsoft's "Intelligent Cloud" mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide, and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live.

We are looking for a Principal AI Architect to join our team!

Responsibilities:

  • Model Bring-Up & Characterization
  • Lead the bring-up and functional validation of LLMs on custom AI accelerators and GPUs
  • Develop and maintain detailed performance characterizations across compute, memory, and interconnect domains.
  • Instrument and profile end-to-end training and inference workloads to identify scaling inefficiencies and performance gaps.
  • Hardware/Software/Model Co-Design
  • Partner with silicon and system architects, compiler/runtime engineers, and model researchers to define co-design strategies that maximize efficiency and utilization.
  • Drive studies and experiments across quantization formats, tensor parallelism, activation checkpointing, memory layouts, and communication topologies.
  • Performance Optimization-Analyze kernel- and system-level traces to identify limiting factors in compute, memory, and interconnect.
  • Propose and implement optimizations in scheduling, fusion, and data movement to improve throughput and power efficiency.
  • Guide runtime and compiler improvements informed by workload analysis.
  • Cross-Functional Leadership
  • Collaborate with teams across Azure ML, DeepSpeed, and Maia hardware programs to deliver production-grade AI infrastructure
  • Present architectural findings and recommendations to senior engineering leadership.
  • Mentor and technically guide engineers working in performance, compiler, and system bring-up domains.

Qualifications:

Required/minimum Qualifications

  • Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 9 years technical engineering experience OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 11 years technical engineering experience OR equivalent experience.
  • 10 years of experience in AI systems, hardware/software co-design, or performance engineering.
  • 5 years of experience in AI accelerator and GPU architectures, including compute pipelines, memory hierarchies, and interconnects.
  • 5 years of experience with PyTorch, CUDA, Triton, or similar frameworks for performance tuning and kernel development.
  • 5 years of experience of cross-disciplinary collaboration between hardware, software, and ML model teams.
  • 5 years of experience in profiling and optimizing large-scale distributed AI workloads.

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  • Experience with compiler and runtime frameworks (e.g., MLIR, TVM, XLA, or custom code generation flows).
  • Familiarity with DeepSpeed, Megatron-LM, SGLang, or vLLM training and inference pipelines.
  • Deep understanding of transformer-based model architectures and scaling behaviors.
  • Hands-on experience with AI performance modeling, benchmarking, or workload simulation.
  • Demonstrated technical leadership and communication skills in highly collaborative environments.

Hardware Engineering IC6 - The typical base pay range for this role across the U.S. is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: ;br>

Microsoft will accept applications for the role until November 14th, 2025.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

#AHSI #SPARC

Salary : $163,000 - $331,200

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Principal AI Architect?

Sign up to receive alerts about other jobs on the Principal AI Architect career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$149,493 - $192,976
Income Estimation: 
$184,796 - $233,226
Income Estimation: 
$73,784 - $86,677
Income Estimation: 
$90,372 - $103,622
Income Estimation: 
$61,825 - $80,560
Income Estimation: 
$90,032 - $105,965
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$111,859 - $131,446
Income Estimation: 
$110,457 - $133,106
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$122,763 - $145,698
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$136,611 - $163,397
Income Estimation: 
$135,163 - $163,519
Income Estimation: 
$131,953 - $159,624
Income Estimation: 
$150,859 - $181,127
Income Estimation: 
$131,953 - $159,624
Income Estimation: 
$169,825 - $204,021
Income Estimation: 
$166,631 - $195,636
Income Estimation: 
$162,237 - $199,353
Income Estimation: 
$181,083 - $218,117
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Jobs via Dice

Jobs via Dice
Hired Organization Address Rapid, SD Full Time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Swoon Group, is see...
Jobs via Dice
Hired Organization Address Rapid, SD Temporary
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Rose International,...
Jobs via Dice
Hired Organization Address Alaska, AK Full Time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, DMS Vision Inc., is...
Jobs via Dice
Hired Organization Address Alaska, AK Full Time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, SVK Technology Solu...

Not the job you're looking for? Here are some other Principal AI Architect jobs in the Mountain View, CA area that may be a better fit.

Principal AI Solutions Architect

Rubrik Security Cloud, Palo Alto, CA

Principal AI Architect

Mogi I/O : OTT/Podcast/Short Video Apps for you, Sunnyvale, CA

AI Assistant is available now!

Feel free to start your new journey!