Demo

Lead Site Reliability Engineer, AI/ML Platform

JPMorgan Chase
Jersey, NJ Full Time
POSTED ON 12/23/2025
AVAILABLE BEFORE 2/23/2026

 Responsibilities:

  • Design and implement solutions to enhance the reliability and scalability of AI/ML platforms and applications to accommodate fast growing demands.
  • Partner with product engineering teams to ensure the AI/ML systems are reliable and high performing. 
  • Develop observability, security, automation and fin-ops tools and orchestration.
  • Provide strategic technology leadership by defining and evaluating standards and architecture for reliability, observability and automation frameworks.
  • Build strong cross-functional relationships that foster engagements across the organization and deliver solutions to user problems.
  • Debug and solve issues in a production environment, identify root cause and remediate. 
  • Participates in on-call rotations, incident management and escalation workflows.
  • Take full ownership of problems, develop solutions, and acquire new knowledge to complete the task.
  • Mentor and guide junior engineers.

Required Qualifications:

  • Bachelor’s degree in computer science, Information Technology, or equivalent technical qualification with 5 years professional experience.
  • Expertise in SRE principles, reliability, scalability and performance of application and infrastructure.
  • Have hands-on experience with cloud platforms (AWS, GCP, Azure) and IaC tools (Terraform, Ansible). 
  • Extensive experience implementing advanced observability using tools like Open Telemetry, Dynatrace, Grafana, and/or cloud-native services.
  • Experience in architecting distributed systems and cloud-native architecture in AWS.
  • Systematic problem-solving and troubleshooting skills in a complex system.
  • Excellent communication skills and ability to represent and present business and technical concepts to stakeholders. 
  • Self-managed, self-motivated with strong sense of ownership, urgency, and drive

Good to have:

  • Prior experience working in AI, ML, or Data engineering.
  • Prior experience developing AI Ops/AI Agents.
  • Multi cloud experience (AWS, GCP, Azure) is a plus 

Salary.com Estimation for Lead Site Reliability Engineer, AI/ML Platform in Jersey, NJ
$144,034 to $182,096
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Lead Site Reliability Engineer, AI/ML Platform?

Sign up to receive alerts about other jobs on the Lead Site Reliability Engineer, AI/ML Platform career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at JPMorgan Chase

  • JPMorgan Chase Wilmington, DE
  • Chase Card Services is seeking an experienced leader to drive customer and firm value within the Card Authorizations’ product. You will work closely with B... more
  • 12 Days Ago

  • JPMorgan Chase Wilmington, DE
  • We have an exciting and rewarding opportunity for you to take your software engineering career to the next level. As a Software Engineer III - Python/Agent... more
  • 12 Days Ago

  • JPMorgan Chase Wilmington, DE
  • As a Computational Linguist in the Machine Learning & Optimization team, you are an integral part of the group that optimizes features, models, and AI capa... more
  • 12 Days Ago

  • JPMorgan Chase Newark, DE
  • Join a team where your expertise in data governance will shape the future of data management at JPMorgan Chase. As part of the CIB Chief Data Office, you’l... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Lead Site Reliability Engineer, AI/ML Platform jobs in the Jersey, NJ area that may be a better fit.

  • JPMorganChase Jersey, NJ
  • Responsibilities JOB DESCRIPTION Design and implement solutions to enhance the reliability and scalability of AI/ML platforms and applications to accommoda... more
  • 20 Days Ago

  • JPMorganChase Jersey, NJ
  • Job Description Job Description At J.P. Morgan Chase, we are building an enterprise-grade AI/ML Data Platform that enables scalable, secure, and responsibl... more
  • 16 Days Ago

AI Assistant is available now!

Feel free to start your new journey!