Demo

Engineering Director - Site Reliability - Requsition 24015266

hackajob
York, NY Full Time
POSTED ON 4/9/2026
AVAILABLE BEFORE 5/8/2026
hackajob is collaborating with American Express to connect them with exceptional professionals for this role.

You Lead the Way. We’ve Got Your Back.

With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.

At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success. Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.

Join Team Amex and let's lead the way together.

As part of our diverse tech team, you can architect, code and ship software that makes us an essential part of our customers’ digital lives. Here, you can work alongside talented engineers in an open, supportive, inclusive environment where your voice is valued, and you make your own decisions on what tech to use to solve challenging problems. American Express offers a range of opportunities to work with the latest technologies and encourages you to back the broader engineering community through open source. And because we understand the importance of keeping your skills fresh and relevant, we give you dedicated time to invest in your professional development. Find your place in technology on #TeamAmex.

Responsibilities

Leadership and Strategy:

Direct and mentor a diverse team of SRE engineers across multiple locations.

Develop and implement the technical strategy for infrastructure, alerting, monitoring, and development tooling.

Foster a culture of openness, innovation, and inclusivity.

Collaborate with senior leadership to align SRE goals with organizational objectives.

Act as a liaison between engineering, operations, and application support teams to ensure cohesive strategy and execution.

Operational Excellence

Ensure the reliability, scalability, and performance of all platform services.

Oversee incident management processes, ensuring rapid resolution and effective post-incident analysis.

Implement best practices for monitoring, logging, and alerting across all systems.

Drive continuous improvement in operational processes and system reliability.

Develop and maintain comprehensive documentation and knowledge sharing across teams.

24x7 Operations: Ensure 24x7 operations by establishing and managing a follow-the-sun support model, on-call rotations, and effective handover processes to maintain continuous monitoring and incident response.

Technical Oversight

Lead the design and architecture of comprehensive infrastructure solutions that address complex technical challenges and align with business objectives.

Provide technical leadership and guidance to the Platform SRE team, ensuring that architectural standards and best practices are followed for all initiatives.

Lead the development and maintenance of automation tools for infrastructure management.

Manage the observability platform and establish standards for tracking application health.

Collaborate with application teams to define and meet service reliability targets.

Ensure robust disaster recovery and business continuity plans are in place.

Ensure robust monitoring and alerting infrastructure is in place for all critical services.

Core Infrastructure Management: Oversee the management of compute, storage, network, and cloud infrastructure to ensure high availability and performance.

Talent Management

Attract, hire, and retain top SRE talent.

Provide coaching, mentorship, and career development for team members.

Set clear performance goals and conduct regular evaluations.

Required Skills And Experience

8 plus years in proven experience leading SRE or similar engineering teams.

Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation).

Strong background in network protocols, routing, switching, and security.

Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk).

Experience with incident management and root cause analysis.

Solid understanding of containerization (Docker, Kubernetes) and orchestration.

Familiarity with service mesh technologies (Istio, Linkerd).

Excellent problem-solving skills and the ability to manage complex, cross-functional projects.

Strong communication skills and the ability to work with diverse teams.

Experience in Application Support: Demonstrated experience in supporting applications, including understanding application lifecycle management and ensuring reliability.

Salary : $150,000 - $190,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Engineering Director - Site Reliability - Requsition 24015266?

Sign up to receive alerts about other jobs on the Engineering Director - Site Reliability - Requsition 24015266 career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$154,184 - $199,940
Income Estimation: 
$189,563 - $242,917
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at hackajob

  • hackajob Wilmington, DE
  • hackajob is collaborating with J.P. Morgan to connect them with exceptional professionals for this role. Job Description Embrace the challenge of orchestra... more
  • 12 Days Ago

  • hackajob Wilmington, DE
  • hackajob is collaborating with J.P. Morgan to connect them with exceptional professionals for this role. Job Description Bring your expertise to JPMorgan C... more
  • 12 Days Ago

  • hackajob Wilmington, DE
  • hackajob is collaborating with J.P. Morgan to connect them with exceptional professionals for this role. Job Description Job Description Join the Finance D... more
  • 12 Days Ago

  • hackajob Newark, DE
  • hackajob is collaborating with J.P. Morgan to connect them with exceptional professionals for this role. Job Description Posting Description: The Firmwide ... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Engineering Director - Site Reliability - Requsition 24015266 jobs in the York, NY area that may be a better fit.

  • Affirm York, NY
  • Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or comp... more
  • 17 Days Ago

  • Forhyre York, NY
  • Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to imp... more
  • 8 Days Ago

AI Assistant is available now!

Feel free to start your new journey!