Demo

Cloud Infrastructure Site Reliability Engineer (SRE)

Intelliswift - An LTTS Company
Berkeley, NJ Full Time
POSTED ON 1/13/2026
AVAILABLE BEFORE 2/15/2026

Job Posting Title: Cloud Infrastructure Site Reliability Engineer (SRE)

Location: Alpharetta, GA or Berkeley Heights, NJ


Position Summary:

As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, you will be responsible for operating infrastructure solutions, following the principles and practices pioneered by Google’s SRE model. Your work will ensure our cloud services meet uptime, reliability, and performance targets, and you will drive automation and continuous improvement across our production environments. This role will involve collaborating with cross-functional teams to enhance our cloud reliability posture and streamline processes through automation.


Key Responsibilities:

• Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.

• Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible).

• Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.

• Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.

• Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.

• Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.

• Document operational processes and system architectures to ensure knowledge sharing and repeatability.

• Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.

Qualifications:

• Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.

• 3 years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C ).

• Experience administrating cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.

• Solid understanding of Linux systems, networking fundamentals, virtualized, and distributed systems, file systems, system processes and configurations.

• Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs.

• Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.

• Ability to manage and respond to incidents, perform root cause analysis, and implement post-mortem reviews.

• Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.


Additional Qualifications a Plus:

• Experience working with enterprise-scale financial services or other regulated industries

• 5 years of experience in SRE, DevOps, infrastructure, or cloud engineering roles, preferably supporting large-scale, distributed systems.

• Excellent problem-solving, troubleshooting, and communication skills.

• Experience leading technical projects or mentoring junior engineers.

• Certifications: Certified Engineer, DevOps, SRE, CSREF

Salary.com Estimation for Cloud Infrastructure Site Reliability Engineer (SRE) in Berkeley, NJ
$103,676 to $122,421
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Cloud Infrastructure Site Reliability Engineer (SRE)?

Sign up to receive alerts about other jobs on the Cloud Infrastructure Site Reliability Engineer (SRE) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$95,407 - $122,738
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Intelliswift - An LTTS Company

  • Intelliswift - An LTTS Company Madison, WI
  • Pay rate range - $100/hr. to $130/hr. on W2 Onsite Job Description The Life Insurance Medical Consultant provides expert support to our underwriting team b... more
  • 13 Days Ago

  • Intelliswift - An LTTS Company Redmond, WA
  • Job Title: Optical Engineer, Design Validation Location: Onsite in Redmond, WA Duration: 24 Months We are looking for a candidate with experience in novel ... more
  • 13 Days Ago

  • Intelliswift - An LTTS Company Sunnyvale, CA
  • ASIC Design Verification Engineer Fulltime role with L&T Technology Services (LTTS) Sunnyvale, California - Onsite Note: No hybrid or remote Job Descriptio... more
  • 13 Days Ago

  • Intelliswift - An LTTS Company Milpitas, CA
  • Job Title: Electro-Mechanical Assembler Duration: 12 Months Location: Milpitas, CA Pay Rate: $25- $27/hr. Intelliswift Software Inc. conceptualizes, builds... more
  • 13 Days Ago


Not the job you're looking for? Here are some other Cloud Infrastructure Site Reliability Engineer (SRE) jobs in the Berkeley, NJ area that may be a better fit.

  • Blankfactor Berkeley, NJ
  • This position is as a full time position supporting the financial services/payments space and is fully onsite 5 days per week with some on call support (ro... more
  • 26 Days Ago

  • Jobs via Dice Holmdel, NJ
  • Job Overview We are seeking a skilled Engineer, Site Reliability (SRE) to contribute to the reliability, scalability, and performance of our multi-cloud Sa... more
  • 5 Days Ago

AI Assistant is available now!

Feel free to start your new journey!