
Cloud Site Reliability Engineer (SRE) (1134534)

The Judge Group
Berkeley, NJ Full Time
POSTED ON 5/29/2026
AVAILABLE BEFORE 6/27/2026
Location: Berkeley Heights, NJ

Salary: $70.00 USD Hourly - $80.00 USD Hourly

Description

Job Title: Cloud Site Reliability Engineer (SRE)

Location: Berkeley Heights, NJ / Alpharetta, GA (Onsite 5 Days)

Duration: Contract To Hire

Job Description

Position Overview: We are seeking a Cloud Site Reliability Engineer (SRE) to drive the reliability, scalability, and performance of our cloud-based infrastructure.

The ideal candidate combines software engineering expertise with advanced systems operations skills to maintain highly available systems while reducing operational toil. This role involves automation, monitoring, capacity planning, incident response, and cloud platform management across a dynamic, distributed environment.

As a Cloud SRE, you will work closely with Engineering, Architecture, DevOps, and security teams to ensure seamless service experiences for our customers while contributing to platform design and operational efficiency.

Position Requirements: Our Engineers play a critical role in the success of our clients and are expected to effectively communicate our recommended solutions in a consultative role for each client. Therefore, a successful candidate will possess a high degree of self-management, personal accountability, strong communication skills, and teamwork. The ability to interact, engineer, and communicate collaboratively at the highest technical levels with customers, vendors, partners, and all members of staff is required.

Key Responsibilities

  • System Reliability & Availability: Design and maintain fault-tolerant, high-availability architectures across AWS, Azure, and GCP. Implement redundancy, load balancing, and automated failover strategies.
  • Cloud Infrastructure Management: Deploy, manage, and optimize cloud resources using IaC tools such as Terraform and Ansible.
  • Monitoring & Observability: Implement monitoring, alerting, and logging frameworks using Splunk, Azure Monitor, Dynatrace, AWS CloudWatch, or similar tools to detect and resolve issues proactively.
  • Incident Management: Lead real-time incident response, root-cause analysis, and postmortems to continuously improve uptime and resilience.
  • Capacity Planning & Scaling: Predict traffic patterns, optimize resource utilization, and enforce autoscaling and performance best practices.
  • Automation & Tooling: Develop scripts and internal tooling to automate routine tasks and reduce manual intervention. Languages may include Python, PowerShell, or Bash.
  • Security & Compliance: Collaborate with security teams to implement secure infrastructure practices including encryption, role-based access, auditing, and vulnerability management.
  • Collaboration & Mentorship: Work across engineering and DevOps teams, providing guidance on reliability best practices and mentoring junior SREs.

Required Skills & Qualifications

  • Programming & Scripting: Proficiency in Python, PowerShell, Bash, or equivalent for automation and system management.
  • Cloud Platforms: Hands-on experience with AWS, Azure, or GCP; strong understanding of VPCs, IAM, serverless architectures, and managed Kubernetes services.
  • Containers & Orchestration: Experience with Docker and Kubernetes.
  • Infrastructure as Code (IaC): Proficient in Terraform and Ansible.
  • Monitoring & Observability: Expertise with Splunk, Azure Monitor, Dynatrace, AWS CloudWatch, or similar tools.
  • Cloud Data Migration: Expert knowledge of, and practical experience with, cloud data migration tools.
  • Operating Systems: Advanced knowledge of Windows and Linux/Unix environments, with experience in system administration and networking fundamentals.
  • Incident Response: Strong problem-solving skills under pressure, with experience managing outages and mitigating risk.
  • Collaboration & Communication: Ability to articulate technical insights, coordinate across teams, and contribute to a blameless culture to resolve issues and drive consistent results.

Preferred Qualifications

  • Industry certifications such as AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, or Azure DevOps Engineer.
  • Exposure to chaos engineering or resilience testing frameworks.
  • Prior experience with multicloud deployments or hybrid cloud environments.
  • Familiarity with service-level objectives (SLOs), indicators (SLIs), and error budgets for service reliability.
  • Gather feedback from the department on areas of improvement and provide solutions utilizing Azure

By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.

Contact: vsingh02@judge.com

This job and many more are available through The Judge Group. Find us on the web at www.judge.com



