Recent Searches

You haven't searched anything yet.

15 Staff Site Reliability Engineer Jobs in Atlanta, GA

SET JOB ALERT
Details...
Kaleyra
Atlanta, GA | Full Time
$96k-112k (estimate)
Just Posted
Confluent
Atlanta, GA | Full Time
$88k-104k (estimate)
1 Week Ago
First Advantage
Atlanta, GA | Full Time
$151k-169k (estimate)
Just Posted
Diversity Resource Staffing Inc
Atlanta, GA | Full Time
$67k-79k (estimate)
4 Months Ago
Flock Safety
Atlanta, GA | Full Time
$116k-142k (estimate)
7 Days Ago
Innova Solutions
Atlanta, GA | Full Time
$97k-114k (estimate)
1 Week Ago
Hermeus
Atlanta, GA | Full Time
$110k-131k (estimate)
1 Week Ago
Jobs for Humanity
Atlanta, GA | Full Time
$104k-122k (estimate)
2 Weeks Ago
GreenSky Administrative Services LLC
Atlanta, GA | Other
$86k-101k (estimate)
4 Weeks Ago
LTIMindtree
Atlanta, GA | Full Time
$86k-101k (estimate)
2 Months Ago
Diversity Resource Staffing Inc
Atlanta, GA | Full Time
$79k-93k (estimate)
4 Months Ago
FIS Global
Atlanta, GA | Other | Full Time
$109k-127k (estimate)
7 Months Ago
Now100
Atlanta, GA | Other
$98k-112k (estimate)
11 Months Ago
Bohler
Atlanta, GA | Full Time
$82k-101k (estimate)
3 Months Ago
Experient Group
Atlanta, GA | Full Time
$106k-134k (estimate)
1 Month Ago
Staff Site Reliability Engineer
$67k-79k (estimate)
Full Time 4 Months Ago
Save

Diversity Resource Staffing Inc is Hiring a Staff Site Reliability Engineer Near Atlanta, GA

Job Description Summary
The Site Reliability Engineering team is responsible for the availability and reliability of our worldwide Cloud based applications and platform. We obsess over availability by building tools, engineering new systems to automate our platform/apps, and are given the freedom to cut across all organizations to identify availability impediments and drive them to closure. We are software engineers with full visibility and influence across the entire technical stack. We strive to ensure some of the biggest companies / Industries in the world always have reliable access to the software & solutions that power their businesses.
Job Description
Roles and Responsibilities
In this role, you will:
• Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria
• Develop automated solutions to address potential problems before they result in a service interruption
• Provide impact assessment and mitigation plan for changes going into the production environment
• Investigate root cause of severe and systemic outages, identify corrective actions and apply across the enterprise
• Develop availability measures that align with consumer experience to accurately assess the usability of crucial services
• Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages
• Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages
• Analyze failure points in services to model risk level and resolution steps if failure occurs.
• Assist in driving architecture enhancements into system to mitigate potential failure points.
• Programmatically monitor for and remediate configuration drift of critical devices
• Develop response plans to potential failure points and evaluate effectiveness during planned tests
• Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture
• Provide technical coaching and direction to more junior teammates
Eligibility Requirements: “Legal authorization to work in the U.S. is required. We will not sponsor individuals for employment visas, now or in the future, for this job.”
Education Qualification
Bachelor's Degree in Computer Science or “STEM” Majors (Science, Technology, Engineering and Math) with minimum 6 years of experience
Desired Characteristics
Technical Expertise:
• Experience with configuring, customizing, and extending monitoring tools (Datadog, Sensu, Grafana, Splunk, etc.)
• Excellent knowledge of common operating systems (Unix/Linux, Windows)
Strong oral and written communication skills.
• Demonstrated experience scripting or developing software and services for the cloud Ruby, Python, Go, Java, Node.js, .NET, etc.
• Extensive knowledge of network protocols (TCP/IP, SNMP, FTP, syslog, TFTP, etc.
• Experience managing version control systems such as Git
• Experience deploying and managing infrastructure on public clouds such as AWS or Azure
• Experience using an automated configuration management system (Terraform, Chef, Puppet, Ansible, Salt, etc.)
• Strong organizational and project management skills
• Strong analytical and problem resolution skills
• Excellent knowledge of Network Management (SNMP, MIB)
• Excellent knowledge of TCP/IP networking, and inter-networking technologies (routing/switching, proxy, firewall, load balancing etc.)
• Knowledge and experience using Analytics Software Packages like Matlab, SAS, JMPro etc. Programming experience with open source scripting and data analysis packages like Python, R is a plus.
 

Job Summary

JOB TYPE

Full Time

SALARY

$67k-79k (estimate)

POST DATE

12/13/2023

EXPIRATION DATE

05/04/2024

WEBSITE

danifdesign.com.br

Show more

Diversity Resource Staffing Inc
Full Time
$91k-108k (estimate)
2 Months Ago
Diversity Resource Staffing Inc
Full Time
$90k-108k (estimate)
2 Months Ago
Diversity Resource Staffing Inc
Full Time
$93k-117k (estimate)
4 Months Ago

The job skills required for Staff Site Reliability Engineer include AWS, Python, Linux, Ansible, Configuration Management, Computer Science, etc. Having related job skills and expertise will give you an advantage when applying to be a Staff Site Reliability Engineer. That makes you unique and can impact how much salary you can get paid. Below are job openings related to skills required by Staff Site Reliability Engineer. Select any job title you are interested in and start to search job requirements.

For the skill of  AWS
BizTek People, Inc. | APA International Placement Consultants
Full Time
$132k-161k (estimate)
5 Months Ago
For the skill of  Python
Promantus
Full Time
$107k-132k (estimate)
1 Month Ago
For the skill of  Linux
Exemplar ITS
Contractor
$58k-72k (estimate)
1 Month Ago
Show more

The following is the career advancement route for Staff Site Reliability Engineer positions, which can be used as a reference in future career path planning. As a Staff Site Reliability Engineer, it can be promoted into senior positions as a Corrosion Engineer I that are expected to handle more key tasks, people in this role will get a higher salary paid than an ordinary Staff Site Reliability Engineer. You can explore the career advancement for a Staff Site Reliability Engineer below and select your interested title to get hiring information.

Flock Safety
Remote | Full Time
$116k-142k (estimate)
7 Days Ago
Innova Solutions
Full Time
$97k-114k (estimate)
1 Week Ago