What are the responsibilities and job description for the Cloud Automation Engineer position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Grepforce LLC, is seeking the following. Apply via Dice today!
Rate: $55/hr
Location: Raleigh, NC, United States
Duration: 1 year
Job Description:
We are seeking highly skilled Site Reliability Engineers to support the reliability, scalability, and performance of critical enterprise platforms. This role requires seasoned professionals with deep technical expertise across cloud infrastructure, operating systems, automation, and modern observability practices. The ideal candidate brings a disciplined engineering mindset, excels under pressure, and consistently drives operational excellence through metrics, automation, and continuous improvement. This is a hands-on, engineering-focused position working closely with cross-functional teams to ensure the seamless operation of complex, high-availability systems.
Responsibilities:
Design, implement, and maintain highly reliable, scalable, and secure systems across cloud and on-prem environments.
Manage and optimize distributed systems running on platforms such as Azure, Linux (RHEL7 ), and Windows Server (2019 ).
Build and improve automation workflows using scripting languages such as Python, Go, and Bash.
Develop Infrastructure-as-Code solutions using tools like Terraform and Ansible.
Define, monitor, and refine SLIs, SLOs, and SLAs to ensure consistent service quality.
Reduce operational toil through automation, tooling enhancements, and process improvements.
Integrate systems with observability platforms to ensure full operational visibility and proactive issue identification.
Troubleshoot complex incidents, lead structured incident response efforts, and conduct detailed post-mortem analyses.
Collaborate closely with software engineering, infrastructure, and business teams to deliver resilient and performant services.
Identify opportunities to optimize system reliability, performance, and maintainability, taking full ownership of problem spaces.
Rate: $55/hr
Location: Raleigh, NC, United States
Duration: 1 year
Job Description:
We are seeking highly skilled Site Reliability Engineers to support the reliability, scalability, and performance of critical enterprise platforms. This role requires seasoned professionals with deep technical expertise across cloud infrastructure, operating systems, automation, and modern observability practices. The ideal candidate brings a disciplined engineering mindset, excels under pressure, and consistently drives operational excellence through metrics, automation, and continuous improvement. This is a hands-on, engineering-focused position working closely with cross-functional teams to ensure the seamless operation of complex, high-availability systems.
Responsibilities:
Design, implement, and maintain highly reliable, scalable, and secure systems across cloud and on-prem environments.
Manage and optimize distributed systems running on platforms such as Azure, Linux (RHEL7 ), and Windows Server (2019 ).
Build and improve automation workflows using scripting languages such as Python, Go, and Bash.
Develop Infrastructure-as-Code solutions using tools like Terraform and Ansible.
Define, monitor, and refine SLIs, SLOs, and SLAs to ensure consistent service quality.
Reduce operational toil through automation, tooling enhancements, and process improvements.
Integrate systems with observability platforms to ensure full operational visibility and proactive issue identification.
Troubleshoot complex incidents, lead structured incident response efforts, and conduct detailed post-mortem analyses.
Collaborate closely with software engineering, infrastructure, and business teams to deliver resilient and performant services.
Identify opportunities to optimize system reliability, performance, and maintainability, taking full ownership of problem spaces.
Salary : $55