What are the responsibilities and job description for the Site reliability Engineer position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Next Level Business Services, Inc., is seeking the following. Apply via Dice today!
Job Summary
As a Site reliability Engineer, you will be responsible for designing, implementing, and maintaining cloud infrastructure across AWS, Google Cloud Platform, and/or Azure. You will work closely with engineering and data teams to support scalable applications and data platforms while ensuring reliability, security, and cost optimization.
Key Responsibilities
Job Summary
As a Site reliability Engineer, you will be responsible for designing, implementing, and maintaining cloud infrastructure across AWS, Google Cloud Platform, and/or Azure. You will work closely with engineering and data teams to support scalable applications and data platforms while ensuring reliability, security, and cost optimization.
Key Responsibilities
- Design, provision, and manage cloud infrastructure across AWS, Google Cloud Platform, and/or Azure
- Build and maintain Infrastructure-as-Code (IaC) using Terraform or similar tools
- Deploy and manage containerized applications using Kubernetes and Docker
- Implement CI/CD pipelines and automate deployment processes
- Monitor system performance and maintain observability using tools like Grafana, Prometheus, or CloudWatch
- Ensure security best practices including IAM, network security, and secrets management
- Configure networking components such as VPCs, load balancers, and DNS
- Automate operational tasks using scripting languages like Python or Bash
- Collaborate with cross-functional teams to support application and data infrastructure needs
- Experience with at least one cloud platform (AWS, Google Cloud Platform, or Azure)
- Hands-on experience with Kubernetes and containerization
- Strong knowledge of Infrastructure-as-Code tools (Terraform preferred)
- Experience with CI/CD tools (GitHub Actions, Jenkins, GitLab CI, etc.)
- Familiarity with monitoring and logging tools
- Scripting experience (Python, Bash, or similar)
- Understanding of networking and security fundamentals
- Experience working in multi-cloud environments
- Exposure to FinOps or cloud cost optimization practices
- Knowledge of data platforms or data pipeline infrastructure
- Familiarity with GitOps practices and tools like Argo CD
- Cloud: AWS, Google Cloud Platform, Azure
- Containers: Kubernetes, Docker, Helm
- IaC: Terraform, CloudFormation
- CI/CD: GitHub Actions, Jenkins, GitLab CI
- Monitoring: Grafana, Prometheus, CloudWatch, Splunk
- Languages: Python, Bash