What are the responsibilities and job description for the Senior Dev Operations Engineer-SRE (CR260) position at SoftSol?
Job Summary
- The role is for a Senior Dev Operations Engineer SRE (CR260), a position that is remote and long-term.
- The applicant will be a lead on the DevOps team, responsible for system administration areas including the monitoring, installation, configuration, maintenance, operations, and architecture of AWS cloud and on-premise environments.
- The candidate must have experience in setting up alerts/alarms/notifications in AWS Cloud, using AWS services including Kafka, ECS, EKS, and with Infrastructure as Code (IaC) CDK or Terraform.
- The candidate should have 6 years of overall IT experience and 4 years of AWS Cloud management experience. AWS Certified DevOps and/or Solution Architect certification is a must.
- The candidate is expected to have experience in AWS provisioning, operations, and management of AWS environments, setting up/maintaining multi AZ infrastructure including HA and DR in AWS, and experience with code repositories Azure DevOps Server, GIT, GITLab, SVN.
- The candidate should have strong scripting skills, particularly in Python, knowledge of networking, load balancing and firewalls, high-level understanding of networking standard protocols and components, experience in software development, and familiarity with deploying and configuring Java and .Net applications.
- The role involves monitoring sites, environments, and software, measurement, optimization, and tuning of system performance, automating system and application monitoring, anticipating potential problems, conducting post-incident reviews and Root Cause Analysis, documenting work, coding automation, implementing production monitoring systems, and carrying out security assessments.
- The candidate will also be responsible for server maintenance, design, implementation, and support of large scale web farm infrastructure, helping engineering implement new technologies, analyzing and designing infrastructure, triaging and providing technical solutions, supporting developers, and authoring internal documentation.
- The candidate is expected to provide 24x7 production support.