What are the responsibilities and job description for the Site Reliability Engineer/DevOps Engineer (Required Local Candidates with prefer Public Sector exp) position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, TexcelVision Inc., is seeking the following. Apply via Dice today!
Respond by: 04/07/2026
Rate: DOE
Type: Contract
Work Mode: Hybrid
Location: Austin TX
Please respond with resume and 3 references preferably supervisor (name, title, company, email, phone number)
Background Check will be performed if a candidate is selected for placement and will have to be passed
Job Description
Site Reliability Engineer will be responsible for ensuring the reliability, availability, performance, and scalability of production systems by applying software engineering practices to infrastructure and operations. Partners with development teams to build resilient, observable, and automated platforms that meet defined service level objectives (SLOs).
Ii. Candidate Skills And Qualifications
Minimum Requirements:
Candidates that do not meet or exceed the minimum stated requirements (skills/experience) will be displayed to customers but may not be chosen for this opportunity. Years Required/Preferred Experience 8 Required experience in systems engineering, DevOps, or site reliability engineering roles 8 Required Strong experience with Linux/Unix systems and system internals 8 Required Proficiency in one or more programming/scripting languages (Python, Go, Java, Bash) 8 Required Experience designing and operating highly available, distributed systems 8 Required Strong knowledge of cloud platforms (AWS, or Google Cloud Platform) and cloud-native services 8 Required Experience with containerization and orchestration (Docker, Kubernetes) 8 Required Strong understanding of monitoring, alerting, and logging concepts 8 Required Experience defining and managing SLIs, SLOs, and error budgets 8 Required Familiarity with incident management, root cause analysis (RCA), and postmortems 8 Required Experience integrating security and compliance into operational workflows 4 Preferred Familiarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk) 4 Preferred Experience operating 24x7 production environments with on-call rotations 4 Preferred Experience with chaos engineering and resiliency testing 4 Preferred Experience with feature flags, canary deployments, and progressive delivery 4 Preferred Strong documentation skills for runbooks, dashboards, and operational standards
III. TERMS OF SERVICE
Services are expected to start 05/01/2026 and are expected to complete by 08/31/2026. Total estimated hours per Candidate shall not exceed 780 hours. This service may be amended, renewed, and/or extended providing both parties agree to do so in writing.
IV. WORK HOURS AND LOCATION
Respond by: 04/07/2026
Rate: DOE
Type: Contract
Work Mode: Hybrid
Location: Austin TX
Please respond with resume and 3 references preferably supervisor (name, title, company, email, phone number)
Background Check will be performed if a candidate is selected for placement and will have to be passed
Job Description
Site Reliability Engineer will be responsible for ensuring the reliability, availability, performance, and scalability of production systems by applying software engineering practices to infrastructure and operations. Partners with development teams to build resilient, observable, and automated platforms that meet defined service level objectives (SLOs).
Ii. Candidate Skills And Qualifications
Minimum Requirements:
Candidates that do not meet or exceed the minimum stated requirements (skills/experience) will be displayed to customers but may not be chosen for this opportunity. Years Required/Preferred Experience 8 Required experience in systems engineering, DevOps, or site reliability engineering roles 8 Required Strong experience with Linux/Unix systems and system internals 8 Required Proficiency in one or more programming/scripting languages (Python, Go, Java, Bash) 8 Required Experience designing and operating highly available, distributed systems 8 Required Strong knowledge of cloud platforms (AWS, or Google Cloud Platform) and cloud-native services 8 Required Experience with containerization and orchestration (Docker, Kubernetes) 8 Required Strong understanding of monitoring, alerting, and logging concepts 8 Required Experience defining and managing SLIs, SLOs, and error budgets 8 Required Familiarity with incident management, root cause analysis (RCA), and postmortems 8 Required Experience integrating security and compliance into operational workflows 4 Preferred Familiarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk) 4 Preferred Experience operating 24x7 production environments with on-call rotations 4 Preferred Experience with chaos engineering and resiliency testing 4 Preferred Experience with feature flags, canary deployments, and progressive delivery 4 Preferred Strong documentation skills for runbooks, dashboards, and operational standards
III. TERMS OF SERVICE
Services are expected to start 05/01/2026 and are expected to complete by 08/31/2026. Total estimated hours per Candidate shall not exceed 780 hours. This service may be amended, renewed, and/or extended providing both parties agree to do so in writing.
IV. WORK HOURS AND LOCATION