Production Support Engineer (DevOps / Streaming Platform)

Jobs via Dice
Atlanta, GA Full Time
POSTED ON 6/15/2026
AVAILABLE BEFORE 7/12/2026
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Kani Solutions, is seeking the following. Apply via Dice today!

Job Title: Production Support Engineer (DevOps / Streaming Platform)

Location: Atlanta, GA (Hybrid)

Hire Type: Contract (6 months)

Summary

Client is building a dedicated Operational Support (L2) team responsible for the stability, availability, and operational excellence of their 24/7 live video streaming, ads, player, and real-time delivery platforms.

As an Operational Support Engineer (L2), you take end-to-end ownership of customer-impacting production incidents once they are triaged by Level 1 support. You operate directly on production systems, lead live incident resolution, and act as the operational bridge between Support, Engineering, DevOps, and customers, particularly during high-impact live events.

This is a hands-on, customer-facing role focused on incident ownership, production operations, automation, and operational scalability, not just reactive troubleshooting.

Key Responsibilities:

Incident & Operational Support

Take ownership of escalated customer issues from Level 1 Support and drive them to resolution

Troubleshoot and resolve complex, high-impact production incidents affecting live streams, VOD playback, ad insertion, DRM, and real-time WebRTC services

Operate directly on production environments following established operational procedures: make configuration changes, CDN adjustments, and corrective actions, and execute mitigations and emergency changes during live incidents when customer impact requires immediate action

Lead or actively contribute to live incident bridges involving customers, internal teams, and partners

Provide clear, timely communication during incidents, including status updates and customer-facing explanations

Infrastructure as Code & Production Operations

Work fluently with Infrastructure as Code (IaC) to understand, troubleshoot, and safely modify production environments

Leverage tools and frameworks such as:

  • Terraform
  • Helm
  • Kubernetes manifests
  • GitOps workflows
  • CI/CD and deployment pipelines

Use IaC as the primary mechanism for safe, auditable, and repeatable operational changes

Collaborate with Engineering and DevOps to improve deployment reliability and operational safety

Validate and execute infrastructure or configuration changes through codified workflows

AI-Driven Operations & Automation

Leverage AI tools and automation to enhance operational efficiency and incident response

Contribute to and use:

  • AI-assisted incident triage and classification
  • Automated runbook execution
  • AI-based pattern detection across incidents
  • Intelligent alert correlation and noise reduction

Use AI to:

  • Generate or improve incident communications
  • Accelerate troubleshooting workflows
  • Identify recurring patterns and systemic issues

Drive adoption of automation-first and AI-augmented operational practices

Pre-Event Planning & Operational Readiness

Participate in pre-event readiness planning for critical customer events

Validate system readiness through:

  • Runbook checks
  • Monitoring coverage validation
  • Risk identification and mitigation planning

Define and rehearse incident response strategies for high-risk scenarios

Collaborate with customers and internal teams to ensure smooth event execution

On-Call & 24/7 Operations

Participate in a 24/7 on-call rotation, including nights, weekends, and holidays, as part of a global support model

Ensure smooth handovers between shifts and regions

Respond to critical alerts within defined SLAs for stream health, player errors, and delivery infrastructure

Root Cause & Continuous Improvement

Perform or contribute to root cause analysis (RCA) for production incidents

Document findings, corrective actions, and preventive measures

Identify recurring issues and work with Engineering and Product teams to eliminate them permanently

Contribute to and improve runbooks, operational playbooks, and knowledge bases for all products (player, ads, live, and real-time streaming)

Collaboration & Engineering Feedback Loop

Work closely with Engineering teams to escalate defects, validate fixes, and support production deployments

Provide feedback on system observability, tooling gaps, and operational risks

Act as the operational voice during post-incident reviews

Required Skills & Experience:

5 years of relevant experience in operational, support, or similar customer‑facing roles

Proven ability to own complex problems end‑to‑end and operate with a high degree of autonomy

Strong experience supporting production video streaming platforms, OTT services, or live systems

Solid troubleshooting skills across distributed systems (APIs, microservices, cloud infrastructure)

Familiarity with HLS, DASH, CMAF, WebRTC, DRM, and CDN architectures

Experience working with monitoring, alerting, and logs to diagnose live incidents (Grafana, Kibana/ELK, Prometheus, Loki)

Ability to correlate backend streaming metrics, player telemetry, and CDN signals to diagnose live customer issues end-to-end

Comfort performing controlled changes in production environments

Working knowledge of incident management and on-call operations

Operational Mindset:

Proven ability to remain calm, structured, and decisive during high-pressure incidents

Strong sense of ownership and accountability for customer outcomes

Excellent written and verbal communication skills, including customer-facing communication during incidents
