What are the responsibilities and job description for the Application Production Support Analyst-Hybrid Role position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Empower Professionals, is seeking the following. Apply via Dice today!
Role: Application Production Support Analyst / Incident Lead
Location: Bellevue, WA / Atlanta, GA / Kansas City, KS / Frisco, TX
Duration: 12 Months
Responsibilities
Skills: DevOps, production support, incident triage, Grafana, incident management, Unix, Splunk
Incident Management & Response
- Lead and manage Major Incident (P1/P2) bridges, ensuring fast triage and restoration
- Act as the Single Point of Contact (SPOC) during major incidents
- Ensure incidents are resolved within SLA timelines with clear communication throughout the lifecycle
- Coordinate with engineering, infrastructure, DevOps, and database teams during incidents
- Perform hands-on troubleshooting for microservices-based applications
- Analyze logs using Splunk, identify patterns, and isolate root causes
- Monitor application health via Grafana dashboards and s
- Support and debug Unix-based batch jobs, failures, and recoveries
- Query and analyze Cassandra DB for data validation and issue diagnosis
- Troubleshoot services deployed on AWS and Kubernetes (K8s)
Post-Incident & Problem Management
- Lead Root Cause Analysis (RCA) and post-incident reviews
- Track and ensure completion of corrective and preventive actions
- Identify recurring issues and partner with teams to eliminate systemic problems
- Contribute to automation and monitoring improvements to reduce MTTR
- Help refine incident processes, playbooks, and escalation models
- Support continuous improvements in observability and resilience
- 6-10 years of experience in Application Production Support or Incident Management
- Strong understanding of microservices architecture and distributed systems
- Splunk (advanced log analysis and querying)
- Grafana and monitoring tools
- Cassandra DB (strong querying and functional knowledge)
- Unix/Linux (batch jobs, shell scripting, troubleshooting)
- AWS (EC2, CloudWatch, core services)
- Kubernetes (K8s) and containerized environments
- Strong experience handling Major Incidents and production bridges
- Ability to work in 24x7 rotational shifts, including weekends
In compliance with the salary transparency law, the expected pay range for this role is $40-50. Actual compensation depends on experience and interview evaluation.
Thanks,
Piyush Verma, Lead Technical Recruiter | Empower Professionals
Official Phone: x 350
100 Franklin Square Drive, Suite 104 | Somerset, NJ 08873