Site Reliability Engineer (SRE) – Observability Specialist
Location: Charlotte, NC 28277 (Remote)
Duration: Contract
Work Type: W2 Only
Remote: Yes (Onshore only)
Overview
We are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in observability, excellent communication skills, and the ability to influence reliability maturity across multiple engineering teams. This role requires someone who can blend technical depth with strategic thinking to improve reliability, visibility, and operational excellence.
Key Responsibilities
Observability Engineering
- Design, scale, and maintain Prometheus and Grafana monitoring environments.
- Build advanced PromQL queries, dashboards, alerts, and visualization layers.
- Manage and optimize Grafana instances used by multiple engineering teams.
- Utilize Dynatrace for metrics analysis, performance insights, dashboards, and reporting.
- Analyze telemetry to identify “metrics that matter” (MTM) and deliver actionable insights.
Site Reliability Engineering
- Apply and mature SRE practices using a structured SRE Maturity Model.
- Define, implement, and monitor Service Level Objectives (SLOs) and error budgets.
- Partner with engineering, product, operations, and leadership teams to improve service reliability.
- Reduce operational toil through automation and process improvements.
- Support incident reviews, root cause analysis, and continuous improvement initiatives.
Required Skills & Experience
- Strong understanding of SRE principles, reliability maturity models, and operational best practices.
- Proven experience improving application reliability through data-driven decisions.
- Hands-on expertise in Prometheus, Grafana, and PromQL.
- Strong knowledge of Dynatrace, metrics analysis, observability tools, and monitoring strategies.
- Excellent communication skills, with the ability to explain complex technical topics clearly.
- Strong problem-solving skills and a proactive approach to reliability challenges.
Nice to Have
- Experience with Kubernetes, cloud platforms (AWS/GCP/Azure), or CI/CD pipelines.
- Background in automation and scripting.
- Experience working with large-scale distributed systems or high-availability environments.
PriceSenz is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, or disability.