What are the responsibilities and job description for the Site Reliability Engineer (SRE) with a performance engineering position at Proventus Metrics?
Site Reliability Engineer (SRE) with Performance Engineering
Englewood Cliffs, NJ
Introduction:
As a Site Reliability Engineer (SRE) with a focus on performance engineering, you will play a crucial role in ensuring the high availability and reliability of services through SRE principles and performance optimization. You will work closely with teams to enhance observability, establish performance KPIs, and conduct various types of performance testing to identify and resolve bottlenecks.
Responsibilities:
- Ensure high availability and resilience of services using SRE principles
- Identify and fix systemic reliability issues through architectural improvements
- Develop and enhance observability dashboards using tools like AppDynamics, Grafana, ELK, Splunk, etc.
- Establish Performance KPIs, monitor trends, and set alerting mechanisms for proactive detection
- Collaborate with teams to implement distributed tracing and full stack performance visibility
- Improve frontend performance and Core Web Vitals
- Perform end-to-end performance analysis across application, database, infrastructure, and cloud layers
- Identify and resolve bottlenecks using APM & profiling tools
- Optimize JVM, threads, queries, caching, and microservices performance
- Conduct load, stress, soak, and scalability testing using tools like JMeter, Gatling, K6
Requirements:
Required Skills:
- Strong knowledge of RUM and APM tools such as AppDynamics
- Hands-on experience with Cloud platforms like AWS or Azure
- Understanding of Core Web Vitals, browser performance, and CDN optimization
- Experience with frontend profiling tools like Lighthouse and Chrome Dev Tools
- Performance scripting skills using tools like JMeter, K6, Gatling
- Dashboarding skills with monitoring tools like AppDynamics, Quantum metrics, ELK, Grafana
Preferred Skills:
- Experience with SQL/NoSQL databases and caching strategies
- Knowledge of frontend performance optimization techniques
- Experience with APM and profiling tools such as Dynatrace, Datadog, etc.