What are the responsibilities and job description for the Systems Reliability Engineer position at AllSTEM Connections?
Position Details:
Job Title: Systems Reliability Engineer
Location: Los Angeles, CA
Duration: 12-month contract with possibility of extension
Required:
- Must haves banking OR mortgage industry experience is preferred.
- Minimum of 5 years of experience in Site Reliability Engineering, IT operations, or related fields.
- Bachelor’s degree in computer science, engineering, or equivalent experience (2 additional years in lieu of degree).
- Technical expertise in system reliability, scalability, application design, and performance.
- Hands-on experience with observability and monitoring tools such as Grafana, AppDynamics, and Sumo Logic.
- Experience with automation platforms, particularly Ansible, for infrastructure and event-driven automation.
- Proven ability to mentor and guide engineers in adopting SRE practices and principles.
- Strong judgment and problem-solving capabilities.
- Experience working in multi-cloud environments.
Preferred:
- Experience applying ITIL, SRE and IT process best practices.
- Experience in tracking major incidents, rollbacks, and hotfixes; leading root cause analysis (RCA) processes; and ensuring resolution and completion of action items.
- Experience with technical engineering in IT operations.
Salary : $50 - $70