What are the responsibilities and job description for the Senior Site Reliability Engineer position at Brooksource?
Senior Site Reliability Engineer
Hybrid (3 days / week) in downtown Detroit
Are you passionate about ensuring the reliability and scalability of complex systems? Do you thrive on implementing efficient solutions to prevent and resolve incidents? We are seeking a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team.
The Work
- Collaborate with cross-functional teams to design, build, and maintain robust, scalable, and fault-tolerant systems
- Work closely with development teams and architects to advocate for reliability best practices during the application development lifecycle
- Design and implement monitoring and alerting to provide real-time visibility into user experience and system health and performance
- Monitor and analyze system performance, proactively identifying potential issues and implementing solutions to ensure optimal performance and reliability
- Develop and maintain automated tools and processes to streamline operational tasks and reduce manual interventions
- Participate in incident response and post-mortems, contributing to continuous improvement efforts
- Conduct capacity planning and resource optimization to handle growing demands on our infrastructure
- Continuously research and evaluate new technologies and practices to enhance the reliability and efficiency of our systems
- Conduct capacity planning and resource optimization to handle growing demands on our infrastructure
- Continuously research and evaluate new technologies and practices to enhance the reliability and efficiency of our systems
The Skills You Bring
- Bachelor's degree in Computer Science, Engineering, or related fields preferred (or equivalent practical experience)
- Strong verbal and written communication skills
- Experience of overall 2-4 years of managing an SRE or DevOps team with observability workload.
- 2-4 years of Agile Management owning SRE roadmaps and deliverables using Scrum / Kanban
- 2-4 years of delivering projects alongside a constant flow of side intake and production response workloads
- Experience presenting to leadership and collaborate effectively/communicate technical concepts to non-technical business stakeholders
- Proven 5 years' experience as a Site Reliability Engineer or similar role in a production environment
- Applied AWS/Cloud Certification (AWS Cloud Architect, DevOps/SysOps) including experience with ASG, Fargate, Lambda, Aurora DB, Dynamo DB, ALB/NLB
- 5 years' working experience with CI/CD pipelines (Gitlab) and developing infrastructure-as-code (Terraform, Python, Ansible, etc.)
- Applied experience with Linux and Windows platforms, Java EE, JavaScript, Spring, Spring Boot, REST API/Micro Services, Shell Scripting, Python, PL/SQL, and databases, specifically Oracle
- Working knowledge of observability platforms like Splunk, Dynatrace
- Working experience with designing Observability for enterprise applications
- Experienced knowledge of system administration, DevSecOps
- Development experience along with cloud and physical servers
- Understanding and experience working with business, product and engineering teams in developing SLI, SLO and SLA's
- Conduct capacity planning and resource optimization to handle growing demands on our infrastructure
Other Skills & Experience Desired
- Strong knowledge of Linux/Unix systems and network protocols
- Familiarity with cybersecurity best practices and principles
- Ability to lead triage calls including working across multiple divisions to resolve issues.
EEO Statement:
Eight Eleven Group, LLC is an equal opportunity employer that does not discriminate on the basis of actual or perceived race, color, creed, religion, national origin, ancestry, citizenship status, age, sex or gender (including pregnancy, childbirth, lactation and related medical conditions), gender identity or gender expression, sexual orientation, marital status, military service and veteran status, physical or mental disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances.
Benefits & Perks
Benefits & Perks: Eight Eleven Group, LLC offers competitive medical, dental, vision, Health Savings Account, Dependent Care FSA, and supplemental coverage with plans that can fit each employee’s needs. We offer a 401k plan that includes a company match and is fully vested after you become eligible, paid time off, sick time, and paid company holidays. We also offer an Employee Assistance Program (EAP) that provides services like virtual counseling, financial services, legal services, life coaching, etc.
Pay Disclaimer:
The pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Salary : $75 - $85