Recent Searches

You haven't searched anything yet.

9 Software Engineer (Site Reliability) Operations Lead, Enterprise Systems Jobs in Austin, TX

SET JOB ALERT
Details...
Apple
Apple
Austin, TX | Full Time
$93k-112k (estimate)
Just Posted
Apple
Apple
Austin, TX | Full Time
$94k-113k (estimate)
5 Days Ago
Apple
Apple
Austin, TX | Full Time
$106k-116k (estimate)
6 Days Ago
Apple
Apple
Austin, TX | Full Time
$111k-134k (estimate)
6 Months Ago
Apple
Apple
Austin, TX | Full Time
$111k-134k (estimate)
3 Months Ago
Apple
Apple
Austin, TX | Full Time
$119k-143k (estimate)
2 Months Ago
Apple
Apple
Austin, TX | Full Time
$111k-134k (estimate)
3 Months Ago
Apple
Apple
Austin, TX | Full Time
$115k-139k (estimate)
4 Months Ago
Tesla Motors
Austin, TX | Full Time
$110k-131k (estimate)
2 Months Ago
Software Engineer (Site Reliability) Operations Lead, Enterprise Systems
Apple
Apple Austin, TX
Apply
$106k-116k (estimate)
Full Time 6 Days Ago
Save

Apple is Hiring a Software Engineer (Site Reliability) Operations Lead, Enterprise Systems Near Austin, TX

Summary
Posted: Mar 18, 2024
Role Number: 200526598
Conversational Engineering develops next generation communications, AI, and NLP solutions to support Apple Customers. Our mission is to maintain a comprehensive and effective support, sales & payment experience for customers around the globe. Our conversational engineering platform is growing rapidly to support new channels and regions. We are looking for a hands-on site reliability engineer operations lead who is passionate about designing, developing, and deploying cutting edge operations solutions which will impact millions of customers! We are seeking an experienced and dynamic Site Reliability Engineer (SRE) Operations Lead to lead our efforts in maintaining the reliability, availability, and performance of our systems. The ideal candidate will possess a strong background in production monitoring, a deep understanding of development and operations, and a proven track record in managing large-scale production systems. The SRE Operations Lead will play a crucial role in leading incident management from detection to resolution, ensuring the seamless operation of our systems and infrastructure.
Key Qualifications
  • Proven experience as a Site Reliability Engineer or similar role, with a focus on operations management.
  • Demonstrated experience managing large-scale production outages and leading incident response.
  • Deep understanding of production monitoring systems, log analysis, and performance metrics.
  • Proficient in scripting languages (e.g., Python, Bash) and automation tools.
  • Strong leadership and communication skills with the ability to effectively collaborate with cross-functional teams.
  • Experience mentoring and coaching team members to enhance overall performance.
  • Strong analytical and problem-solving skills with a proactive approach to identifying and addressing potential issues.
  • Ability to thrive in a fast-paced, dynamic environment and adapt to evolving technologies and business needs.
Description
Incident Management: Lead and coordinate incident response activities, ensuring timely detection, escalation, and resolution of production issues. Collaborate with cross-functional teams to mitigate the impact of incidents and prevent recurrence. Production Monitoring: Design, implement, and maintain robust production monitoring systems to proactively identify potential issues before they impact users. Analyze monitoring data to identify trends, patterns, and areas for improvement in system reliability. Operations Leadership: Provide technical leadership to the SRE team, fostering a culture of continuous improvement and innovation. Collaborate with development teams to integrate reliability best practices into the software development lifecycle. Capacity Planning: Work closely with infrastructure and capacity planning teams to ensure scalability and performance of systems. Proactively identify and address potential capacity issues before they impact system performance. Documentation: Maintain comprehensive documentation of system architecture, configurations, and procedures to facilitate efficient incident response and knowledge sharing. Collaboration: Collaborate with cross-functional teams, including development, QA, and product management, to drive improvements in system reliability and performance. Post-Incident Analysis: Conduct thorough post-incident analyses to identify root causes, contributing factors, and implement preventive measures to avoid recurrence.
Education & Experience
Bachelor's degree in Computer Science, Information Technology, or a related field OR equivalent work experieince
Additional Requirements
  • Certifications (Optional):
  • Relevant certifications in SRE, DevOps, or related fields would be a plus.
  • Advanced degree preferred.

Job Summary

JOB TYPE

Full Time

SALARY

$106k-116k (estimate)

POST DATE

04/28/2024

EXPIRATION DATE

05/17/2024

WEBSITE

applegroup.com

HEADQUARTERS

JONESBORO, AR

SIZE

25 - 50

FOUNDED

1993

TYPE

Private

CEO

MARY MARGARET SCHOLTENS

REVENUE

<$5M

INDUSTRY

Retail

Related Companies
About Apple

Apple Group provides embroidery and screen printing.

Show more

Apple
Part Time
$82k-105k (estimate)
Just Posted
Apple
Temporary
$78k-102k (estimate)
Just Posted
Apple
Full Time
$114k-137k (estimate)
Just Posted