What are the responsibilities and job description for the Staff Software Engineer, Production Engineering position at Uber?
About The Role
Engineering at Uber means building for real-world impact under real-world constraints. The problems are complex, the systems are massive, and the pace is fast. You'll need to make smart decisions with imperfect information, and own them. If you think in systems, thrive in ambiguous, high-stakes environments, and care about building things that actually work, this is where you'll grow.
Core Services Production Engineering is a software engineering team, not a traditional operations or sysadmin team. We solve reliability problems by writing code: chaos testing tooling, traffic engineering and dependency safety frameworks, automation tooling and observability primitives. As a Production Engineer, you will blend multiple domains of software engineering to ensure Uber's services run reliably at massive scale, while improving compute efficiency and accelerating developer productivity for a platform that serves millions of users around the world.
Our Main Focus Areas
What The Candidate Will Need / Bonus Points
---- What the Candidate Will Do ----
Engineering at Uber means building for real-world impact under real-world constraints. The problems are complex, the systems are massive, and the pace is fast. You'll need to make smart decisions with imperfect information, and own them. If you think in systems, thrive in ambiguous, high-stakes environments, and care about building things that actually work, this is where you'll grow.
Core Services Production Engineering is a software engineering team, not a traditional operations or sysadmin team. We solve reliability problems by writing code: chaos testing tooling, traffic engineering and dependency safety frameworks, automation tooling and observability primitives. As a Production Engineer, you will blend multiple domains of software engineering to ensure Uber's services run reliably at massive scale, while improving compute efficiency and accelerating developer productivity for a platform that serves millions of users around the world.
Our Main Focus Areas
- Chaos testing tooling and applications
- Dependency Safety frameworks and tooling
- Observability and alerting platforms
- Graceful degradation
What The Candidate Will Need / Bonus Points
---- What the Candidate Will Do ----
- Design, build, and maintain software to increase the reliability, scalability, and efficiency of thousands of stateless and stateful production services spread across multiple datacenter zones and regions.
- Lead initiatives end-to-end within the team, the Production Engineering org, and across engineering at large to increase reliability through automation, setting standards, developer tooling, and reusable frameworks.
- Work with other engineers to deeply understand their services and guide them towards practical and reliable architecture and implementation.
- Apply SRE concepts such as observability, integration/load/chaos testing, on-call, incident management, failovers, and disaster recovery to improve mean time between failures (MTBF), time to detection (TTD), and time to mitigation (TTM) of incidents.
- Participate in on-call rotations, responding to and leading mitigation of production incidents, and driving post-incident improvements.
- Staff Engineer L5B 8 years of experience in Go, Java, Python, or similar language
- Experience in delivering solutions end-to-end from defining problems to generating architecture plans, implementation, testing, and delivery.
- Writes clear technical proposals and RFCs; able to drive engineering alignment across teams through written design docs and verbal discussion.
- Experience in various parts of SRE / reliability engineering / incident management at a large-scale company.
- Experience in platform/infrastructure engineering in related disciplines such as compute platform, software networking, online storage, developer platform, and observability.
- Experience mentoring and leading teams on projects while remaining hands-on and technical.
Salary : $232,000 - $258,000