You haven't searched anything yet.
The manufacturing quality data engineering team is a high impact, high priority and high visibility team that is laser focus on safety critical issues and expanding critical services to Gigafactories worldwide. Within Tesla's Vehicle Engineering organization, you will have the data gold mines across design, manufacturing and vehicle data sources, enabling you to design, create and deploy innovative new data services, automation and machine learning tools into production use globally with physical impact that you can see. In this role, you will focus on implementing state of the art monitoring system to improve availability, latency and system health for services worldwide.
Develop and monitor service availability, latency and system health with tools such as Prometheus, Grafana, Splunk, Kibana.
Manage on-call schedule of team members with critical alerts configured with OpsGenie.
Curate, evaluate and enforce software reliability best practices.
Develop and maintain Jenkins CI/CD testing stage for build validation.
Specify Service Level Objectives (SLOs) and roadmap to achieve targets, balance feature development speed and reliability with well-defined service-level objectives.
Refine and enforce incident management response procedure.
Manage postmortem and drive corrective actions to closure.
Develop and maintain Kubernetes deployments.
Proposing new ways to improve observability and alerting for the MLOps/DevOps infrastructure and app components.
Review, monitor and resolve system vulnerabilities.
Partner with developers and engineers to improve service availability through rigorous testing and release procedures.
Master's degree or higher in quantitative discipline (e.g. Computer Science, Mathematics, Physics, Electrical Engineering, Statistics, Industrial Engineering) or the equivalent in experience and evidence of exceptional ability
7 years of work experience in Site Reliability Engineering
Strong attention to details and good foresight to envision surrounding risks in the future.
Extensive experience in deploying applications and services to Kubernetes with Docker, Jenkins and Git
Knowledge of various infrastructure and monitoring technologies (e.g. Hashicorp Vault, Helm, Prometheus, Grafana, Splunk, Kibana, Airflow)
Able to work under pressure while collaborating and managing competing demands with tight deadlines
Tesla is an Equal Opportunity / Affirmative Action employer committed to diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, age, national origin, disability, protected veteran status, gender identity or any other factor protected by applicable federal, state or local laws.
Tesla is also committed to working with and providing reasonable accommodations to individuals with disabilities. Please let your recruiter know if you need an accommodation at any point during the interview process.
For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here). Please contact accommodationrequest@tesla.com for additional information or to request accommodations.
Privacy is a top priority for Tesla. We build it into our products and view it as an essential part of our business. To understand more about the data we collect and process as part of your application, please view our Tesla Talent Privacy Notice.
Full Time
Retail
$120k-140k (estimate)
03/25/2023
07/02/2023
tesla.com
BRENTWOOD, TN
>50,000
2003
Private
SOURABH BIYANI
$10B - $50B
Retail