4 Sr. Site Reliability Engineer (SRE) Jobs in McLean, VA

Site Reliability Engineer V (McLean, VA or Sunnyvale, CA)

ID.me

Mc Lean, VA | Full Time

$168k-197k (estimate)

3 Months Ago

Site Reliability Engineer (SRE)

STEAMPUNK

Mc Lean, VA | Other

$138k-152k (estimate)

1 Month Ago

Site Reliability Engineer

INADEV

Mc Lean, VA | Full Time

$130k-146k (estimate)

2 Weeks Ago

Senior Network Reliability Engineer

Intelsat US LLC

Mc Lean, VA | Full Time

$114k-135k (estimate)

2 Months Ago

Site Reliability Engineer V (McLean, VA or Sunnyvale, CA)

ID.me Mc Lean, VA

$168k-197k (estimate)

Full Time | IT Outsourcing & Consulting | 3 Months Ago

ID.me is Hiring a Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) Near Mc Lean, VA

The Site Reliability Engineer V (SRE) will combine software and systems engineering to build and run distributed, fault-tolerant systems at scale. SREs ensure our services have the appropriate reliability and uptime to protect and promote our customers’ experience.

Note that candidates must be located in the Washington DC or San Francisco Bay area as this role requires an onsite presence.

Responsibilities

Design, build, implement, and maintain platform tooling across the entire product surface area to improve the availability, scalability, latency, and efficiency of ID.me services
Manage end-to-end distributed systems availability and ensure high performance of ID.me applications
Build automation solutions to prevent problem recurrence
Build visibility into SLIs, SLOs, SLAs, and dependency metrics to manage operational burden and systems reporting
Design, build, implement, and maintain an observability ecosystem to provide visibility across the ID.me platform services and applications
Proactively identify risks and develop engineering processes and/or tooling to reduce availability risk
Evangelize best practices and mentor service owners on reliability, resiliency, and scalability for new and existing services and/or features
Participate in an on-call rotation and hold retrospective root cause analysis meetings, focusing on identifying remediations and product resiliency opportunities

Minimum Qualifications

At least 7 years of experience working in medium or large scale production systems
The ability to take a systematic approach to analyzing, troubleshooting, and diagnosing system problems in order to locate and resolve them
Experience in software development or systems engineering with code
Experience designing for scale and automation-forward ecosystems and solutions
Possess a breadth of engineering skills with an interest in service reliability, automation, monitoring, and capacity planning
Understanding of modern application architecture (e.g. microservices, event-driven architecture (EDA))
Experience with APM services and solutions (e.g. OpenTelemetry, Honeycomb, New Relic, Dynatrace, AppDynamics, Datadog)
Experience with time-series observability solutions (e.g. InfluxDB, Prometheus, Grafana)
Experience with scaled indexed logging solutions (e.g. Splunk, Elasticsearch, OpenSearch)
Experience running and operating Ruby on Rails applications and infrastructure
Deep knowledge of major cloud service providers and solutions (Amazon Web Services, Google Cloud Platform, Microsoft Azure)
Previous experience working within a site reliability engineering culture (e.g. improving reliability through systems engineering automation, chaos testing, synthetics, and process improvement)
Experience designing, building, implementing, and operating distributed systems and cloud infrastructure at scale
Experience with container computing and container orchestration (e.g. proprietary systems such as Google Kubernetes Engine (GKE), multi-cloud solutions such as Kubernetes, or Nomad)
Experience with configuration management systems (e.g. Ansible, Puppet, Chef, SaltStack, Consul)
Experience with virtual networking (e.g. cloud networking, service mesh, SDN)
Experience in security automation (e.g. cloud proprietary solutions such as Google Secret Manager or Vault)
Experience with infrastructure-as-code (e.g. Terraform)
Strong written communication skills
Ability to work in an asynchronous environment
Experience in supporting a 24/7 operational infrastructure including on-call rotations

Preferred Qualifications

Must have an obsession for building quality products
Ability to thrive when there are changing priorities and shifting of gears
Strong oral and written communication skills
Must be a team player with a strong, self-managing work ethic
Must be a self-starter with a passion for platform engineering, learning and continuous improvement

Day to Day Life

Ensure observability tooling and integrations are providing telemetry and logging statistics across the entirety of ID.me systems and applications
Enable the Engineering organization to identify and triage operational issues, empowering teams to own and operate autonomously
Contribute to defining and executing the Observability Roadmap for maintaining and modernizing cloud-native observability within the organization
Integrate telemetry and logging frameworks into the cloud platform
Evaluate new and existing observability technologies to ensure capabilities are inclusive of black box solutions (e.g. COTS) as well as Engineering-created software
Manage distributed system and application scaling activity directly (as applicable) as well as in an advisory capacity on behalf of Engineering development teams

Vision: To be the world's leading digital identity network empowering people to control their own information and to prove their credentials across all channels: online, call center, and in-person.

Mission: To make the world a more trusted place by delivering the highest level of security with the least amount of friction at the lowest possible cost.

People: We have an audacious mission. We aim to fix the identity layer of the internet. Billions of people will live better lives with more trust and convenience thanks to ID.me. We are like Special Forces. We take on the most difficult challenges with amazing teammates.

At ID.me, we believe that an in-office culture fosters professional growth and development, mentorship, collaboration, and accelerated innovation. This position will be in-office based at one of our locations in either McLean, VA or Sunnyvale, CA. Working in an office together allows our culture to thrive and our team members to establish real connections with their coworkers and the opportunity for lifelong friendships. Our work is critical to protecting online identity and we’re confident that working together is how we’ll change the world.

Job Summary

JOB TYPE: Full Time
INDUSTRY: IT Outsourcing & Consulting
SALARY: $168k-197k (estimate)
POST DATE: 02/24/2024
EXPIRATION DATE: 07/09/2024
WEBSITE: id.me
HEADQUARTERS: MC LEAN, VA
SIZE: 100 - 200
FOUNDED: 2011
CEO: BLAKE MICHAEL HALL
REVENUE: $10M - $50M

Related Companies

Staff Product Manager - Loyalty

ID.me

Full Time

$123k-152k (estimate)

1 Week Ago

Account Executive, Communities

ID.me

Full Time

$92k-125k (estimate)

1 Week Ago

Senior IT Engineer - IAM

ID.me

Full Time

$101k-122k (estimate)

1 Week Ago

The job skills required for Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) include Root Cause Analysis, Continuous Improvement, Systems Engineering, Written Communication, Team Development, and Networking, among others. Having related skills and expertise gives you an advantage when applying for this role and can affect the salary you are offered. Below are job openings related to the skills required by Site Reliability Engineer V (McLean, VA or Sunnyvale, CA). Select any job title you are interested in to view its requirements.

For the skill of Root Cause Analysis

Quality Assurance Manager

Actalent

Full Time

$126k-159k (estimate)

3 Weeks Ago

For the skill of Continuous Improvement

Associate Engineer

kennedyjc

Full Time

$93k-113k (estimate)

1 Month Ago

For the skill of Systems Engineering

Systems Engineer, Senior (DDG-1000)

Basic Commerce & Industries Inc

Full Time

$119k-144k (estimate)

2 Weeks Ago

The following is the career advancement route for Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) positions, which can be used as a reference for future career planning. A Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) can be promoted into senior positions such as Corrosion Engineer II, which are expected to handle more key tasks and are paid a higher salary than an ordinary Site Reliability Engineer V (McLean, VA or Sunnyvale, CA). You can explore the career advancement options for a Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) below and select a title you are interested in to get hiring information.

Site Reliability Engineer V (McLean, VA or Sunnyvale, CA)

Corrosion Engineer II

Reliability Engineer III

Corrosion Engineer III

Reliability Engineer IV

If you are interested in becoming a Site Reliability Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Site Reliability Engineer for your reference.

Step 1: Understand the job description and responsibilities of a Site Reliability Engineer.

Quotes from people on Site Reliability Engineer job description and responsibilities

Similarly to the point above, a site reliability engineer can expect to spend time fixing support escalation cases.

03/16/2022: Little Rock, AR

More times than not, site reliability engineers will need to take on-call responsibilities.

01/31/2022: Lexington, KY

Focuses on the reliability of behind-the-scenes systems that help make other teams' jobs more efficient.

02/24/2022: Tuscaloosa, AL

Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation.

02/25/2022: Manchester, NH

Step 2: Knowing the best tips for becoming a Site Reliability Engineer can help you explore the needs of the position and prepare the job-related knowledge well ahead of time.

Career tips from people on Site Reliability Engineer jobs

The objective was to ensure service reliability and availability within operations management.

12/28/2021: Lima, OH

Step 3: View the best colleges and universities for Site Reliability Engineers.

Butler University

Carroll College

Cooper Union

High Point University

Princeton University

Providence College