Site Reliability Engineer V (McLean, VA or Sunnyvale, CA)
ID.me Mc Lean, VA
$168k-197k (estimate)
Full Time | IT Outsourcing & Consulting 3 Months Ago

ID.me is hiring a Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) near McLean, VA

The Site Reliability Engineer V (SRE) will combine software and systems engineering to build and run distributed, fault-tolerant systems at scale. SREs ensure our services have the appropriate reliability and uptime to protect and promote our customers’ experience.

Note that candidates must be located in the Washington DC or San Francisco Bay area as this role requires an onsite presence.

Responsibilities

  • Design, build, implement, and maintain platform tooling across the entire product surface area to improve the reliability, availability, scalability, latency, and efficiency of ID.me services
  • Manage end-to-end distributed systems availability and ensure high performance of ID.me applications
  • Build automation solutions to prevent problem recurrence
  • Build visibility into SLIs, SLOs, SLAs, and dependency metrics to manage operational burden and systems reporting
  • Design, build, implement, and maintain an observability ecosystem to provide visibility across the ID.me platform services and applications
  • Proactively identify risks and develop engineering processes and/or tooling to reduce availability risk
  • Evangelize best practices and mentor service owners on reliability, resiliency, and scalability for new and existing services and/or features
  • Participate in an on-call rotation and hold retrospective root cause analysis meetings, focusing on identifying remediations and product resiliency opportunities

Minimum Qualifications

  • At least 7 years of experience working in medium or large scale production systems
  • The ability to take a systematic approach to analyzing, troubleshooting, and diagnosing system problems in order to locate and resolve them
  • Experience in software development or systems engineering with code
  • Experience designing for scale and automation-forward ecosystems and solutions
  • Possess a breadth of engineering skills with an interest in service reliability, automation, monitoring, and capacity planning
  • Understanding of modern application architecture (e.g. microservices, EDA)
  • Experience with APM services and solutions (e.g. OpenTelemetry, Honeycomb, New Relic, Dynatrace, AppDynamics, Datadog)
  • Experience with time-series observability solutions (e.g. InfluxDB, Prometheus, Grafana)
  • Experience with scaled indexed logging solutions (e.g. Splunk, Elasticsearch, OpenSearch)
  • Experience running and operating Ruby on Rails applications and infrastructure
  • Deep knowledge of major cloud service providers and solutions (e.g. Amazon Web Services, Google Cloud Platform, Microsoft Azure)
  • Previous experience working within site reliability engineering culture (e.g. improving reliability through systems engineering automation, chaos testing, synthetics, and process improvement)
  • Experience designing, building, implementing, and operating distributed systems and cloud infrastructure at scale
  • Experience with container computing and container orchestration (e.g. proprietary systems such as Google Kubernetes Engine (GKE), multi-cloud solutions such as Kubernetes, or Nomad)
  • Experience with configuration management systems (e.g. Ansible, Puppet, Chef, SaltStack, Consul)
  • Experience with virtual networking (e.g. cloud networking, service mesh, SDN)
  • Experience in security automation (e.g. cloud-proprietary solutions such as Google Secret Manager, or Vault)
  • Experience with infrastructure-as-code (e.g. Terraform)
  • Strong written communication skills
  • Ability to work in an asynchronous environment
  • Experience in supporting a 24/7 operational infrastructure including on-call rotations

Preferred Qualifications 

  • Must have an obsession for building quality products 
  • Ability to thrive when there are changing priorities and shifting of gears
  • Strong oral and written communication skills
  • Must be a team player with a strong, self-managing work ethic
  • Must be a self-starter with a passion for platform engineering, learning and continuous improvement

Day to Day Life

  • Ensure observability tooling and integrations are providing telemetry and logging statistics across the entirety of ID.me systems and applications
  • Enable the Engineering organization to identify and triage operational issues, empowering teams to own and operate autonomously
  • Contribute to defining and executing on the Observability Roadmap in maintaining and modernizing cloud-native observability within the organization
  • Integrate telemetry and logging frameworks into the cloud platform
  • Evaluate new and existing observability technologies to ensure capabilities are inclusive of black box solutions (e.g. COTS) as well as Engineering-created software
  • Manage distributed system and application scaling activity directly (as applicable) as well as in an advisory capacity on behalf of Engineering development teams

Vision: To be the world's leading digital identity network empowering people to control their own information and to prove their credentials across all channels: online, call center, and in-person.

Mission: To make the world a more trusted place by delivering the highest level of security with the least amount of friction at the lowest possible cost.

People: We have an audacious mission. We aim to fix the identity layer of the internet. Billions of people will live better lives with more trust and convenience thanks to ID.me. We are like Special Forces. We take on the most difficult challenges with amazing teammates.

At ID.me, we believe that an in-office culture fosters professional growth and development, mentorship, collaboration, and accelerated innovation. This position will be in-office based at one of our locations in either McLean, VA or Sunnyvale, CA. Working in an office together allows our culture to thrive and our team members to establish real connections with their coworkers and the opportunity for lifelong friendships. Our work is critical to protecting online identity and we’re confident that working together is how we’ll change the world. 

Job Summary

Job Type: Full Time
Industry: IT Outsourcing & Consulting
Salary: $168k-197k (estimate)
Post Date: 02/24/2024
Expiration Date: 07/09/2024
Website: id.me
Headquarters: McLean, VA
Size: 100 - 200
Founded: 2011
CEO: Blake Michael Hall
Revenue: $10M - $50M


The skills required for the Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) role include Root Cause Analysis, Continuous Improvement, Systems Engineering, Written Communication, Team Development, and Networking, among others. Related skills and expertise give you an advantage when applying for this role and can affect the salary you are offered. Below are job openings related to the skills this role requires.

For the skill of Root Cause Analysis
Actalent
Full Time
$126k-159k (estimate)
3 Weeks Ago
For the skill of Continuous Improvement
kennedyjc
Full Time
$93k-113k (estimate)
1 Month Ago
For the skill of Systems Engineering
Basic Commerce & Industries Inc
Full Time
$119k-144k (estimate)
2 Weeks Ago

The following is the career advancement route for Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) positions, which can serve as a reference for future career planning. A Site Reliability Engineer V (McLean, VA or Sunnyvale, CA) can be promoted into a senior position such as Corrosion Engineer II, which is expected to handle more key tasks and to be paid a higher salary than an ordinary Site Reliability Engineer V (McLean, VA or Sunnyvale, CA).

If you are interested in becoming a Site Reliability Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Site Reliability Engineer for your reference.

Step 1: Understand the job description and responsibilities of a Site Reliability Engineer.

Quotes from people on Site Reliability Engineer job description and responsibilities

Similarly to the point above, a site reliability engineer can expect to spend time fixing support escalation cases.

03/16/2022: Little Rock, AR

More times than not, site reliability engineers will need to take on-call responsibilities.

01/31/2022: Lexington, KY

Focuses on the reliability of behind-the-scenes systems that help make other teams' jobs more efficient.

02/24/2022: Tuscaloosa, AL

Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation.

02/25/2022: Manchester, NH

Step 2: Learn the best tips for becoming a Site Reliability Engineer, which can help you explore the needs of the position and prepare the job-related knowledge well ahead of time.

Career tips from people on Site Reliability Engineer jobs

The objective was to ensure service reliability and availability within operations management.

12/28/2021: Lima, OH

Step 3: View the best colleges and universities for Site Reliability Engineer.

Butler University
Carroll College
Cooper Union
High Point University
Princeton University
Providence College