6 Site Reliability Engineer (SRE) Jobs in Mc Lean, VA

STEAMPUNK
Mc Lean, VA | Full Time
$124k-139k (estimate)
1 Week Ago
Easy Dynamics
Mc Lean, VA | Full Time
$92k-109k (estimate)
1 Week Ago
STEAMPUNK
Mc Lean, VA | Other
$138k-152k (estimate)
1 Week Ago
INADEV
Mc Lean, VA | Full Time
$130k-146k (estimate)
Just Posted
Intelsat US LLC
Mc Lean, VA | Full Time
$114k-135k (estimate)
2 Months Ago
Capital One
Mc Lean, VA | Full Time
$124k-149k (estimate)
1 Week Ago
Site Reliability Engineer (SRE)
STEAMPUNK Mc Lean, VA
$138k-152k (estimate)
Other | Business Services 1 Week Ago

STEAMPUNK is Hiring a Remote Site Reliability Engineer (SRE)

Overview

Design. Disrupt. Repeat. Be an agent of change on a team committed to achieving client-focused, mission-driven excellence. Steampunk is looking for an experienced Site Reliability Engineer with an appetite for taking on new challenges.

Who We Are

Steampunk is the explosive collision of human-centered design and traditional government contracting. An employee-owned company with a startup mindset and time-tested approaches tailored for the federal government, we’re passionate about creating solutions that are impactful, practical, scalable, and most importantly, that meet our clients’ ever-changing needs.

At Steampunk, we believe in disrupting the status quo and setting the pace in the ecosystem of government contractors, while repurposing tried-and-true methodologies. We believe in empowering our people to find creative solutions to intractable problems. We believe the best environment in which to grow and thrive is outside our comfort zone. While good design makes for a good product, we believe human-centered design makes for an excellent one.

We also believe effective teams are powered by diverse perspectives, backgrounds, and experiences. To that end, Steampunk is an equal opportunity employer committed to promoting diversity of race, gender, sexual orientation, religion, ethnicity, national origin, disability status, and protected veteran status, amongst our ranks. Additionally, we participate in the E-Verify program.

Why Steampunk?

Our people are the very core of what we do; their expertise and hunger for new and exciting challenges fuel our relentless pursuit of mission success. As part of our team of “Punks,” you’ll test the status quo, explore new boundaries, and set the bar high for how government clients expect to engage with contractors.

Because we value our employees’ work/life balance (and believe those who work hard deserve to play hard), we offer a very competitive benefits package, including telework/flex scheduling, health/dental with orthodontics/vision insurance upon hire, paid time off with a sell-back benefit and carryover option, 11 Federal Holidays, 100% paid military leave, 100% 401(k) plan match upon hire, professional development/education reimbursement, all flexible spending accounts, and more.

Contributions

As a Steampunk Site Reliability Engineer (SRE), you will be responsible for working with program development teams, infrastructure and platform services teams, and traditional operations and maintenance teams to embrace and embody a shared responsibility for the reliability of an organization’s applications and infrastructure. As an SRE, your primary responsibility is to combine aspects of software engineering with traditional operations to maintain and improve the reliability, availability, and performance of cloud, infrastructure, and large-scale software systems and services while minimizing downtime and mitigating potential failures. You will deliver a wide variety of responsibilities in this role:

  • Infrastructure Optimization: Conduct in-depth analyses of infrastructure, identifying areas for improvement in terms of performance, scalability, and resource utilization. Collaborate with development and operations teams to implement enhancements, utilizing software engineering and/or infrastructure-as-code principles to streamline deployment processes and ensure consistency across environments.
  • Reliability Metrics and Reporting: Define and implement key reliability metrics, service-level objectives (SLOs), and service-level indicators (SLIs) to measure and report on the health of our systems. Establish monitoring and alerting mechanisms to proactively identify potential issues before they impact users.
  • Automation and Tooling: Design and implement automation tools to reduce manual toil, streamline repetitive tasks, and enhance overall operational efficiency. Leverage software development techniques to create robust, scalable tooling that supports our reliability goals, and collaborate with development teams to integrate reliability features into the development lifecycle.
  • Performance Optimization using Software Development Techniques: Collaborate with software development teams to optimize the performance and resilience of services through code improvements, architectural enhancements, and performance tuning. Integrate automated testing and profiling into the development pipeline to identify and address performance bottlenecks early in the development lifecycle.
  • Capacity Planning and Scaling: Collaborate with infrastructure teams to forecast capacity requirements, ensuring our systems can seamlessly scale to meet growing user demands. Implement strategies for auto-scaling and load balancing to optimize resource utilization and enhance overall system stability.
  • Collaboration and Training: Work closely with development teams to embed reliability best practices into the software development process. Provide mentorship and training to cross-functional teams on SRE principles, encouraging a shared responsibility for the reliability of our services.
  • Incident Management: Lead the development and implementation of incident response procedures, ensuring timely and effective resolution of issues to minimize impact on users. Foster a culture of continuous improvement by conducting thorough post-incident reviews, identifying root causes, and implementing preventative measures.
  • Infrastructure and Systems Monitoring: Observe and monitor systems to gain insight into system performance, health, and availability, and into what is happening internally in the system. Understand what to monitor based on the system(s) you are managing, where to store the monitoring data, who can access historical monitoring data, and how to look at the data to make determinations about future actions.

Qualifications

Required:

  • Bachelor’s degree and at least 5 years of IT experience and 2 years of SRE experience
  • Eligible to obtain and maintain a government security clearance
  • Knowledge and experience with Agile and DevSecOps methodologies
  • Experience in System Engineering in one or more areas including telecommunications concepts, computer languages, operating systems, database/Database Management System (DBMS), and middleware

Experience with the following software/tools:

  • Source code and binary repository products and techniques (GitHub, GitLab, BitBucket, Artifactory, Nexus, etc.)
  • Infrastructure and Cloud Management tools such as AWS CloudWatch
  • Log Management and Analysis tools such as Splunk
  • Automation and Configuration Management tools such as Terraform or Puppet

Preferred:

  • Knowledge and experience with New Relic and/or other AIOps platforms
  • Programming skills in JavaScript, Ruby, and/or Go
  • Experience with Nginx, HAProxy, Docker, Kubernetes, or similar technologies
  • Experience with messaging systems, collaboration software, application-based firewall and proxy server(s), and operating systems
  • Experience with Linux and Windows operating systems, along with scripting tools and techniques such as Bash, CSH, KSH, ZSH, etc. and/or PowerShell
  • Experience with monitoring and alerting tools such as Prometheus, Grafana, and Datadog

About Steampunk

Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health, and DoD sectors. Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges. As an employee-owned company, we focus on investing in our employees to enable them to do the greatest work of their careers, and on rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit http://www.steampunk.com.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program. 

Job Summary

JOB TYPE

Other

INDUSTRY

Business Services

SALARY

$138k-152k (estimate)

POST DATE

04/17/2024

EXPIRATION DATE

04/15/2025

WEBSITE

steampunk.com

HEADQUARTERS

Mclean, VA

SIZE

<25

INDUSTRY

Business Services


STEAMPUNK
Other
$64k-82k (estimate)
3 Days Ago
STEAMPUNK
Other
$227k-283k (estimate)
5 Days Ago
STEAMPUNK
Remote | Other
$72k-89k (estimate)
1 Week Ago

Skills required for a Site Reliability Engineer (SRE) include Analysis, Futures, Continuous Improvement, Systems Engineering, Insight, and Agile, among others. Related skills and expertise give you an advantage when applying for a Site Reliability Engineer (SRE) position; they set you apart and can affect the salary you are offered. Below are job openings related to the skills required of a Site Reliability Engineer (SRE). Select any job title you are interested in to view its requirements.

For the skill of Analysis
U.S. Army Intelligence and Security Command
Full Time
$111k-148k (estimate)
Just Posted
For the skill of Futures
Sobotranz
Full Time
$45k-59k (estimate)
Just Posted
For the skill of Continuous Improvement
MFI
Full Time
$62k-83k (estimate)
5 Days Ago

The following is the career advancement route for Site Reliability Engineer (SRE) positions, which can be used as a reference for career path planning. A Site Reliability Engineer (SRE) can be promoted into a senior position such as Corrosion Engineer II, which is expected to handle more key tasks and pays a higher salary than a regular Site Reliability Engineer (SRE) role. You can explore career advancement for a Site Reliability Engineer (SRE) below and select a title you are interested in to get hiring information.

If you are interested in becoming a Site Reliability Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Site Reliability Engineer for your reference.

Step 1: Understand the job description and responsibilities of a Site Reliability Engineer.

Quotes from people on Site Reliability Engineer job description and responsibilities

Similarly to the point above, a site reliability engineer can expect to spend time fixing support escalation cases.

03/16/2022: Little Rock, AR

More times than not, site reliability engineers will need to take on-call responsibilities.

01/31/2022: Lexington, KY

Focuses on the reliability of behind-the-scenes systems that help make other teams' jobs more efficient.

02/24/2022: Tuscaloosa, AL

Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation.

02/25/2022: Manchester, NH

Step 2: Knowing the best tips for becoming a Site Reliability Engineer can help you explore the needs of the position and prepare the job-related knowledge well ahead of time.

Career tips from people on Site Reliability Engineer jobs

The objective was to ensure service reliability and availability within operations management.

12/28/2021: Lima, OH

Step 3: View the best colleges and universities for Site Reliability Engineers.

Butler University
Carroll College
Cooper Union
High Point University
Princeton University
Providence College