Demo

Lead Observability Engineer

Skyline Technology Solutions, LLC
Glen Burnie, MD Full Time
POSTED ON 12/16/2025
AVAILABLE BEFORE 2/16/2026

Our New Teammate

The Lead Observability Engineer serves as the organization’s technical authority for monitoring, telemetry, and reliability insights across all platforms and services. This role owns the architecture, implementation, and operation of the observability ecosystem—including metrics, logging, tracing, dashboards, alerting, and service-level indicators—ensuring that engineering teams have the visibility required to deliver resilient, high-performing systems.

The position combines deep platform engineering expertise with the strategic responsibilities of defining telemetry standards, guiding reliability practices, and driving the adoption of modern observability methodologies. The Lead Observability Engineer partners closely with application, platform, and security teams to establish scalable instrumentation frameworks, operationalize SLOs, and ensure data quality and consistency across environments.

This role requires technical leadership, strong architectural judgment, and the ability to translate complex system behavior into actionable insights that elevate operational excellence across the organization.

You can expect to spend your time accomplishing the following:

  • 50% of the time on Objective 1: Observability Platform Ownership
  • 25% of the time on Objective 2: Standards, Instrumentation, and Reliability Practices
  • 25% of the time on Objective 3: Cross-Functional Technical Leadership

 

Job Responsibilities – What to Expect

  • Architect, implement, and operate the full observability stack, including metrics, logging, tracing, dashboards, alerting, and telemetry pipelines.
  • Maintain and optimize Grafana, Loki, Tempo, exporters, agents, and related services to ensure reliability, performance, and scalability.
  • Ensure high-quality, consistent telemetry across all environments.
  • Define organizational standards for instrumentation, dashboards, alerts, SLIs, and SLOs.
  • Partner with engineering teams to guide adoption of reliability and observability best practices.
  • Improve signal-to-noise ratio in alerts and evolve incident visibility and analysis frameworks.
  • Collaborate with Platform, Application, Security, and Network Engineering teams to ensure observability is embedded into architecture and operational workflows.
  • Provide expert guidance on system behavior, failure modes, performance patterns, and telemetry-driven insights.

 

Your Knowledge & Expertise

  • Bachelor’s degree in Computer Science, Networking, Telecommunications, or related technical field 
  • Professional certifications Preferred: CISSP, CISM, PMP, ITIL, AWS/Azure
  • 8 years of experience in systems engineering, SRE, platform engineering, or infrastructure operations roles in large-scale, high-availability environments 
  • Observability engineering: metrics, logs, traces, dashboards, alerting, SLOs/SLIs, Linux systems engineering, OS tuning, benchmarking, and troubleshooting at scale 
  • Experience with log aggregation and search systems (Splunk, ElasticSearch), message brokers (RabbitMQ, Kafka), and system monitoring tools (Zabbix, Grafana) 
  • Proven hands-on experience operating Linux systems (RHEL, Ubuntu, CentOS) at scale, including performance tuning, benchmarking, hardening, and troubleshooting 
  • Demonstrated experience with observability tooling such as Splunk, ElasticSearch, Graphite, Zabbix, log pipelines, and metrics systems 
  • Proficiency with Kubernetes, Docker, CI/CD, and infrastructure automation frameworks such as Ansible, Chef, or Salt 
  • Background in security operations or tooling such as MS Defender, Nessus, Carbon Black, CrowdStrike, IAM, or FIM solutions
  • Experience designing or supporting disaster recovery, high-availability, and SLA-driven systems for mission-critical services 
  • Direct experience with distributed systems, Kafka-based architectures, or microservices environments
  • Strong familiarity with compliance frameworks (SOC2, PCI, HITRUST, FedRAMP, CONMON, C5, GDPR) and implementing technical controls in production environments 
  • Demonstrated ability to collaborate across cross-functional engineering, security, and compliance teams and lead technical initiatives without direct authority
  • Experience supporting or designing multi-datacenter infrastructure or hybrid cloud environments
  • Prior leadership experience in SRE, platform engineering, or cloud operations teams within enterprise-scale organizations 

 

Benefits Included:

  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • FSA Plan
  • Paid Time Off
  • 401K Retirement Savings Plan
  • Training & Tuition Assistance
  • Disability & Life Insurance

Salary : $160,000 - $175,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Lead Observability Engineer?

Sign up to receive alerts about other jobs on the Lead Observability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$70,029 - $91,788
Income Estimation: 
$88,426 - $116,821
Income Estimation: 
$95,555 - $145,324
Income Estimation: 
$92,664 - $115,984
Income Estimation: 
$114,731 - $155,985
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$172,688 - $210,712
Income Estimation: 
$170,589 - $211,671
Income Estimation: 
$178,619 - $225,190
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Not the job you're looking for? Here are some other Lead Observability Engineer jobs in the Glen Burnie, MD area that may be a better fit.

  • PGTEK George, MD
  • Observability Engineer (OpsRamp) - Secret clearance You will be part of a larger technical team working as an Observability Engineer in an OpsRamp environm... more
  • 5 Days Ago

  • Transwestern and Careers Baltimore, MD
  • Four dynamic, integrated companies make up the Transwestern enterprise, giving us the perspective to think broadly, deeply and creatively about commercial ... more
  • 17 Days Ago

AI Assistant is available now!

Feel free to start your new journey!