Demo

Platform Engineer - Reliability

Squarepoint
Houston, TX Full Time
POSTED ON 5/22/2026
AVAILABLE BEFORE 6/19/2026

Role Overview

As a Platform Reliability Specialist at Squarepoint, you will play a critical role in ensuring the stability, performance, and day to day reliability of the shared platform services. You will work with a diverse group of stakeholders, including developers, researchers, and infrastructure teams, to maintain highly reliable systems and drive proactive improvements.

You will be responsible for reducing operational toil, improving response and learning from production issues, and evolving our reliability practices. This role blends software engineering, platform ownership, operational ownership, and long‑term architectural thinking to enhance our production systems. While you may have deep expertise in one or more areas, you will contribute across the platform.


Key areas include:

  • Operations & Toil Reduction: Own and improve day‑to‑day platform operations by streamlining workflows and enhancing on‑call ergonomics through better automations and runbooks
  • Reliability Engineering & Hardening: Work with service owners to apply engineering principles to improve resilience and performance: harden critical services against degradation and outages.
  • Tooling & Automation: Build and maintain platform tools, automation, and GitOps workflows that make it easy for teams to deploy, operate, and observe their services with minimal friction and operational overhead.
  • Knowledge & Standards: Capture and share reliability knowledge through documentation, runbooks, and post‑incident reviews. Help define and evolve reliability standards and best practices across the platform.


Required qualifications

  • 4 years in SRE, Production Engineering, or Reliability Engineering roles with direct ownership of production systems.
  • Experience with system administration and troubleshooting (Linux, Bash, containers).
  • Software development experience with Python, version control (Git), and CI/CD systems.
  • Hands‑on experience with observability systems including metrics, tracing, log pipelines, and alert design.
  • Demonstrated experience running systems at scale, including performance tuning, HA/DR architectures, and resilience engineering.


Nice to have

  • Expertise in a modern observability stack (e.g., Prometheus, Grafana, ELK, VictoriaMetrics).
  • Experience operating enterprise platform software such as Kubernetes clusters, GitLab at scale, or Slurm environments.
  • Familiarity with messaging systems (Kafka/RabbitMQ), service discovery (Consul), and databases (PostgreSQL, ClickHouse, Redis).
  • Experience authoring runbooks, running failure/chaos experiments, and participating in DR exercises.
  • Infrastructure automation and configuration management experience (e.g., Ansible, Terraform, Puppet).

Salary.com Estimation for Platform Engineer - Reliability in Houston, TX
$106,250 to $120,002
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Platform Engineer - Reliability?

Sign up to receive alerts about other jobs on the Platform Engineer - Reliability career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$104,754 - $125,215
Income Estimation: 
$134,206 - $155,125
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Squarepoint

  • Squarepoint Houston, TX
  • Kubernetes is the backbone that powers Squarepoint’s research and trading workflows. You’ll design and scale Kubernetes‑native orchestration used by hundre... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Platform Engineer - Reliability jobs in the Houston, TX area that may be a better fit.

  • DYNAMIS POWER SOLUTIONS LLC Houston, TX
  • Reliability Engineer Job Description Department: Manufacturing Job Status: Full Time FLSA Status: Salary Exempt Reports To: Quality Manager Location: Houst... more
  • 1 Day Ago

  • TwinRoots Houston, TX
  • The Company TwinRoots is a performance partner for industrial operations dedicated to empowering asset-intensive industries to overcome operational challen... more
  • 1 Day Ago

AI Assistant is available now!

Feel free to start your new journey!