Site Reliability / Platform Engineering Roles
Company: The Phoenix Group
Location: New York City (Hybrid or Flexible)
We work with a range of NYC-based organizations across fintech, asset management, SaaS, and data-driven companies that operate large-scale, production-critical systems.
This posting reflects the types of site reliability and platform engineering roles we consistently see across these teams, rather than a single opening. These environments value system stability, automation, and operational discipline, and engineers play a central role in how platforms scale and perform over time.
What These Roles Typically Involve
Across these teams, site reliability and platform engineers are commonly responsible for:
- Designing and maintaining reliable, scalable infrastructure supporting production systems
- Improving observability, monitoring, and alerting to ensure system health and fast incident response
- Building automation and tooling to reduce operational overhead and manual intervention
- Partnering closely with software engineering teams to improve system reliability and deployment practices
- Participating in incident response, root cause analysis, and long-term reliability improvements
Experience That Tends to Translate Well
Professionals who align well with these roles often bring:
- Experience operating and supporting production systems in cloud-based environments such as AWS, GCP, or Azure
- A strong background in Linux-based systems and networking fundamentals
- Hands-on experience with infrastructure-as-code, CI/CD pipelines, and automation tools
- Comfort working with monitoring, logging, and observability platforms
- A pragmatic approach to reliability that balances engineering rigor with business needs
Backgrounds We Commonly See
Many engineers placed into these types of roles come from:
- SRE, platform, or infrastructure teams within fintech, SaaS, or data-intensive organizations
- Engineering teams responsible for highly available, customer-facing systems
- Environments where reliability, uptime, and performance are treated as first-class concerns
- Teams where platform engineers work closely with senior software and infrastructure leadership
What Differentiates These Environments
Across these organizations, reliability engineering is embedded into how systems are designed and operated, not treated as a reactive support function. Engineers are expected to take ownership of platform health, influence architectural decisions, and help shape how teams scale safely and efficiently.
Compensation
Compensation varies by organization and scope, but typically reflects senior-level responsibility and the critical nature of the systems being supported.
How to Start a Conversation
If this type of site reliability or platform engineering work aligns with what you are doing now, or with where you want to take your career next, you are welcome to apply or reach out directly. We are happy to share more context about specific teams and environments in an initial conversation.