What are the responsibilities and job description for the Sr. Platform Engineer (Hybrid Remote) position at TECHEAD?
This position is 3-4 days per week remote, 1-2 on site.
We are looking for a Senior Platform Engineer with a strong software engineering foundation to help build, scale, and maintain our internal developer platform. In this role, you won't just be managing infrastructure; you will be treating Infrastructure as Code (IaC) and platform tooling as a product.
About the Role
The ideal candidate bridges the gap between development and operations, leveraging deep expertise in Terraform and Kubernetes to create secure, scalable, and self-service infrastructure that empowers our engineering teams to ship code faster and more reliably.
Responsibilities
- Platform as a Product: Design, build, and maintain the underlying infrastructure platform, focusing on developer experience (DX), automation, and self-service capabilities.
- Infrastructure as Code (IaC): Architect and manage multi-cloud infrastructure using reusable, secure, and highly scalable Terraform modules.
- Container Orchestration: Design, scale, and optimize production-grade Kubernetes clusters, ensuring high availability, security, and efficient resource utilization.
- Software & Tooling Development: Write clean, maintainable code (Python, Go, or similar) to build internal CLI tools, APIs, and automation scripts that integrate platform components.
- CI/CD Pipeline Engineering: Build and optimize robust CI/CD pipelines to automate the deployment of both applications and infrastructure.
- Observability & Reliability: Implement and maintain comprehensive monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK) to ensure platform health.
- Mentorship & Collaboration: Partner closely with application development teams to understand their needs, ease their operational burdens, and champion modern DevOps practices.
Qualifications
- Software Engineering Background: 3 years of previous experience as a Software Engineer, with strong proficiency in at least one backend language (e.g., Go, Python, Java, or Node.js). You know how to apply software engineering best practices (DRY, testing, CI/CD, version control) to infrastructure.
- Terraform Mastery: Deep, hands-on experience with Terraform, including writing custom providers, managing complex state files, and designing modular architectures across cloud providers (AWS, GCP, or Azure).
- Kubernetes Expertise: Proven track record of managing Kubernetes in production. Deep understanding of K8s internals, networking, ingress controllers, security policies, and GitOps workflows (e.g., ArgoCD, Flux).
- Cloud Architecture: Strong experience with core cloud services (compute, networking, storage, IAM) in a major public cloud environment.
- Systems Fundamentals: Solid understanding of Linux networking, security, and systems administration.