What are the responsibilities and job description for the Site Reliability Engineer / Google Cloud Platform / Remote position at Motion Recruitment Partners, LLC?
This is a Site Reliability Engineer opportunity supporting a high-scale platform in the real-money gaming and lottery space. The role is fully remote (EST hours preferred) and focuses heavily on Kubernetes, Google Cloud Platform, CI/CD automation, and observability tooling (Grafana/Prometheus stack) in a distributed, production-critical environment.
This role is centered around owning reliability end-to-end. You will be responsible for ensuring platform stability, scalability, and performance while partnering closely with engineering teams to build systems that are fault-tolerant and production-ready from day one. The biggest draw here is the ability to work on high-impact systems where uptime and performance directly affect real users, while gaining strong exposure to modern SRE practices, microservices architecture, and cloud-native infrastructure.
Required Skills & Experience
5 years in SRE, DevOps, or Infrastructure Engineering
Strong experience with Kubernetes and Docker
Cloud experience (Google Cloud Platform preferred, AWS acceptable)
Experience building and maintaining CI/CD pipelines
Hands-on experience with observability tools (Grafana, Prometheus, Loki, Tempo)
Strong understanding of distributed systems and microservices architecture
Experience with incident response and root cause analysis
Strong troubleshooting and systems thinking skills
Desired Skills & Experience
Experience with Go
Familiarity with service mesh technologies (Istio)
Experience managing PostgreSQL at scale
Experience optimizing cloud cost and performance
Exposure to SLA/SLO/SLI frameworks
What You Will Be Doing
Tech Breakdown
60% Cloud Infrastructure (Google Cloud Platform, Kubernetes, Docker)
20% CI/CD & Automation
20% Observability & Monitoring (Grafana stack)
Daily Responsibilities
70% Hands On
10% Management Duties
20% Team Collaboration
The Offer
Bonus or commission eligible
You will receive the following benefits:
Medical, Dental, and Vision Insurance
Vacation Time
Stock Options
Applicants must be currently authorized to work in the US on a full-time basis now and in the future.
Salary: $170,000 - $180,000