What are the responsibilities and job description for the Site Reliability Engineer (Google Cloud Platform) 7+ position at TekVivid?
Job Title: Site Reliability Engineer (SRE) – Google Cloud Platform
Experience: 8 Years
Location: San Jose, CA Onsite
Job Summary
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and hands-on experience in TQL (Telemetry/Query Language or similar monitoring/query tools). The ideal candidate will be responsible for ensuring system reliability, scalability, and performance while maintaining high availability of critical applications.
Key Responsibilities
- Design, implement, and maintain highly reliable and scalable systems on Google Cloud Platform
- Monitor system performance, availability, and latency using TQL or similar query/monitoring tools
- Automate infrastructure provisioning using Infrastructure as Code (IaC) tools (Terraform, Deployment Manager)
- Troubleshoot production issues and perform root cause analysis
- Implement CI/CD pipelines for faster, more reliable deployments
- Collaborate with development teams to improve system reliability and performance
- Manage incident response, on-call support, and post-incident reviews
- Optimize system performance, cost, and resource utilization
- Ensure security, compliance, and best practices across cloud environments
Required Skills & Qualifications
- 8 years of experience in Site Reliability Engineering / DevOps / Cloud Engineering
- Strong hands-on experience with Google Cloud Platform (GCP) services (Compute Engine, GKE, Cloud Storage, etc.)
- Google Cloud Platform Certification (Professional Cloud DevOps Engineer / Cloud Architect preferred)
- Experience with TQL or similar query languages for monitoring/logging (e.g., PromQL, SQL-like tools)
- Proficiency in scripting languages such as Python, Bash, or Go
- Experience with containerization and orchestration tools (Docker, Kubernetes)
- Strong understanding of CI/CD tools (Jenkins, GitLab CI, Cloud Build, etc.)
- Knowledge of monitoring tools (Prometheus, Grafana, Stackdriver/Cloud Monitoring)
- Experience with Infrastructure as Code (Terraform preferred)
- Solid understanding of networking, security, and system architecture
Salary: $45