What are the responsibilities and job description for the Senior Site Reliability Engineer position at wellsfargo?
Title: Senior Site Reliability Engineer
Location: Charlotte, NC
Alternative Location: Phoenix, AZ, Irving, TX
Duration: 12 months
Work Engagement: W2
Work Schedule: 3 days in office/2 days remote
Benefits on offer for this contract position: Health Insurance, Life insurance, 401K and Voluntary Benefits
Summary:
We are seeking an experienced Platform Reliability / SRE Engineer to ensure the reliability, performance, and smooth operation of our enterprise Harness Continuous Delivery (CD) platform. This role is hands-on, automation-focused, and central to supporting our development teams across multiple environments.
Responsibilities:
Platform Reliability & Operations
· Ensure end-to-end reliability, availability, and performance of the Harness CD platform across non‑prod, prod, and BCP environments
· Monitor and report on SLIs, SLOs, error budgets, deployment success rates, and platform health
· Lead incident response and troubleshooting for deployment failures, outages, or performance issues
· Identify and resolve scaling, performance, and capacity challenges across delegates, pipelines, Kubernetes clusters, and cloud integrations
Automation & Engineering Excellence
· Build automation for provisioning, configuration, scaling, upgrades, and ongoing maintenance of Harness components
· Develop Infrastructure as Code (IaC) using Terraform, Ansible, Helm, or similar tools
· Automate operational tasks including delegate lifecycle management, cluster onboarding, secret rotation, and pipeline validation
· Reduce manual work by creating repeatable, self-service automation workflows
DevOps & CI/CD Integration
· Maintain and improve integrations between Harness and tools such as GitHub, Jenkins, Azure DevOps, Kubernetes/OpenShift, and cloud platforms
· Enhance developer experience by supporting efficient, reliable deployment pipelines
· Partner with DevOps teams on deployment strategies (blue/green, canary, rolling updates)
· Work with Security teams to embed DevSecOps practices, including policy enforcement and governance pipelines
Observability & Monitoring
· Build and maintain monitoring, logging, dashboards, and alerting for all Harness components
· Use tools such as Splunk, Prometheus, Grafana, or AppDynamics to create actionable alerts
· Detect and escalate issues such as pipeline delays, delegate saturation, API errors, and Kubernetes resource constraints
· Support proactive monitoring to reduce detection and resolution time
Modernization & Continuous Improvement
· Assist with Harness upgrades, patches, and lifecycle maintenance
· Support modernization initiatives such as containerization, cloud-native deployments, and multi‑cloud expansion
· Assist with resiliency activities including BCP testing and backup verification
· Evaluate new Harness features and modules for enterprise adoption
Technical Leadership
· Serve as a technical SME for the Harness platform
· Create documentation, architecture details, and operational runbooks
· Partner with senior engineers to enhance automation standards and platform best practices
Qualifications:
· Applicants must be authorized to work for ANY employer in the U.S. This position is not eligible for visa sponsorship.
· Demonstrated experience in DevOps, SRE, Platform Engineering, or Cloud Engineering
· Demonstrated hands-on experience with Harness CD
· Strong experience with Kubernetes/OpenShift, Linux, and cloud deployment best practices
· Solid understanding of CI/CD workflows and release automation
· Experience applying SRE concepts (SLIs, SLOs, error budgets, reliability improvements)
· Strong scripting and automation skills using Python, Bash, PowerShell, and Ansible
· Experience with Infrastructure as Code (Terraform, Ansible, Helm, or similar)
· Experience with monitoring and logging tools such as Prometheus, Grafana, Splunk, ELK, or AppDynamics
· Strong troubleshooting skills across containers, OS, networking, platforms, and cloud environments
· Data center migration experience (preferred)
· Experience supporting enterprise-scale CD platforms (preferred)
· Experience in hybrid cloud or cloud-native environments (Azure, GCP) (preferred)
· Familiarity with DevSecOps, governance models, and policy automation (preferred)
· Experience supporting complex upgrades, migrations, or modernization projects (preferred)