Demo

Site Reliability Engineer, CI/CD & Load Testing (US-Remote)

Braintrust
Braintrust Salary
San Francisco, CA Remote Full Time
POSTED ON 6/5/2026
AVAILABLE BEFORE 7/4/2026
Company
Braintrust is a global talent network that connects top independent professionals with leading companies for high-quality, flexible work. We help organizations hire skilled talent faster while giving professionals access to vetted opportunities with innovative teams.

Job description

About Give Lively

Give Lively builds better fundraising technology for nonprofits and gives it away for free. Our platform supports donation, fundraising, and event tools used by thousands of nonprofits and donors.


Reliability is central to our mission. When nonprofits need to raise money — especially during high-volume giving periods — our platform needs to be secure, available, observable, and resilient.


About the role

Give Lively is looking for a pragmatic Site Reliability Engineer to improve the reliability, scalability, and operational maturity of our fundraising platform.


This role will bridge development and operations. You’ll help own and improve CI/CD pipelines, release engineering, load testing, Heroku operations, incident response, observability, and production reliability. You’ll help automate manual operational tasks and prepare the platform for high-traffic events like Giving Season.


Our current platform runs primarily on Heroku, Postgres, CloudFront, and CircleCI. The immediate priority is improving reliability, CI/CD performance, deployment confidence, load testing, and Heroku operations. Experience with AWS, Terraform, or PaaS-to-cloud migration is helpful as we evaluate future infrastructure paths, but it is not required on day one.


This is not a role focused on building Kubernetes clusters or microservices from scratch. We’re looking for someone pragmatic who can improve and scale the systems we run today.


What success looks like

You will help Give Lively provide resilient, observable infrastructure that supports a secure and always-available donation platform.


Success means:

  • The platform is prepared for major giving periods, including a 10x traffic spike during Giving Season.
  • CI/CD pipelines are faster, more reliable, and easier to maintain.
  • Load testing is designed, executed, analyzed, and translated into action.
  • Manual operational tasks are automated or eliminated.
  • Incident response is structured and documented.
  • Postmortems are clear, blameless, and useful.
  • Monitoring and alerting help the team identify issues before donors are impacted.
  • Infrastructure decisions are pragmatic and aligned with the platform’s current needs.


What you’ll do

  • Own and improve production reliability for Give Lively’s fundraising platform.
  • Manage and optimize Heroku infrastructure, dynos, release phases, and Heroku Postgres.
  • Improve CircleCI workflows, caching, parallelization, and deployment reliability.
  • Design and execute load-testing protocols using tools such as JMeter, Artillery, k6, or similar.
  • Prepare the platform for Giving Season and other high-traffic events.
  • Support incident response, postmortems, retrospectives, and root cause analysis.
  • Improve observability across Heroku, Pingdom, Rails Autoscale, logs, and APM tools.
  • Automate operational tasks using Ruby, Bash, Python, or similar scripting.
  • Support test infrastructure for Cypress, Katalon, Happo, and visual regression suites.
  • Help evaluate future AWS and Terraform infrastructure paths.
  • Work closely with engineers and product managers to define reliability goals and SLOs.


Must haves

  • 5 years of experience supporting production web applications or infrastructure.
  • Strong CI/CD experience, ideally with CircleCI.
  • Experience designing, executing, and analyzing load tests.
  • Deep experience with Heroku or similar PaaS environments.
  • Experience managing Heroku Postgres or Postgres in production.
  • Experience with incident response, on-call rotations, PagerDuty, Statuspage, or similar tools.
  • Experience setting up and tuning monitoring, alerting, and observability systems.
  • Comfort writing scripts in Ruby, Bash, Python, or similar.
  • Solid understanding of HTTP, DNS, CDN behavior, SSL/TLS, and web infrastructure.
  • Pragmatic engineering judgment and comfort improving existing production systems.
  • Strong collaboration and communication skills.


Nice to haves

  • Experience with Ruby on Rails applications.
  • Experience with CloudFront.
  • Experience with AWS.
  • Experience with Terraform or other Infrastructure as Code tools.
  • Experience migrating applications from Heroku or another PaaS to AWS.
  • Experience preparing systems for predictable traffic spikes.
  • Experience with Rails Autoscale, Pingdom, PagerDuty, Cypress, Katalon, or Happo.
  • Experience defining SLOs or improving operational maturity in a small team.


Current stack

Our applications are built with Ruby on Rails. The core platform uses Heroku, Heroku Postgres, CloudFront, CircleCI, Heroku CI, and Postgres. We use RSpec, Cypress, Katalon, and Happo for test coverage and visual regression. Monitoring and alerting are handled through PagerDuty, Pingdom, Rails Autoscale, and related tools.


What Will You Get:

  • Excellent Medical/Dental/Vision benefits-- you don’t pay a monthly premium nor deductible, we cover it 100%
  • 401k match program that grants a 75% match of all your contributions, vested 100% on Day 1.
  • PTO/Paid Holidays
  • $1k/yr professional development stipend
  • Work/life balance, focused on a sustainable workload
  • An experience you’ll love and the knowledge you're doing better for yourself and the world
  • The chance to make an impact with a highly motivated, talented, and fast-growing team

Salary : $120,000 - $135,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer, CI/CD & Load Testing (US-Remote)?

Sign up to receive alerts about other jobs on the Site Reliability Engineer, CI/CD & Load Testing (US-Remote) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Braintrust

  • Braintrust Seattle, WA
  • About The Company Braintrust is the AI observability platform. By connecting evals and observability in one workflow, Braintrust gives builders the visibil... more
  • 2 Days Ago

  • Braintrust San Francisco, CA
  • Company Braintrust is a global talent network that connects top independent professionals with leading companies for high-quality, flexible work. We help o... more
  • 2 Days Ago

  • Braintrust York, NY
  • Job Description: Seeking multiple Data Science Subject Matter Experts to help design, run, and optimize data collection and evaluation workflows for GenAI ... more
  • 4 Days Ago

  • Braintrust York, NY
  • Job Description: We are seeking multiple Marketing Subject Matter Experts (SME) to train, evaluate, and fine-tune AI models for marketing. You’ll bridge ma... more
  • 4 Days Ago


Not the job you're looking for? Here are some other Site Reliability Engineer, CI/CD & Load Testing (US-Remote) jobs in the San Francisco, CA area that may be a better fit.

  • Asana San Francisco, CA
  • The Staff CI/CD Engineer is pivotal in transforming our CI/CD landscape to enhance developer efficiency across the organization. You will architect, develo... more
  • 12 Days Ago

  • Neuralink South San Francisco, CA
  • About Neuralink: We are creating devices that enable a bi-directional interface with the brain. These devices allow us to restore movement to the paralyzed... more
  • 16 Days Ago

AI Assistant is available now!

Feel free to start your new journey!