You haven't searched anything yet.
Department: Engineering
Reports to: Site Reliability Engineer
Location: Seattle, WA
Employment Type: Full-time
Start Date: ASAP
Who we are:
fabric is a modern commerce platform that gives retailers tools to create world-class shopping experiences for mid-market enterprises. We champion a new, harmonious way of doing business that emphasizes connectedness and collaboration over competition and dominance. This is showcased in our products that rely on microservices, APIs, and easy integrations, and in our globally distributed team that genuinely cares about its customers. Our founders directed groundbreaking commerce initiatives at Amazon, Staples, Google and eBay. We're growing fast and looking for more awesome people to join us.
Duties:
The Site Reliability Engineer (Multiple positions open) at Commerce Fabric Inc. in Seattle, Washington will be responsible for building software systems and application tools for internal use as well as customer use that enable the engineering teams to operate safely at a high speed and wide scale. The duties include developing tools and automated solutions to support hosted services for high availability and resiliency; implementing, monitoring and alerting for improved mean time to detect (MTTD) and mean time to recover(MTTR), evaluating the average time elapsed between a failure and the next time it occurs and assessing the time it takes to run a repair after the failure; resolving issues escalated in the software production environment; troubleshooting performance, reliability and scalability issues on distributed systems; collaborating with application engineers and training developers; monitoring, alerting, and administering license management on cloud services including Amazon Web Services (AWS); developing monitoring and alerting framework utilizing Datadog, Cloudwatch, X-ray, Sedai(for Auto-remediation), Grafana and Prometheus; software programming using web technologies and infrastructure automation; administering and automating Linux Servers in cloud services; building on industry leading infrastructure tools and technologies including Terraform, Cloudfront, and AWS to create tailored solutions on a wide scale. This is a 100% remote position. May work from home anywhere in the United States.
Requirements:
Bachelor's degree in information systems or directly related field plus five years of software development experience developing tools and automated solutions to support hosted services for high availability. The five years of experience must include five years of experience with each of the following: (1) Set-Up Datadog or Cloudwatch dashboards for monitoring the applications or systems; (2) Set Up alarms for identification of application or server failures; (3) AWS services including EKS, Serverless(lambda), Cloudfront, VPC, Apigateways, EC2, Cloud Watch, ECS, RDS, SNS, or IAM; (4) coordinating infrastructure planning for capacity planning analysis, disaster recovery and load balancing activities in the hosting environment; (5) writing terraform or python automations for configuration management and deployment of applications to servers; (6) Architecting High Availability and resilient systems with autoscaling; (7) analyzing technical solutions that fit with scalable and distributed systems on AWS cloud; (8) designing and deploying microservice environment on Amazon EKS; (9) Orchestrate CICD pipelines on gitlab, bitbucket, or Jenkins for Infra deployment, Serverless API deployment and EKS deployments; (10) working with Terraform or Terragrunt code for the deployment of the Infrastructure; (11) Build Secure, Scalable Multi-VPC Network; and (12) Orchestrate Microservices or storefront NextJs applications on EKS/ECS Clusters.
This notice is subject to Commerce Fabric’s employee referral program.
Interested candidates must apply online at https://boards.greenhouse.io/fabric.
What we bring to the table:
*fabric is an equal opportunity employer as well as a government contractor that shall abide by the requirements of 41 CFR 60-300.5(a), which prohibits discrimination against qualified protected Veterans and the requirements of 41 CFR 60-741.5(A), which prohibits discrimination against qualified individuals on the basis of disability.
Full Time
Retail
09/14/2022
10/21/2022
fabric.com
RIFLE, CO
50 - 100
1993
Private
JOLYNN MURRAY
<$5M
Retail
Fabric is an online fabric store.
The job skills required for Site Reliability Engineer include Analysis, Planning, Troubleshooting, Initiative, Python, Collaboration, etc. Having related job skills and expertise will give you an advantage when applying to be a Site Reliability Engineer. That makes you unique and can impact how much salary you can get paid. Below are job openings related to skills required by Site Reliability Engineer. Select any job title you are interested in and start to search job requirements.
The following is the career advancement route for Site Reliability Engineer positions, which can be used as a reference in future career path planning. As a Site Reliability Engineer, it can be promoted into senior positions as a Corrosion Engineer II that are expected to handle more key tasks, people in this role will get a higher salary paid than an ordinary Site Reliability Engineer. You can explore the career advancement for a Site Reliability Engineer below and select your interested title to get hiring information.
If you are interested in becoming a Site Reliability Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Site Reliability Engineer for your reference.
Step 1: Understand the job description and responsibilities of an Accountant.
Quotes from people on Site Reliability Engineer job description and responsibilities
Similarly to the point above, a site reliability engineer can expect to spend time fixing support escalation cases.
03/16/2022: Little Rock, AR
More times than not, site reliability engineers will need to take on-call responsibilities.
01/31/2022: Lexington, KY
Focuses on the reliability of behind-the-scenes systems that help make other teams' jobs more efficient.
02/24/2022: Tuscaloosa, AL
Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation.
02/25/2022: Manchester, NH
Step 2: Knowing the best tips for becoming an Accountant can help you explore the needs of the position and prepare for the job-related knowledge well ahead of time.
Career tips from people on Site Reliability Engineer jobs
The objective was to ensure service reliability and availability within operations management.
12/28/2021: Lima, OH
Step 3: View the best colleges and universities for Site Reliability Engineer.