Site Reliability Engineer at PartsTech

135k - 165k USD

Remote

Contractor

Tags: Engineering, E-Commerce, SaaS, Incident Management, API, Infrastructure Monitoring, CloudWatch, Kotlin, Problem Solving, Communication, Analytical Skills

PartsTech is looking for a Site Reliability Engineer

PartsTech creates automotive e-commerce technology, helping repair shops, auto parts distributors, and manufacturers run their businesses more effectively and profitably through e-commerce and data innovation. We increase efficiency for the automotive aftermarket by connecting repair shops, parts distributors, and manufacturers in one seamless e-commerce platform. PartsTech makes finding and ordering the right parts simple, fast, and accurate.

PartsTech seeks a dynamic Site Reliability Engineer to support our platform and integrations. In this pivotal role, you will ensure that SLAs are exceeded and that we continue providing best-in-class services to our customers as our platform grows. 

The ideal candidate will have in-depth experience with SaaS application technologies, especially production support and incident management processes, and will provide guidance to improve MTTD and MTTR for large-scale platforms and cloud-based applications with multiple integrations. You will contribute significantly to our team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of cloud-based SaaS applications.

Eastern Time Zone - Canada, Remote

What You’ll Accomplish:

  • Use a variety of monitoring tools to observe system performance and detect anomalies, including Synthetic Monitoring, Application Performance Monitoring (APM), Infrastructure Monitoring, and API Monitoring.
  • Review system logs and be able to extract information to share with our partners on a per-request basis.
  • Work with the Integrations team and other internal teams to provide data around partner performance and SLAs.
  • Work with the outbound integrations team on measuring outbound partners’ SLAs.
  • Create and manage alerts based on Service Level Agreements (SLAs) and the requirements of specific applications/microservices.
  • Lead cross-functional teams in the identification, triage, and resolution of critical incidents, including severity 1 (critical), severity 2 (urgent), and some severity 3 incidents.
  • Ensure swift restoration and recovery of services, adhering to established SLAs and minimizing business impact.
  • Serve as the primary point of contact for all communications related to incidents. 
  • Provide timely updates and escalations to stakeholders, senior management, customers, and partners.
  • Conduct post-mortem analyses for critical incidents.
  • Recommend the implementation of a Correction of Error (COE) process for in-depth root cause analysis and preventive measures to avoid recurrence as the organization matures.
  • Prepare comprehensive incident reports. Keep Sigma regularly updated with daily, weekly, and quarterly views for tracking and analysis.
  • Weekly Business Reviews (WBRs): Conduct meetings between Customer Support, Product, Engineering, and Partner Support to review a real-time report. 
  • Monthly Business Reviews (MBRs): Present an end-to-end view of Customer & Partner Support to executives every month.
  • Quarterly Business Reviews (QBRs): Present a snapshot report to the BoD and partners.
  • Quarterly Business Review with Partners and Suppliers.

Who You Are: 

  • Bachelor’s degree in Computer Science, Information Systems, or a related technical field, or comparable work experience, required.
  • Must reside in the Eastern Time Zone in Canada.
  • 5+ years of experience using Application Performance Monitoring (APM), API, and infrastructure monitoring tools such as AppDynamics, New Relic, Datadog, Grafana, Prometheus, CloudWatch, and Synthetics.
  • Experienced in cloud-based deployment environments.
  • Proficient with programming constructs, especially in engineering frameworks applicable to Kotlin. 
  • Demonstrates a high sense of urgency in completing tasks and resolving issues, ensuring projects are delivered with excellence and on time.
  • Possesses strong written and verbal communication skills, and is comfortable engaging with business stakeholders and external clients. 
  • Has an in-depth understanding of and experience with Incident Management processes, including detection, recovery, conducting Correction of Error (COE) analysis, and following up on problem tasks.
  • Strong analytical and problem-solving skills.

Bonus Points:

  • Experience building large applications from scratch, complete with CI/CD infrastructure.
  • Experience with at least one of the major cloud providers (Amazon Web Services, Google Cloud, Microsoft Azure).
  • Experience managing Kubernetes clusters or some other container orchestration infrastructure.
  • Experience with observability of large-scale distributed systems (100s+ microservices, 50+ integrations).

Compensation: Contract-to-Hire. Annual Salary Range: $135,000 - $165,000 CAD

Why You Should Join Us:

Our vision is to make it fast and easy for auto repair shops to find the right parts across all of their suppliers with one search. Together, PartsTech’s team has helped countless businesses save valuable time so they can focus on their customers, and we’re just getting started.

The PartsTech team is a global, distributed group of passionate self-starters based in Cambridge, Hartford, CT, Eastern Europe, and beyond. We are remote-first, privately held and venture-backed. 

PartsTech is proud to be an equal opportunity employer, and values diversity at every level of our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We believe you should bring your whole self to work, so come as you are. 

Note: The job description provided is a general outline of responsibilities and qualifications for this role at PartsTech. Actual responsibilities and qualifications may vary depending on the specific needs of the company and department.
