Senior Site Reliability Engineer

Relay

Relay

Software Engineering
Toronto, ON, Canada
Posted on Sunday, June 9, 2024
Our mission is to increase the success rate of small businesses. Traditional banking has been a growth limiter rather than a growth enabler for business owners, and we’re changing that. Relay is the all-in-one, collaborative money management platform. We’re building for employer SMBs and their finance function, internal and external, and are focused on delivering a human-centric customer experience. Ultimately, we help SMBs be ‘on the money'.
We’re looking for an incredible Senior Site Reliability Engineer to join our Trust team. Your love of making high-impact decisions daily and desire to help shape the future of Relay is going to be crucial. The team’s vision is “Protecting the cathedral while enabling the bazaar” - quite a challenge in the scope of our multiple environments.

What You'll Be Doing:

  • Join the team owning our production infrastructures (AWS, Kubernetes, PostgreSQL databases, Terraform, Terragrunt)
  • Review infrastructure change requests, and triage & fix high-risk security and privacy issues in infrastructure components
  • Write playbooks, and run game days and threat modelling
  • Build monitoring systems to dynamically assess the infrastructure health
  • Improve our data repositories (db, warehouse, lake) posture: engine upgrade, zero-downtime migrations, privacy taggings
  • Provide guidance and mentoring for the rest of the team and help evolve Relay into a world-class security-oriented organization

Who You Are:

  • You have experience as a site reliability engineer working with these technologies: AWS, Datadog, Github, GHA, k8s, etc.
  • You have experience as a DBA (Aurora RDS, PostgreSQL, DynamoDB, ElastiCache)
  • You have experience with Terraform, Terragrunt, Node.js, Typescript
  • You have a strong security and operation focus; we are looking for someone to help us continue building security into every aspect of our work - and is ready to be on-call for production issues
  • You are a team player - our team is small and mighty, and we collaborate constantly - we want someone who is always willing to pitch in and isn’t afraid to ask for help
  • You are curious. You keep yourself on the bleeding edge of infrastructure best practices.

Bonus Points:

  • Show us your home lab! We have Ubiquity gears everywhere and we like to geek-out on our k8s clusters that control in-house experience
  • Send us your HackerOne account id - Security permeates everything we are doing
  • You’ve joined a company at its early stages and have seen it through scale
  • You have experience working in a fintech startup

Our SRE Tech Stack:

  • Container Orchestration: Kubernetes, ArgoCD, ECS
  • Cloud Platform: AWS (DynamoDB, RDS Postgres, Lambda, S3, SQS, SNS, SES, ElasticSearch, ECS, EKS, AND MORE)
  • Monitoring: Datadog
  • Relevant Languages: Javascript/Typescript, GoLang, Python
  • IAC: Terraform/Terragrunt
  • Tools: Github, GHA, Cloudflare

Our Commitment To You:

  • Competitive salary and meaningful equity: Relay employees are Relay owners, complete with equity and a competitive salary.
  • Comprehensive health benefits: enjoy full health benefits from day one: no probation period required. We offer flexible Health or Wellness Spending Accounts and medical, dental, and vision coverage for you and your dependents.
  • Flexible vacation and time off: every team member starts with 15 vacation days and 5 flex days to use as needed, plus an extra week of office closure during the end-of-year holidays so you can take time off to recharge and come back better for our customers.
  • Parental leave with top-up: we offer 12 weeks off with a 100% salary top-up for all full-time employees, regardless of location, and accessible for all parents: birthing, non-birthing, and adoptive.
  • Personal and professional growth: through ongoing feedback, mentorship, and coaching, work with peers and leaders who are invested in your growth and success.
  • Top-tier equipment: as a Mac-first company, our Toronto offices have everything you need to produce your best work comfortably, from multiple screens to ergonomic seating.
  • Social connection: we believe in celebrating our wins with two annual company-wide get-togethers, quarterly team events, happy hours, and special events and networking opportunities with industry leaders.
  • We’re driving real change for small business owners, powered by truly remarkable people. At Relay, you’ll find the confidence to take changes, trust to take initiative, and the support you need to build a career you love. Here, we make sure every team member feels empowered to make big decisions, encourage to ask tough questions, and challenged to take risks that result in work we’re all proud of. We give you the baton–you run the Relay.

The Interview Process:

  • Stage 1: A 30-minute Google Meets video call with a member of the Talent Team
  • Stage 2: A 45-minute Google Meets video call with our Engineering Manager, Trust
  • Stage 3: A 60-minute case study presentation with members of the Trust team
  • Stage 4: A 30-minute Google Meet video call with one of our executives
Research shows that women-identifying and other marginalized individuals tend to only apply when they meet 100% of the qualifications; if you don't have all the listed qualifications, we encourage you to apply anyway!
What’s Important to Us:
At Relay, we believe that diversity is key to building high-performing teams, and creating an inclusive work environment is our priority. We are an equal-opportunity employer and we welcome people of diverse backgrounds, perspectives, and skills.
We will work with applicants to provide accommodations at any stage of the hiring process. If you require accommodations during the interview process, please email your People Team contact, and we will work with you to meet your needs.