Staff Site Reliability Engineer

Thinkific is a software platform that enables entrepreneurs to create, market, sell, and deliver their own online courses. Our mission is no less than to revolutionize the way people learn and earn online by giving them the tools they need to turn their expertise into a sustainable business that impacts both them and their audience. Our team of 275+ Thinkers supports customers around the globe while working collaboratively to learn, grow, and succeed together. Join us to see how we’re building one of the best workplaces in Canadian tech!

We believe every candidate should have a fair, inclusive, and overall great experience when exploring a new role with Thinkific. That starts with outlining our hiring process so you know what to expect every step of the way—click here to learn more: https://thnk.cc/whattoexpect

Are you an experienced Site Reliability Engineer looking for a new challenge?  We’re looking for a Staff Site Reliability Engineer to join us at Thinkific. 

We’re looking for a Staff Site Reliability Engineer (SRE) to join us at Thinkific. As a Staff SRE, you’ll work closely with engineers, domain experts, and stakeholders to improve the performance, reliability, and security of our systems. You’ll bring deep technical skills and a collaborative mindset to drive forward important projects and help other engineers grow through mentorship and coaching.

You’ll be a core contributor on your team and a go-to expert in your domain. You’ll collaborate with engineering leadership and cross-functional partners to solve complex problems and ensure our systems scale reliably to support our growing platform.

Your goal will be to help guide and execute on projects related to your technical domain. Here’s how you’ll accomplish this:

  • Own a technical domain within our system and be accountable for operations and SLOs related to performance, reliability, and security, as well as architectural evolution and technical documentation aligned with broader strategy
  • Contribute to the planning and execution of technical projects within and across your team, helping ensure that initiatives are well-scoped, aligned with organizational priorities, and effectively delivered
  • Partner with product managers, designers, and other engineers to define system requirements, propose implementation strategies, and make tradeoffs visible
  • Champion operational excellence, observability, and incident response across your team and adjacent services
  • Write high-quality, maintainable, and efficient code with a focus on long-term scalability and performance
  • Share your expertise by mentoring other engineers, supporting code reviews, and guiding others through architectural and debugging challenges
  • Promote a culture of continuous improvement by encouraging experimentation, learning from failure, and driving engineering best practices in reliability, performance, and software quality
  • Participate in our on-call rotation and incident response processes to help maintain a high level of service reliability

The person we have in mind likely:

  • Has 6+ years of experience in the software or infrastructure engineering profession, including time spent in a reliability or platform-focused role
  • Has experience owning services in production, and feels comfortable with infrastructure as code, container orchestration, and cloud-native development practices
  • Understands the operational needs of complex distributed systems and has experience with monitoring, observability, incident management, and system hardening
  • Writes infrastructure code in tools like Terraform with an eye toward security, modularity, and collaboration
  • Has experience with languages like Ruby, Python, or Bash, and is proficient in working with relational and non-relational databases such as Postgres or AWS Aurora
  • Can identify root causes of complex issues across multiple systems and work with stakeholders to develop resilient solutions
  • Has experience with queueing systems like SNS, SQS, or Sidekiq and understands patterns for asynchronous processing and fault tolerance
  • Enjoys collaborating across teams, sharing knowledge, and helping shape the team’s technical roadmap
  • Is a thoughtful communicator who proactively shares context, feedback, and plans with their team and stakeholders
  • Brings a continuous improvement mindset by seeking out opportunities to streamline workflows, reduce toil, and enable team success
  • Loves to learn and grow. They’ve found (and keep looking for) ways to level up their skills in this field, whether that’s through formal education, gaining professional experience, or maybe even building their own business 

These things would also be nice, but we think you could learn them on the job: 

  • Experience working with AWS services and infrastructure at scale
  • Knowledge of networking fundamentals and related cloud services such as Cloudflare, load balancing, and TLS

The recruitment compensation range for this position is $135,000 – $165,000 CAD. Your specific compensation within this range is determined based on your job-related skills, knowledge, experience, and our internal equity assessment.

Diversity, Equity, Inclusion and Belonging & Accessibility

This is just our initial idea of who we’re looking for! At Thinkific, we know that people have unique career journeys. If your experience is close to what we’ve described but you feel that you might be missing a few of the requirements, please still apply! We believe in equal opportunity and are committed to diversity, equity, inclusion, and belonging across every facet of our business.

We’re also committed to providing a comfortable and accessible interview experience for every candidate. If there are any accommodations our team can make throughout our hiring process (big or small), please let us know.       

 


What you can expect if you join Thinkific:

👏 An amazing team of talented, passionate, and kind Thinkers. Together, we’ve built an amazing, award-winning culture—we’re a Certified Great Place to Work and one of Canada’s Top Small & Medium Employers!

🚀 The chance to build, improve, and innovate on a platform that’s driving positive impact for thousands of businesses and millions of students around the world.

💸 A competitive compensation package including base salary, equity, team-wide bonuses, and an Employee Share Purchase Plan.

🌴Flexible Paid Time Off to maintain mental and physical health. Our team is encouraged to take a minimum 4 weeks of vacation, plus Thinker Holidays (extended long weekends in the summer) and time off for the December holiday season.

🩺 Health Benefits and Wellness: Comprehensive benefits starting on Day 1 include health, vision, and dental coverage for you and your family, $3,000 for mental health care, a short-term health plan, and an additional health or personal spending account. Plus, family friendly benefits include generous parental leave top-ups for up to 32 weeks, as well as fertility coverage and personalized return to work options. 

💻 Flexible Work. Choose to work from home from anywhere in Canada, at our Vancouver HQ, a co-working space, or anywhere there’s wifi for a change of scenery.

⬆️ Learning & Growth. An annual $1500 USD Learn and Grow fund for conferences, seminars, or courses, plus training, mentorship, coaching, and internal promotion opportunities.

🏡 A home office setup so you’re ready to succeed with a company-owned Macbook Pro and a budget to order a desk, chair, or any accessories to help you work comfortably and productively. 

💙 A place where you can bring your whole self to work. We know that different perspectives lead to amazing ideas, more innovation, and, ultimately, our success as a company. We welcome applicants of all backgrounds, experiences, beliefs, identities, and statuses. Whoever you are—we can’t wait to meet you!

The Thinkific Vancouver office operates on the traditional, ancestral, and unceded territories of the xʷməθkʷəy̓əm (Musqueam), Sḵwx̱wú7mesh (Squamish), and Sel̓íl̓witulh (Tsleil-Waututh) Nations of the Coast Salish People.  We encourage everyone to learn more about the original caretakers of the land that you currently occupy.