Site Reliability Engineering (SRE)
People Can Fly
Toronto, Ontario, Canada
2d ago

Job Description

  • Build and deploy the cloud-native infrastructure of the online services platform.
  • Build the tools, and foster the culture, for reliability across all our services.
  • Plan for, and exercise recovery from, disasters.
  • Build and deploy the platform to cloud service providers in an automated, reproductible way. Provision additional instances for development, testing, load testing, certification and (if needed) external publishers.
  • Harden the platform; advise the programmers on maximizing the reliability, scalability and uptime of their services.
  • Deploy the required tools to ensure maintenance, updates and recoveries of the services are quick, seamless, traceable, reproductible, and simple to revert if needed.
  • Establish disaster recovery protocols. Put them to the test.
  • Write and deploy monitoring dashboards and alerting systems to ascertain the state of online services and their dependencies in real-time.
  • Assist programmers in instrumenting their services so that they're monitored effectively.

  • Build dashboards to monitor the cost of our online systems in real-time. Advise programmers on minimizing operational costs.
  • Communicate with 3rd party providers and / or publishers in case of outages on their end.
  • Establish protocols for 24 / 7 on-call support of our live games.
  • Qualifications

  • Typically : 2+ years of experience in a Site Reliability Engineering (SRE) or DevOps position.
  • Videogame-specific experience is useful but not mandatory.
  • Other relevant domains to look into : content distribution, ad-tech, news, mobile gaming, finance.
  • FAANG (or adjacent) experience highly sought after.
  • Strong knowledge of one or two of : Amazon Web Services, Microsoft Azure, Google Cloud Platform.
  • Experience building, deploying and operating Kubernetes clusters in cloud-native environments (EKS on Amazon, AKS on Azure, GKS on Google).
  • Knowledge of infrastructure-as-code tooling (e.g. Hashicorp Terraform) and integration into CI / CD pipelines (e.g. Atlantis).
  • Experience deploying software on Kubernetes clusters using Docker, Helm and ArgoCD (GitOps-style operations).
  • Experience with monitoring and tracing stacks : Prometheus, InfluxDB, Loki, Grafana, OpenTelemetry.
  • Deep understanding of scalability, security and maintainability considerations.
  • Being able to work efficiently under tight deadlines.
  • Knowledge of any project management and bug tracking software.
  • Strong verbal and written communication skills in English.
  • Open-minded team player attitude.
  • Strong work ethic and self-motivated.
  • Passionate about playing and making video games.
  • Additional Information

    What we offer : U.S.

    U.S.

  • 100% group health insurance benefit premiums paid by PCF (Medical, Dental, Vision, Group Life, and Supplemental Live) and start on day 1 of employment.
  • 401K with 100% match, up to 3% of employee salary, and vested immediately.
  • Paid week off during Winter Holidays.
  • 20 paid vacation days and 5 paid sick days.
  • Free virtual health and mental wellbeing sessions included in the plan for members and their dependents.
  • A competitive salary and performance-based annual bonuses.
  • Personal development opportunities and ability to work in a global environment.
  • Work in a creative team with people full of passion for what they do.
  • Long term disability, short term disability, travel insurance, as well as other benefits provided.
  • Canada

  • Benefit package 100% paid by PCF. Insurance company reimburses 100% of claims (Up to $500 per service a year, as well as individual family coverage).
  • Full Dental coverage, including major dental and orthodontics.
  • 4% RRSP matching before tax deductions, 100% vested on day 1.
  • Paid week off during Winter Holidays.
  • 20 paid vacation days and 5 paid sick days.
  • Free virtual health and mental wellbeing sessions included in the plan for members and their dependents.
  • A competitive salary and performance-based annual bonuses.
  • Personal development opportunities and ability to work in a global environment.
  • Work in a creative team with people full of passion for what they do.
  • Report this job
    checkmark

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Apply
    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Continue
    Application form