Site Reliability Engineer
TradeRev
Toronto, Ontario, CA
10d ago

Site Reliability Engineer

Responsibilities :

  • Build scalable systems, using best practices around automation, pushing changes that improve reliability and velocity
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning and reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Provide mentorship and training to other team members on technologies and processes; drive education and knowledge transfer of design patterns, technical practices, and relevant technologies and tools
  • Drive high standards around incident response practices and policies
  • Qualifications :

  • 4+ years' of experience in an Operational role, DevOps, SRE, or Software Engineering
  • In-depth experience with cloud computing and solid experience of setup and management of cloud infrastructure
  • You can write code - in any language. You’ve implemented your work to production
  • Extensive experience with configuration management and infrastructure automation tools, ie Ansible, Terraform, SaltStack, Puppet, Chef, etc
  • Experience with large scale distributed systems in the cloud and concerns like load balancing and disaster recovery
  • Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting
  • Bachelor of Computer Science or Computer Engineering
  • Step 2
    Apply
    Add to favourites
    Remove from favourites
    Apply
    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Continue
    Application form