Site Reliability Engineer
Procom
Toronto, ON Canada
1d ago

On behalf of our client in the Financial Services Sector, PROCOM is looking for a Site Reliability Engineer.

DevOps Engineer - Job Description

  • Operational management of our product and application suites following concepts and ideas from Google’s Site Reliability Engineering (SRE).
  • Incident management, response and support under our mature incident management framework utilizing follow-the-sun methodology.
  • Drive Well-Architected and Application In Service Reviews for new applications, with a focus on the reliability and operational excellence pillars.
  • Software development of our shared job and workflow control solution utilizing AWS Batch, Step Functions and Lambda.
  • Software development as a member of one of our development teams during build and rollout phases, utilizing project-specific languages including Node.js, Java and Python.
  • Providing standardized offerings to facilitate the successful deployment of stacks, including continuous build, test, integration, and deployment platforms and pipelines.
  • Providing standardized offerings to facilitate and ensure the operational health of stacks throughout their lifecycle, including metrics collection, aggregation, and visualization, as well as inventory, capacity, and billing / tag management.

DevOps Engineer - Mandatory Skills

  • Undergraduate degree in computing or related area
  • Previous experience in software development, engineering or operations, operationalizing and preferably supporting highly available and scalable applications
  • Professional software development and SDLC experience with one or more of the following: C, C++, Java, Node.js, SQL, JavaScript or similar programming languages
  • Proficient with at least one of the following programming and scripting languages: Ruby, Go, Python, Perl, bash, ksh
  • You possess 2+ years of advanced hands-on experience in at least 3 of the following areas:
      • Infrastructure-as-a-service platforms: AWS, Google Compute Engine, Azure, SoftLayer, Linux OpenStack, etc.
      • Configuration management and automation tools such as Chef, Puppet, and Ansible
      • Orchestration template technologies such as OpenStack Heat, AWS CloudFormation, Azure Resource Manager, Google Cloud Deployment Manager, and HashiCorp Terraform
      • Development using GitHub or Bitbucket
      • Containers and container scheduling and management platforms such as Docker, rkt, Mesos, or Kubernetes
      • Managing traditional enterprise platforms for compute, network, and storage
      • Managing traditional enterprise platforms for application runtimes, integration middleware, and relational databases

Site Reliability Engineer - Nice to Have Skills

  • Cloud certification from AWS, Google or Azure
  • Linux certification (LPIC-2, LFCE, RHCE)
  • Scrum leadership and agile development experience
  • Expertise with hedge funds, investor relations, private equity and/or real estate

DevOps Engineer - Assignment Start Date

ASAP - 6 months to start

DevOps Engineer - Assignment Location

Downtown Toronto
