On behalf of our client in the Financial Services Sector, PROCOM is looking for a Site Reliability Engineer
DevOps Engineer - Job Description
Operational management of our product and application suites following concepts and ideas from Google’s Site Reliability Engineering (SRE).
Incident management, response and support under our mature incident management framework utilizing follow-the-sun methodology.
Drive Well-Architected and Application In Service Reviews for new applications, with a focus on the reliability and operational excellence pillars.
Software development of our shared job and workflow control solution utilizing AWS Batch, Step Functions and Lambda.
Software development as a member of one of our development teams during build and rollout phases utilizing project specific languages including Node.js, Java and Python.
Providing standardized offerings to facilitate the successful deployment of stacks including Continuous build, test, integration, and deployment platforms and pipelines
Providing standardized offerings to facilitate and ensure operational health of stacks throughout their lifecycle including metrics collection, aggregation, and visualization, inventory, capacity, and billing / tag management
DevOps Engineer - Mandatory Skills
Undergraduate degree in computing or related area
Previous experience in Software development, Engineering or Operations operationalizing and preferably supporting highly available and scalable applications
Proficient with at least one of the following programming and scripting languages : Ruby, Go, Python, Perl, bash, ksh
You possess 2+ years of advanced hands on experience in at least 3 of the following areas :
Infrastructure-as-a-service platforms : AWS, Google Compute Engine, Azure, Soft Layer, Linux OpenStack, etc.
Configuration management and automation tools such as : Chef, Puppet, and Ansible
Orchestration template technologies such as : OpenStack Heat, AWS Cloud Formation, Azure Resource Manager, Google Cloud Deployment Manager, and Hashicorp Terraform
Development using Github or Bitbucket
Containers and container scheduling and management platforms such as : Docker, rkt, Mesos, or Kubernetes
Managing traditional enterprise platforms for compute, network, and storage
Managing traditional enterprise platforms for application runtimes, integration middleware, and relational databases
Site Reliability Engineer - Nice to Have Skills
Cloud certification from AWS, Google or Azure
Linux certification (LPIC-2, LFCE, RHCE)
Scrum leadership and agile development experience
Expertise with hedge funds, investor relations, private equity and / or real estate
DevOps Engineer - Assignment Start Date
ASAP - 6 months to start
DevOps Engineer - Assignment Location