Site Reliability Engineer (SRE) Consultant
Toronto, ON, Canada
2d ago

Remote Canada

We are growing and we have an exciting new opportunity for a Site Reliability Engineer Consultant to join the MOBIA team.

Location is flexible and dependent on finding a qualified candidate. If you are passionate, creative, entrepreneurial and believe in working hard and having fun, this could be the perfect opportunity for you!

Responsibilities :

  • Accelerate build and promote best practices for AIOps (i.e., Platform, infrastructure, and application monitoring / alerting)
  • Accelerate integration and deployment of observability platforms within a client’s environment, no matter if the client is deployed in a cloud, hybrid cloud, multi-cloud or with on-premises infrastructure deployments
  • Consult with and help MOBIA clients adopt SRE best practices
  • Help MOBIA clients optimize alert thresholds, and eliminate noisy alerts
  • Develop custom dashboards for platform monitoring across Kubernetes, public cloud, or with on-premises infrastructure as well as application services
  • Experience with automating and customizing observability platform configurations
  • Automation of routine onboarding and de-commissioning of applications and infrastructure components in common observability platforms
  • Conversion of configs / dashboards or alerting from legacy monitoring tools to new tools
  • Qualifications :

  • Hands on expertise with designing Dynatrace dashboards and reports for Kubernetes platforms
  • 2+ years hands on experience using Dynatrace to monitor applications in dev and prod environments
  • 2+ years’ experience with at least one of the following platforms : Dynatrace, Datadog, AppDynamics, Splunk, Grafana and Turbonomic, etc.
  • Experience with AIOps platform root cause diagnosis and alert customization
  • Certified in at least 1 AIOps platform
  • Experience with proactive customer application root cause analysis
  • Experience with SRE platform analysis and root cause analysis
  • Experience with app and operations team onboarding to AIOps platforms
  • Nice to Have Skills :

  • Experience with Automation tools such as Ansible, Terraform, etc.
  • Experience with Cloud Management Platform tools
  • Experience with process automation and workflow tools such as ServiceNow, JIRA, etc.
  • Report this job

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Application form