Remote, Canada
We are growing and we have an exciting new opportunity for a Site Reliability Engineer Consultant to join the MOBIA team.
Location is flexible and dependent on finding a qualified candidate. If you are passionate, creative, and entrepreneurial, and you believe in working hard and having fun, this could be the perfect opportunity for you!
Responsibilities:
Accelerate the build-out of AIOps and promote its best practices (i.e., platform, infrastructure, and application monitoring and alerting)
Accelerate integration and deployment of observability platforms within a client’s environment, whether the client runs in a single cloud, hybrid cloud, multi-cloud, or on-premises infrastructure
Consult with and help MOBIA clients adopt SRE best practices
Help MOBIA clients optimize alert thresholds and eliminate noisy alerts
Develop custom dashboards for platform monitoring across Kubernetes, public cloud, and on-premises infrastructure, as well as application services
Automate and customize observability platform configurations
Automate routine onboarding and decommissioning of applications and infrastructure components in common observability platforms
Convert configurations, dashboards, and alerting from legacy monitoring tools to new tools
Qualifications:
Hands-on expertise in designing Dynatrace dashboards and reports for Kubernetes platforms
2+ years of hands-on experience using Dynatrace to monitor applications in dev and prod environments
2+ years of experience with at least one of the following platforms: Dynatrace, Datadog, AppDynamics, Splunk, Grafana, or Turbonomic
Experience with AIOps platform root cause diagnosis and alert customization
Certification in at least one AIOps platform
Experience with proactive customer application root cause analysis
Experience with SRE platform analysis and root cause analysis
Experience with app and operations team onboarding to AIOps platforms
Nice-to-Have Skills:
Experience with automation tools such as Ansible and Terraform
Experience with Cloud Management Platform tools
Experience with process automation and workflow tools such as ServiceNow and JIRA