Senior Site Reliability Engineer (SRE)
Change Healthcare
Richmond, BC
1d ago

Transforming the future of healthcare isn’t something we take lightly. It takes teams of the best and the brightest, working together to make an impact.

As one of the largest healthcare technology companies in the U.S., we are a catalyst to accelerate the journey toward improved lives and healthier communities.

Here at Change Healthcare, we’re using our influence to drive positive changes across the industry, and we want motivated and passionate people like you to help us continue to bring new and innovative ideas to life.

If you’re ready to embrace your passion and do what you love with a company that’s committed to supporting your future, then you belong at Change Healthcare.

Pursue purpose. Champion innovation. Earn trust. Be agile. Include all.

Position Description

As a Site Reliability Engineer (SRE) and member of the Cloud Operations team working on Change Healthcare’s Enterprise Imaging Cloud SaaS solution, you will be responsible for ensuring the reliability, security and efficiency of a critical healthcare service with a rapidly growing customer base.

This includes continuous delivery, configuration management, performance monitoring, initial troubleshooting and incident response management.

In addition, we improve our operations through software development by building automation for repetitive work. In this role you bring an extensive background in all aspects of enterprise-grade, large-scale cloud solutions.

Responsibilities

Service Reliability

  • Operate, scale, and troubleshoot applications and infrastructure for the cloud-based SaaS Enterprise Imaging platform and all components within.
  • Provide 24x7x365 shift-based support with rotating on-call.
  • Ensure SLAs are met by maintaining high availability and performance of enterprise imaging applications.
  • Follow and improve cloud operations runbooks and standard operating procedures.
  • Manage incident response using well-defined operational procedures, tools and efficient communication with internal and external stakeholders.
  • Perform system administration (sysadmin) tasks on cloud, Linux and Windows operating systems, including network configuration and permission control.
  • Oversee, participate in and manage production application deployments.
  • Plan, implement, monitor, and test systems and procedures for best practice Business Continuity and Disaster Recovery (BC / DR).

Monitoring

  • Define metrics of success and provide operational reports and dashboards utilizing company analytics tools.
  • Define and implement effective monitoring and emergency alerting for cloud infrastructure, services, applications and customer connectivity.
  • Monitor cloud resource utilization and associated costs.

Compliance & Security

  • Ensure compliance with medical device, privacy and security regulations, including the safety and security of infrastructure and data.
  • Actively support compliance auditing activities.
  • Ensure secure and managed access to production and staging environments.

Automation

  • Use and develop tools that automate continuous delivery, monitoring and troubleshooting of systems.

Collaboration

  • Collaborate with other teams to make sure that the infrastructure and applications that depend on it work together seamlessly.
  • Support other teams’ infrastructure needs as required.
  • Interact with vendors, consultants, partners, and customers to ensure that cloud operations meet the needs of all users of the platform.
  • Collaborate with internal teams such as engineering, security operations, software architecture, support and other cross-functional teams.

Continuous Improvement

  • Actively use the continuous delivery toolset and suggest improvements to it.
  • Work with other operational teams on defining and improving SLAs, processes, tools and procedures.
  • Serve as the company’s subject matter expert, supporting other Change Healthcare teams on cloud technologies, operations and DevOps methodology.

Minimum Requirements (Required)

  • Bachelor's degree in Information Systems, Computer Science, Engineering or related field.
  • 3+ years of experience administering cloud infrastructure and deployed applications for enterprise SaaS or PaaS companies in public clouds such as AWS, GCP or Azure (GCP preferred).
  • 5+ years of experience administering IT systems, including compute, network, storage and access control.
  • 3+ years of experience and a proven record of success monitoring cloud infrastructure and SaaS or PaaS applications.
  • 3+ years of experience and a proven record of success using or creating automated delivery and configuration tools (CI/CD and monitoring tools).

Critical Skills (Required)

  • Energetic, motivated and customer focused.
  • Exceptional critical and analytical thinking skills; ability to decompose complex problems, prioritize issues and implement sensible solutions.
  • Proven experience managing outages, customer escalations, crises and similar situations.
  • Able and willing to work in a fast-paced, quickly changing environment.
  • Strong knowledge of cloud infrastructure, including compute, networking, storage and other cloud services (GCP preferred).
  • Solid foundation in Linux and Windows operating systems and tools.
  • Strong knowledge of IT infrastructure such as switches, routers, firewalls, VPNs, IDS, IPS and proxies.
  • Proficient with DevOps tools and environments such as Jenkins, Git, Ansible and Terraform.
  • Proficient with scripting languages such as Python, PowerShell and Bash.
  • Experience with centralized logging and metric services such as StackDriver, TICK, ELK, DataDog and Splunk (TICK preferred).
  • Experience with monitoring tools such as StackDriver, DynaTrace, NewRelic, Graphite, Nagios and Zabbix.
  • Understanding of cybersecurity methodology such as security controls, access control and auditing.

  • Good communication and presentation skills.
  • Able to mentor and lead less experienced team members.

Additional Knowledge and Skills (Preferred)

  • Advanced experience with DevOps methodology and Continuous Delivery.
  • Experience migrating products from on-premises environments to the cloud.
  • Ability to negotiate effectively with peers without direct authority.
  • Experience with HIPAA compliance and the security of PHI data.
  • Familiarity with healthcare IT standards and healthcare workflows.

Join our team today, where we are creating a better coordinated, increasingly collaborative and more efficient healthcare system!
