Position Overview Replicon's Site Reliability Engineering (SRE) team has a focus on automating the smooth operation of Replicon's Cloud Platform.
We strive to accomplish projects such as fully automating a continuous delivery process, building highly resilient infrastructure components, and introducing & practicing Chaos Engineering.
Database Reliability Engineers are dedicated every day to improving and maintaining the performance, cost-effectiveness, security, resiliency, and durability of the databases and data storage systems that support the Replicon Cloud Platform.
They proactively identify issues affecting data systems, squash issues before they impact customers, and address systemic causes at the core of those issues.
They are expected to constantly be learning and applying industry best-practices in data system management. Responsibilities include :
Design and implement service-level objectives for data storage systems, and collaborate with engineering teams to meet objectives
Identify and monitor poorly performing database operations, analyze them, and perform database, schema, and application optimizations
Build, manage, and maintain the software tools necessary to operate Replicon's data systems at scale, with a heavy focus on automation
Strive to eliminate manual work performed that is repetitive, non-creative, and non-challenging
Become deeply involved in any new implementations of data storage systems, providing Replicon minimum standards and technical requirements, and enforcing those standards consistently across the board
When the going gets tough, provide valuable insights in a blameless post-mortem environment to identify the root causes of data system reliability incidents, and resolve them thoroughly and completely
Expert-level experience with relational database management systems; including data architecture, application optimization, database server optimization and tuning
Strong preference for specific experience with PostgreSQL
Familiar with non-RDBMS data storage systems, such as redis, AWS ElastiCache, AWS S3, AWS DynamoDB
Working understanding of cloud based infrastructure offerings in data management, specifically AWS managed solutions (eg. AWS Aurora)
Experience in performing hands-on software development, in any programming language or platform
Familiarity with with DevOps-style automation tools (eg. Docker, Terraform)
Knowledge of design patterns and trade-offs when implementing large-scale SaaS multi-tenant applications on RDBMS and other data systems
Be excited and proactive about the opportunities to learn and work with new technologies and practices; bring a lifelong learning attitude into the position