SRE Lead

We're looking for an SRE Lead to work on site with our development team in our office in Hong Kong.

We're a small, but fast growing company, with new and exciting problems to solve. We work in project-based sprints in small, interdisciplinary teams.

As an SRE Lead you’d work on building out automated solutions for tough operational problems using industry best practice and cloud native technologies. You’d actively seek out innovative solutions towards operational excellence and coordinate proactively with development, operations and the wider platform team to improve system availability, security, performance and maintainability.

  • Collaborate closely with our development teams in our fast-paced delivery environment
  • Set reliability objectives across the layers of our application from business logic to infrastructure
  • Codify and rollout shared tooling and process to enable development teams to stay agile while improving non-functional requirements such as system availability, security, performance and maintainability
  • Build, run and mentor a team of SRE Engineers
  • Help define operational process including on-call rotation through development teams
  • Assess a wide range of incoming requirements with the wider Engineering team with a focus on how we operate that well in production
  • Mindset to automate everything or at least make it self-service
  • Excellent knowledge of modern automation principles, for example IaC and GitOps
  • Distributed system knowledge applicable to a microservice architecture
  • Breadth of knowledge – operating systems, networking, distributed computing, cloud computing
  • Self-starter, capable of working without direction and able to deliver projects from scratch
  • Public cloud management experience – e.g. AWS with Hashicorp Terraform
  • Container Management and container orchestration experience – Docker, Kubernetes
  • Experience managing and/or developing applications using JVM based languages such as Java/Scala/Kotlin
  • Good practical knowledge with SQL/RDBMS, PostgreSQL preferred
  • Monitoring tools Elastic Stack, Prometheus, Grafana
  • Strong knowledge of Linux/UNIX
  • Strong scripting skills – Python and Bash
  • Strong understanding and practice Agile/Lean projects SCRUM, KANBAN etc.
  • Practical knowledge with Git flow, Trunk and GitHub flow branching strategies
  • Strong English communication skills