Site Reliability Engineer | GCP and Kubernetes | SaaS HealthTech
6-12 Months - Fully Remote
This is one of those *awesome* opportunities! Our client is a global SaaS HealthTech with 15,000 employees and a footprint on every major continent. They have just landed a huge new contract with a large hospital in the UK. You will work within an enterprise business while enjoying the fun culture and feel of building a 'mini start-up' inside a mature organisation.
This is an SRE contract role on an exciting greenfield project. You will start with a lift and shift of on-prem architecture into GCP, then design and build a GCP SRE platform to host the new SaaS architecture, using Kubernetes cluster builds, CI/CD installs and languages such as Python to 'glue' everything together.
Absolute essentials / non-negotiables for the role:
- GCP knowledge and experience leading large-scale projects
- Working with Kubernetes and CI/CD tooling
- An understanding of true SRE / Platform engineering as opposed to 'DevOps'. You will be asked to define "what is an SRE" in the interview process.
- SRE fundamentals: SLIs, SLAs and SLOs
Tech Stack:
- Google Cloud Platform (GCP)
- Scripting - Python, PowerShell, Bash.
- CI/CD and automation tools such as Jenkins, Git, GitLab, Ansible, Terraform
- Development languages such as Go, C#, Java, JavaScript or similar.
- Logging services such as Stackdriver, ELK, Datadog, and Splunk.
- Monitoring tools like Stackdriver, New Relic, Graphite, Nagios, and Zabbix.
Experience and duties:
- A track record of successfully monitoring cloud infrastructure and SaaS or PaaS applications
- Strong experience in the administration of IT systems, including compute, network, storage, and access control.
- A recognised record of success in administering cloud infrastructure and deployed applications for enterprise SaaS or PaaS companies in GCP
- SaaS experience, or a background as an SRE, cloud operations engineer or developer
- Strong soft skills for building relationships with different teams, including developers, corporate IT and InfoSec
- Be the Principal/Lead SME in the Cloud Ops space.
- Define and implement the methodology and toolset for fully automated infrastructure management as code.
- Serve as the company's subject matter expert, supporting other Change Healthcare teams on cloud technologies, operations, and DevOps methodology.
If you are a Site Reliability Engineer with a solid grasp of SRE fundamentals and a background in Google Cloud Platform (GCP), and you fancy an exciting, fast-paced project, then I'd love to share more of the details. Apply today to avoid disappointment; this contract role will be snapped up fast, I'm sure!