Site Reliability Engineer
Job role
Bristow Holland are proud to be working with a fast-growing tech organisation on an outstanding opportunity for a fully remote Site Reliability Engineer.
In this role, you’ll play a key part in building and maintaining reliable, scalable, and secure cloud infrastructure to support a mission-critical platform running entirely in Azure.
You’ll be embedded in the heart of the engineering function, working closely with developers, operations, and product teams to drive performance, automation, and resilience across all systems.
Responsibilities:
- Leading and driving SRE best practices across a modern Azure-based infrastructure
- Ensuring availability, reliability, performance, and security of key systems and services
- Designing and implementing scalable, resilient infrastructure
- Collaborating closely with software and product teams to enhance operational excellence
- Implementing monitoring, observability, and alerting
- Automating infrastructure and deployments using modern tools and IaC principles
- Contributing to incident response, root cause analysis, and continuous improvement
Key Skills:
- Proven experience in a Site Reliability Engineer or similar role
- Deep hands-on expertise with Azure cloud infrastructure
- Experience with tools like Terraform, Ansible, or scripting in PowerShell/Python/Bash
- Strong knowledge of Kubernetes, Azure App Services, Azure Storage, and networking/security principles
- Familiarity with observability tools like Azure Monitor, Prometheus, Grafana
- Understanding of SRE principles, automation, and CI/CD pipelines
- A collaborative, proactive attitude with great communication skills
Desirable Skills:
- Azure certifications (e.g. Azure Solutions Architect, Azure DevOps Engineer)
- Office 365 / Azure AD administration
- Experience with Docker and Kubernetes
- Knowledge of NoSQL databases
- A degree in Computer Science or similar field
Apply for Site Reliability Engineer now
"*" indicates required fields