Role: Site Reliability Engineer (Azure/.Net)
Location: Crawley, UK
Type: Permanent
Nature: 1 day per week
Main Role-
This is an opportunity for a Site Reliability Engineer to join a growing team where you will support the organisation in maintaining the reliability of our applications. You will influence how the team operates and deliver value as we migrate our applications into Azure and work to improve the overall performance of the IT Function for its customers and employees alike.
Key Responsibilities
Work with key individuals across IT Service, Architecture, Delivery and Change functions, to implement the processes, standards, tooling, automation, and strategic vision of the Site Reliability team.
Manage application-related incidents and work with the wider IT function to investigate, resolve and perform post-mortems for on-premise and cloud-hosted applications.
Monitor and report on appropriate application metrics to proactively resolve issues using modern toolsets. Create automation and pipelines where necessary to reduce toil, and improve availability, reliability, security, and velocity.
Maintain effective feedback loops so that findings can be prioritised and acted upon in a timely fashion.
Follow SRE and DevOps core principles to drive adoption and utilisation.
Where applicable, work with IT Services and Delivery functions to implement technical releases and maintenance plans for production applications, minimising impact on reliability through automation.
Provide technical guidance and support to less experienced members in the IT Function as appropriate and when required.
Act as an on-call point of technical escalation for business-critical applications.
Adhere to Company Risk Management policy and procedures, including reporting of incidents or breaches.
Apply and promote Treating Customers Fairly (TCF) culture and principles, together with the organisation's behaviours and values.
Adhere to Company Diversity & Inclusion policy.
Behaviours Required
Understanding People:
Asking questions
Actively listening
Learning from feedback
Create Simplicity:
Challenging to improve
Working together
Creating straightforward solutions
Keeping our Promises:
Taking ownership to deliver
Setting clear expectations
Respecting others
Skill/ Experience
Strong background in one or more of the following areas:
SRE/application support/IT operations/infrastructure/software development/DevOps.
Experience working within both Agile and ITIL frameworks.
Experience working with DevOps principals and concepts such as CI/CD and IaC.
Experience of SRE environments and processes specifically in the areas of availability, incident management and monitoring.
Excellent analytical and problem-solving skills.
Effective communication skills, both written and verbal.
Ability to work well in high-pressure situations.
Experience using Azure Monitor, App insights and KQL or similar tooling.
Experience writing runbooks, and implementation plans and adopting incident management best practices.
Strong understanding of environment architecture and release management as part of the software development lifecycle.
Experience using SolarWinds or similar tooling.
Knowledge of Scripting languages and Desired state configuration such as Bicep or Terraform, and PowerShell.
Experience using Azure DevOps or similar tooling and has had experience using and writing pipelines.
Knowledge of using and configuring issue management and CMS systems.
Experience in Pensions, Insurance, Banking and related products and services
GCS is acting as an Employment Agency in relation to this vacancy.