- Design, develop, and manage Infrastructure as Code (IaC) using Terraform.
- Build and maintain CI/CD pipelines for seamless and secure deployments.
- Enhance system observability using monitoring APM, logs, and metrics with event correlation.
- Ensure system reliability by proactively identifying and resolving performance and availability issues.
- Manage and optimise containerised environments with Kubernetes, ensuring scalability and high availability.
- Collaborate with development teams to implement SRE best practices
- Implement strategies for Continuous Deployment to minimise release risks.
- Previous experience within the iGaming and Gambling sector.
- Strong experience with AWS or similar cloud platforms.
- Strong expertise in Terraform for Infrastructure as Code (IaC) management.
- Hands-on experience with Kubernetes and Helm for container orchestration.
- Proficiency in observability tools such as Elastic Cloud, Grafana, and Prometheus.
- Experience in building and managing CI/CD pipelines
- Solid knowledge of Linux systems and shell scripting.
- Proficiency in programming languages such as Python, Go, or Java.
- Experience working with SQL and NoSQL databases.
- Background in deploying and maintaining highly available, scalable production environments.
- Experience with advanced deployment strategies such as canary releases or feature flags.
- Knowledge of distributed tracing and event correlation techniques.
- Exposure to DevOps practices applied to reliability engineering.
- Certifications in Cloud Computing, Kubernetes, or DevOps-related fields.