The Senior DevOps Engineer works with the team to improve the visibility, reliability, and security of the platforms that serve our clients and end-users globally. We are a focused team, so your judgment materially shapes how the platform runs.
The role owns:
• Reliability and uptime of Esyasoft's production estate on AWS and Kubernetes
• CI/CD pipeline ownership, including finalising the move to full automation via GitOps
• Observability, security and compliance posture across the platform
• On-call coverage and incident response as part of a paid weekly rotation
Key Responsibilities & Areas of Ownership
A. Cloud Infrastructure and Platform Operations
• Own day-to-day reliability of Esyasoft's production estate on AWS and Kubernetes, including incident triage and remediation
• Operate and harden the Linux server fleet (Rocky Linux, Amazon Linux) and supporting database layers
• Manage cloud cost and capacity in partnership with engineering — right-size, decommission and forecast
B. CI/CD and Infrastructure as Code
• Finalise the transition to a fully automated CI/CD pipeline (Jenkins to GitOps via FluxCD)
• Maintain and extend Infrastructure as Code using Terraform and Kustomize
• Curate the developer experience so engineers can ship safely, quickly, and with minimal friction
C. Observability, Monitoring and Incident Response
• Evolve the monitoring stack (Prometheus, Grafana) — dashboards, alerts and SLOs that earn their keep
• Lead incident response during your rotation; drive blameless post-mortems and follow-through actions
• Participate in the weekly paid on-call rotation to maintain service levels for Esyasoft's users
D. Security and Compliance
• Maintain Esyasoft's posture against Cyber Essentials, ISO 27001 and PCI-DSS
• Embed secure-by-default patterns in CI/CD, IaC, container images and Kubernetes manifests; partner on access reviews and secrets management
E. Engineering Practice and Team Contribution
• Mentor peers and product engineers on infrastructure, deployment and reliability practice
• Contribute to architectural decisions across the wider engineering team
F. Tooling and Continuous Improvement
• Evaluate emerging tools and patterns responsibly — prove value, then adopt; keep what we have working hard before reaching for the new
• Curate documentation and runbooks so platform knowledge is shared across the team, not siloed
Requirements
Required Skills & Experience
• Proven track record running production workloads on AWS, with hands-on experience operating Kubernetes in production (not just deploying to it)
• Strong foundations in core systems — Linux, HTTP, networking, Bash and a scripting language (Python preferred)
• Practical experience with Infrastructure as Code (Terraform preferred), GitOps tooling and CI/CD pipelines end-to-end
• Comfort participating in an on-call rotation and leading incident response, including blameless post-mortems
Desirable Skills & Experience
• Exposure to compliance frameworks — Cyber Essentials, ISO 27001 or PCI-DSS — in a production context
• Experience with additional cloud providers (DigitalOcean, GCP) and database operations at scale (MySQL)
• PHP exposure (Esyasoft's stack includes PHP services)
Key Attributes
• Pragmatic — picks the smallest change that solves the real problem; values clarity over cleverness
• Reliability-minded — instinctively asks 'what does this look like when it breaks?' before shipping
• Clear communicator — writes runbooks, explains incidents and pushes back without drama
Use of Technologies
Esyasoft operates primarily within the cloud-native ecosystem, and this role requires working as a competent user of:
• AWS, Kubernetes, Docker
• Terraform, Kustomize, FluxCD, Jenkins, Prometheus, Grafana, MySQL
• Python, Bash, PHP — running on Rocky Linux / Amazon Linux
This role may also require flexibility to work with additional cloud providers (DigitalOcean, GCP) and third-party tools as the platform evolves.