Sorry, that job is no longer available. Here are some results that may be similar to the job you were looking for.

32 jobs found

Current Search
aws sre lead engineer
Cambridge University Press & Assessment
Principal Developer Team Lead
Cambridge University Press & Assessment Cambridge, Cambridgeshire
Job Title: Principal Developer Team Lead
Salary: £51,400 - £68,800
Location: Cambridge/Hybrid
Contract: Permanent

This Principal Developer Team Lead position offers a pivotal opportunity to shape the technical future of a world-renowned academic organisation. You'll spearhead the migration of enterprise systems to cutting-edge cloud-native AWS architectures, while balancing hands-on technical leadership with people management responsibilities.

We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge.

About the role

We're seeking a hands-on Principal Developer Team Lead to drive the technical transformation of our Exam Technology Organisation as we migrate legacy enterprise applications to modern, cloud-native architectures on AWS. You'll balance technical leadership with people management, leading a team of 4-8 developers while establishing the foundations for our future technology stack.

Your initial focus will be on two strategic priorities:
  • Evolving our SRE function - Building the DevOps infrastructure, automation, and tooling that enables Site Reliability Engineering practices across development and operations teams.
  • Advancing our AI development practice - Establishing standards, frameworks, and best practices for responsibly integrating AI capabilities into our education platforms.

What You'll Do

Technical Leadership
  • Lead migration of legacy applications to cloud-native AWS architectures
  • Build DevOps automation to support SRE practices
  • Establish AI/ML development standards and frameworks
  • Set observability, monitoring, and incident response standards
  • Promote best practices in web, event-driven, and cloud-native technologies
  • Provide technical expertise and oversee code reviews

People Leadership
  • Manage and mentor a team of 4-8 developers, providing coaching and development plans, and identifying training needs in AI/ML and SRE
  • Support recruitment and foster a culture of continual improvement and wellbeing

Delivery & Collaboration
  • Deliver software in agile squads
  • Collaborate with architects, SREs, product owners, and infrastructure teams
  • Liaise with stakeholders to identify education sector needs
  • Plan and estimate migrations and feature delivery
  • Coordinate with service management, security, and AWS experts

About you

Essential experience
  • Degree or equivalent
  • Proven technical team leadership
  • Skilled in two or more modern programming languages
  • Experience with AWS cloud and infrastructure
  • DevOps skills: automation, CI/CD, infrastructure-as-code
  • Understanding of SRE and observability
  • Experience in web-apps and modern frameworks
  • Strong communicator with technical and non-technical audiences

Technical Expertise
  • CI/CD pipelines, automation frameworks, and developer tooling
  • Observability tools, monitoring, logging, and alerting systems
  • Responsible AI practices and governance
  • Event-driven architecture and microservices patterns
  • Software design patterns and scalability best practices
  • Security principles in cloud environments

Leadership Qualities
  • Ability to set technical standards and provide thought leadership
  • Experience balancing people management with hands-on contribution
  • Strong mentoring and coaching skills
  • Collaborative approach that builds trust across teams
  • Passion for continuous learning in AI/ML and DevOps
  • Promotes inclusion and continuous improvement

You'll be instrumental in our digital transformation, establishing the foundations for reliable, innovative systems that serve millions of learners, teachers, and researchers worldwide. By evolving our SRE function and advancing our AI practice, you'll empower teams to deliver high-performance solutions while responsibly harnessing cutting-edge technologies.

If you would like to know more about this opportunity and what will make you successful, please see the full job description attached to the bottom of this vacancy on our careers site.

Rewards and benefits

We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible rewards package, featuring family-friendly and planet-friendly benefits including:
  • 28 days annual leave plus bank holidays
  • Private medical and Permanent Health Insurance
  • Discretionary annual bonus
  • Group personal pension scheme
  • Life assurance up to 4 x annual salary
  • Green travel schemes

We are a hybrid working organisation, and we offer a range of flexible working options from day one. We expect most hybrid-working colleagues to spend 40-60% of their time at their dedicated office or location. We will also consider other work arrangements if you wish to work more flexibly or require adjustments due to a disability.

Ready to pursue your potential? Apply now.

We review applications on an ongoing basis, with a closing date for all applications being 16th April 2026.

As part of the application process you can expect:
  • Two questions to select one answer from multiple options.
  • A 15-minute screening call with the Hiring Manager.
  • First stage interview via MS Teams or in person. You will be provided with a brief to complete a role related task which will need to be returned by email in advance of your interview.

Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry.

Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the gov.uk website for guidance to understand your own eligibility based on the role you are applying for.

Why join us

Joining us is your opportunity to pursue potential. You'll belong to a collaborative team that's exploring new and better ways to serve students, teachers and researchers across the globe - for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration.

Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it's safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background. We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities.
02/04/2026
Full time
Tiro Partners Limited
Lead Platform Engineer
Tiro Partners Limited
Lead Platform Engineer
£100,000 - £120,000
Hybrid (London)

The Mission
  • Cut dev-to-production lead time with seamless CI/CD and preview environments
  • Design "Golden Path" workflows so engineers ship features, not config files
  • Auto-scale GCP infrastructure to handle Black Friday and January travel peaks
  • Champion SRE practices - define SLOs and error budgets for the booking flow
  • Architect data-sync pipelines with GDPR-compliant PII masking for dev environments
  • Operationalise Vertex AI and Redis to power personalised search and dynamic pricing

The Tech Stack

We'll consider strong candidates from AWS or Azure backgrounds - GCP is where you'll operate.

The must-haves are: Kubernetes (GKE) Docker Terraform PHP (Symfony/Drupal) JavaScript Go Cloud SQL/MySQL Redis Nginx Cloud Load Balancing Varnish CDNs Cloudflare

Bonus: GCP, BigQuery, Apigee, Vertex AI

About You
  • Proven leadership of Platform, DevOps, or SRE teams in high-volume e-commerce, travel, or fintech
  • Deep Kubernetes and container orchestration experience with PHP/React workloads
  • Strong cloud fundamentals (IAM, VPCs, networking) - transferable from AWS or Azure to GCP
  • Scripting fluency in Bash, Python, or Go
  • Experience managing high-traffic SQL databases and Redis caching strategies
  • A track record of data anonymisation workflows and dev-environment seeding

What We Offer
  • £100,000 - £120,000 salary
  • Hybrid working (minimum 3 days in our London office) with flexible hours
  • Enhanced pension plan & life and critical illness insurance
  • Medicash health plan
  • Season ticket loan & cycle scheme
  • Supplier social events
  • Dog-friendly office

Interested? Apply here. We'd love to hear from you.
02/04/2026
Full time
IntaPeople
Senior Site Reliability Engineer
IntaPeople Nottingham, Nottinghamshire
We are partnering with a leading organisation in the data and analytics space to recruit an experienced Senior Site Reliability Engineer. This is an opportunity to join a highly collaborative, technically strong SRE function working on large-scale, cloud-native platforms that support high-volume, high-speed data services.

The team is expanding due to increased workload, and this role will become the eighth member of an established, supportive engineering group. You'll play a key part in driving cloud automation, improving system reliability, and supporting critical production environments.

Key Responsibilities
  • Build, maintain, and improve AWS cloud infrastructure
  • Develop automation using Terraform, Ansible, and Python
  • Support incident response and troubleshoot performance issues
  • Deliver routine maintenance, including patching and upgrades
  • Enhance CI/CD pipelines (GitLab CI, GitHub CI)
  • Contribute to Agile ceremonies and take ownership of user stories
  • Implement new technologies and solutions to improve system reliability

What You Will Bring
  • Strong commercial experience with AWS (essential)
  • Solid understanding of Linux systems (RHEL, CentOS or similar)
  • Scripting skills, ideally Python
  • Hands-on experience with Terraform and/or Ansible
  • Proficiency with Docker
  • Exposure to CI/CD tooling and Agile ways of working
  • Background in software engineering, systems engineering, or previous SRE roles
  • Minimum 4 years' experience in a relevant technical discipline

Please note, this role is not suitable for candidates with Windows-only experience or engineers without hands-on AWS or Linux exposure.

Remote working is supported, with an on-site presence in Nottingham preferred, ideally once per week.
01/04/2026
Contractor
Zellis
Site Reliability Engineer (CloudOps)
Zellis Swinton, Manchester
About the role

The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation's growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments. This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale.

In this position you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, the SRE will identify priorities to remove friction from engineering teams, streamline processes, and enhance operational excellence. This role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions.

Key Responsibilities
  • Reliability, Performance & Security: Design and implement strategies to improve system reliability, availability, and security. Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment. Conduct regular security reviews and collaborate with security teams to address vulnerabilities.
  • CI/CD Management: Own and optimise Continuous Integration and Continuous Deployment pipelines. Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows. Ensure secure, efficient, and automated deployment processes across environments.
  • Monitoring & Observability: Implement and maintain monitoring solutions for infrastructure and applications. Develop dashboards and alerting systems to ensure proactive incident and security event management. Evaluate and integrate new observability tools as needed.
  • Automation & Tooling: Automate repetitive tasks to improve efficiency and reduce human error. Build and maintain internal tools that support engineering productivity and security compliance. Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates.
  • Cloud Infrastructure Management: Manage and optimise services across AWS and Azure environments. Ensure scalability, resilience, and security of service-based architectures. Implement cost management strategies to optimise cloud spend without compromising performance or security.
  • Incident Response & Root Cause Analysis: Lead incident response efforts, including security incidents, and conduct post-mortem reviews. Drive continuous improvement through lessons learned and preventive measures.

Skills & experience
  • Proven experience in AWS and Azure cloud environments.
  • Strong background in CI/CD tools (e.g., Azure DevOps, Pipelines, GitHub Actions, Jenkins).
  • Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog).
  • Proficiency in scripting and automation (Python, Bash, PowerShell).
  • Familiarity with containerisation and orchestration (Docker, Kubernetes).
  • Solid understanding of networking, security, and cost optimisation in cloud environments.
  • Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks.
  • A problem-solver with a proactive mindset.
  • Comfortable working in fast-paced, evolving environments.
  • Strong communicator who can bridge gaps between operations, development, and security teams.
  • Passionate about automation, scalability, cost efficiency, and security.

Benefits & culture

Part of the Zellis Group, Moorepay is a team of over 500 friendly professionals across four offices in Swinton (Manchester), Sheffield, Birmingham and Kochi (India). We're passionate about making Moorepay a fantastic place to work for every single one of our colleagues. The average length of service at Moorepay is 12 years, which speaks for itself.

To help make Moorepay such a great place to work, we focus on three things in our company culture: mental health support, maintaining a healthy work/life balance, and equal opportunities and inclusion for all.

Here's what you'll gain if you join our team:
  • A career packed with opportunity, in a stable and growing company.
  • A comprehensive programme of learning and development.
  • Competitive base salary.
  • 25 days annual leave, with the opportunity to buy more. You'll even get your birthday off as well!
  • Private medical insurance.
  • Life assurance 4x salary.
  • Enhanced pension with up to 8.5% employer contributions.
  • A huge range of additional flexible benefits across financial & personal wellbeing, lifestyle & leisure.
01/04/2026
Full time
DWP
Backup and Recovery Engineer
DWP Leeds, Yorkshire
Backup and Recovery Engineer
Pay up to £52,442 plus 28.97% employer pension contributions, hybrid working, flexible hours, and great work-life balance.
DWP. Digital with Purpose.
We are looking for an outstanding Backup and Recovery Engineer to join our community of tech experts in DWP Digital, to assist in the design of Infrastructure services in collaboration with Architecture and Engineering principles. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person in the UK, every day and at key moments in their lives.
DWP is the UK's largest government department. We help people into work and make payments worth over £195bn a year to support and empower millions of people. The scale of what we do is extraordinary, and our purpose is unique. We'd love you to join us.
What skills, knowledge and experience will you need?
  • Hands-on experience in implementation, migration, operations and support of backup applications.
  • Demonstrable use of Rubrik backup solutions across multi-cloud (AWS, Azure, OCI, GCP) and on-premises infrastructure, and experience of integrating Rubrik with external platforms (e.g. ServiceNow, Splunk, vSphere, Azure AD) using REST APIs and automation tools (Ansible).
  • Proven ability to manage cloud compute, storage, and configurations, ensuring solutions are repeatable, scalable, resilient, and highly available.
  • Experience and knowledge of service management frameworks (ITIL: Incident, Problem, Change and SLAs).
  • Demonstrable experience of producing and rapidly delivering minimum viable solutions; results-focused, with the ability to prioritise the most impactful work.
  • Working experience of regulatory frameworks such as GDPR and DORA, and their organisational impact.
  • Support audit and regulatory compliance efforts related to data protection.
You and your role
A day as a Backup and Recovery Engineer is all about keeping the organisation's data safe and recoverable, whether it lives in the cloud or on-prem. You'll spend your time making sure backup systems are running smoothly, spotting issues before they become problems and jumping in to fix things fast when they do. You'll work closely with Architects, SREs, Delivery Managers and Product Managers, so there's plenty of collaboration as you help shape and run the backup services everyone relies on.
Some days you'll be involved in designing or improving how we back things up across public cloud and traditional infrastructure. Other days are more hands-on: checking alerts, dealing with backup failures, looking into suspicious activity or helping colleagues understand how the systems work. When major incidents happen, you'll take the lead in getting everything back to a healthy state, working with security teams if there's anything unusual going on, like potential ransomware or unauthorised access. You'll move between Linux and Windows environments depending on what needs doing. Overall, it's a mix of problem solving, teamwork, technical know-how and keeping the department's data safe and recoverable every single day.
Details. Wages. Perks.
  • Location: You'll join us in one of our brilliant digital hubs in Blackpool, Leeds, Manchester or Newcastle, whichever is more convenient for you.
  • Hybrid Working: We work a hybrid model. You'll spend some time working at home and some time collaborating face to face in a hub.
  • Pay: We offer competitive pay of up to £52,442.
  • Pension: You'll get a brilliant civil service pension with employer contributions of 28.97%, worth over £12,000 per year.
  • Holidays: A generous leave package starting at 26 days, rising to 31 days over time. You can also take up to 3 extra days off a month on flexitime. You'll also get all the usual public holidays.
We have a broad benefits package built around your work-life balance, which includes:
  • Flexible working, including flexible hours and flex-friendly policies
  • Time off for volunteering and charitable giving
  • Bring your authentic self to work with 'I Can Be Me in DWP'
  • Discounts and savings on shopping, fun days out and more
  • Interest-free loans to buy a bike or a season ticket, so it's even easier for you to get to work and start making a difference
  • Professional development, coaching, mentoring and career progression opportunities
And we have an award-winning environment and culture. DWP have been recognised as:
  • 2024 Diversity Employer of the Year at the Computing Women in Tech Excellence awards
  • Diverse and Inclusive Leadership at Digital Leaders Awards 2024
  • Commended as Best Place to Work in Digital category in the Computing Digital Technology Leaders awards 2025
  • Recognised as one of the Best Public Sector Employers at 2025 Women In Tech Employer Awards
Process
We know your time is valuable, so our application and selection process has just two stages:
  • Apply: complete your application on Civil Service Jobs. There'll be full instructions when you click through.
  • Interview: a single-stage interview online.
CLICK APPLY for more information and to start your application.
01/04/2026
Full time
DWP
Backup and Recovery Engineer
DWP
Backup and Recovery Engineer
Pay up to £52,442 plus 28.97% employer pension contributions, hybrid working, flexible hours, and great work-life balance.
DWP. Digital with Purpose.
We are looking for an outstanding Backup and Recovery Engineer to join our community of tech experts in DWP Digital, to assist in the design of Infrastructure services in collaboration with Architecture and Engineering principles. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person in the UK, every day and at key moments in their lives.
DWP is the UK's largest government department. We help people into work and make payments worth over £195bn a year to support and empower millions of people. The scale of what we do is extraordinary, and our purpose is unique. We'd love you to join us.
What skills, knowledge and experience will you need?
  • Hands-on experience in implementation, migration, operations and support of backup applications.
  • Demonstrable use of Rubrik backup solutions across multi-cloud (AWS, Azure, OCI, GCP) and on-premises infrastructure, and experience of integrating Rubrik with external platforms (e.g. ServiceNow, Splunk, vSphere, Azure AD) using REST APIs and automation tools (Ansible).
  • Proven ability to manage cloud compute, storage, and configurations, ensuring solutions are repeatable, scalable, resilient, and highly available.
  • Experience and knowledge of service management frameworks (ITIL: Incident, Problem, Change and SLAs).
  • Demonstrable experience of producing and rapidly delivering minimum viable solutions; results-focused, with the ability to prioritise the most impactful work.
  • Working experience of regulatory frameworks such as GDPR and DORA, and their organisational impact.
  • Support audit and regulatory compliance efforts related to data protection.
You and your role
A day as a Backup and Recovery Engineer is all about keeping the organisation's data safe and recoverable, whether it lives in the cloud or on-prem. You'll spend your time making sure backup systems are running smoothly, spotting issues before they become problems and jumping in to fix things fast when they do. You'll work closely with Architects, SREs, Delivery Managers and Product Managers, so there's plenty of collaboration as you help shape and run the backup services everyone relies on.
Some days you'll be involved in designing or improving how we back things up across public cloud and traditional infrastructure. Other days are more hands-on: checking alerts, dealing with backup failures, looking into suspicious activity or helping colleagues understand how the systems work. When major incidents happen, you'll take the lead in getting everything back to a healthy state, working with security teams if there's anything unusual going on, like potential ransomware or unauthorised access. You'll move between Linux and Windows environments depending on what needs doing. Overall, it's a mix of problem solving, teamwork, technical know-how and keeping the department's data safe and recoverable every single day.
Details. Wages. Perks.
  • Location: You'll join us in one of our brilliant digital hubs in Blackpool, Leeds, Manchester or Newcastle, whichever is more convenient for you.
  • Hybrid Working: We work a hybrid model. You'll spend some time working at home and some time collaborating face to face in a hub.
  • Pay: We offer competitive pay of up to £52,442.
  • Pension: You'll get a brilliant civil service pension with employer contributions of 28.97%, worth over £12,000 per year.
  • Holidays: A generous leave package starting at 26 days, rising to 31 days over time. You can also take up to 3 extra days off a month on flexitime. You'll also get all the usual public holidays.
We have a broad benefits package built around your work-life balance, which includes:
  • Flexible working, including flexible hours and flex-friendly policies
  • Time off for volunteering and charitable giving
  • Bring your authentic self to work with 'I Can Be Me in DWP'
  • Discounts and savings on shopping, fun days out and more
  • Interest-free loans to buy a bike or a season ticket, so it's even easier for you to get to work and start making a difference
  • Professional development, coaching, mentoring and career progression opportunities
And we have an award-winning environment and culture. DWP have been recognised as:
  • 2024 Diversity Employer of the Year at the Computing Women in Tech Excellence awards
  • Diverse and Inclusive Leadership at Digital Leaders Awards 2024
  • Commended as Best Place to Work in Digital category in the Computing Digital Technology Leaders awards 2025
  • Recognised as one of the Best Public Sector Employers at 2025 Women In Tech Employer Awards
Process
We know your time is valuable, so our application and selection process has just two stages:
  • Apply: complete your application on Civil Service Jobs. There'll be full instructions when you click through.
  • Interview: a single-stage interview online.
CLICK APPLY for more information and to start your application.
01/04/2026
Full time
DWP
Backup and Recovery Engineer
DWP Blackpool, Lancashire
Backup and Recovery Engineer
Pay up to £52,442 plus 28.97% employer pension contributions, hybrid working, flexible hours, and great work-life balance.
DWP. Digital with Purpose.
We are looking for an outstanding Backup and Recovery Engineer to join our community of tech experts in DWP Digital, to assist in the design of Infrastructure services in collaboration with Architecture and Engineering principles. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person in the UK, every day and at key moments in their lives.
DWP is the UK's largest government department. We help people into work and make payments worth over £195bn a year to support and empower millions of people. The scale of what we do is extraordinary, and our purpose is unique. We'd love you to join us.
What skills, knowledge and experience will you need?
  • Hands-on experience in implementation, migration, operations and support of backup applications.
  • Demonstrable use of Rubrik backup solutions across multi-cloud (AWS, Azure, OCI, GCP) and on-premises infrastructure, and experience of integrating Rubrik with external platforms (e.g. ServiceNow, Splunk, vSphere, Azure AD) using REST APIs and automation tools (Ansible).
  • Proven ability to manage cloud compute, storage, and configurations, ensuring solutions are repeatable, scalable, resilient, and highly available.
  • Experience and knowledge of service management frameworks (ITIL: Incident, Problem, Change and SLAs).
  • Demonstrable experience of producing and rapidly delivering minimum viable solutions; results-focused, with the ability to prioritise the most impactful work.
  • Working experience of regulatory frameworks such as GDPR and DORA, and their organisational impact.
  • Support audit and regulatory compliance efforts related to data protection.
You and your role
A day as a Backup and Recovery Engineer is all about keeping the organisation's data safe and recoverable, whether it lives in the cloud or on-prem. You'll spend your time making sure backup systems are running smoothly, spotting issues before they become problems and jumping in to fix things fast when they do. You'll work closely with Architects, SREs, Delivery Managers and Product Managers, so there's plenty of collaboration as you help shape and run the backup services everyone relies on.
Some days you'll be involved in designing or improving how we back things up across public cloud and traditional infrastructure. Other days are more hands-on: checking alerts, dealing with backup failures, looking into suspicious activity or helping colleagues understand how the systems work. When major incidents happen, you'll take the lead in getting everything back to a healthy state, working with security teams if there's anything unusual going on, like potential ransomware or unauthorised access. You'll move between Linux and Windows environments depending on what needs doing. Overall, it's a mix of problem solving, teamwork, technical know-how and keeping the department's data safe and recoverable every single day.
Details. Wages. Perks.
  • Location: You'll join us in one of our brilliant digital hubs in Blackpool, Leeds, Manchester or Newcastle, whichever is more convenient for you.
  • Hybrid Working: We work a hybrid model. You'll spend some time working at home and some time collaborating face to face in a hub.
  • Pay: We offer competitive pay of up to £52,442.
  • Pension: You'll get a brilliant civil service pension with employer contributions of 28.97%, worth over £12,000 per year.
  • Holidays: A generous leave package starting at 26 days, rising to 31 days over time. You can also take up to 3 extra days off a month on flexitime. You'll also get all the usual public holidays.
We have a broad benefits package built around your work-life balance, which includes:
  • Flexible working, including flexible hours and flex-friendly policies
  • Time off for volunteering and charitable giving
  • Bring your authentic self to work with 'I Can Be Me in DWP'
  • Discounts and savings on shopping, fun days out and more
  • Interest-free loans to buy a bike or a season ticket, so it's even easier for you to get to work and start making a difference
  • Professional development, coaching, mentoring and career progression opportunities
And we have an award-winning environment and culture. DWP have been recognised as:
  • 2024 Diversity Employer of the Year at the Computing Women in Tech Excellence awards
  • Diverse and Inclusive Leadership at Digital Leaders Awards 2024
  • Commended as Best Place to Work in Digital category in the Computing Digital Technology Leaders awards 2025
  • Recognised as one of the Best Public Sector Employers at 2025 Women In Tech Employer Awards
Process
We know your time is valuable, so our application and selection process has just two stages:
  • Apply: complete your application on Civil Service Jobs. There'll be full instructions when you click through.
  • Interview: a single-stage interview online.
CLICK APPLY for more information and to start your application.
01/04/2026
Full time
DWP
Backup and Recovery Engineer
DWP Newcastle Upon Tyne, Tyne And Wear
Backup and Recovery Engineer Pay up to £52,442 plus 28.97% employer pension contributions, hybrid working, flexible hours, and great work life balance. DWP. Digital with Purpose. We are looking for an outstanding Backup and Recovery Engineer to join our community of tech experts in DWP Digital, to assist in the design of Infrastructure services in collaboration with Architecture and Engineering principles. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person in the UK, every day and at key moments in their lives. DWP is the UK's largest government department. We help people into work and make payments worth over £195bn a year to support and empower millions of people. The scale of what we do is extraordinary, and our purpose is unique. We'd love you to join us. What skills, knowledge and experience will you need? Hands on experience in Implementation, Migration, Operations and Support of Backup applications. Demonstrable use of Rubrik backup solutions across multi-cloud (AWS, Azure, OCI GCP) and on-premises infrastructure and experience of Integration of Rubrik with external platforms (e.g. ServiceNow, Splunk, vSphere, Azure AD) using REST APIs and automation tools (Ansible) Proven ability of managing Cloud compute, storage, and configurations, ensuring solutions are repeatable, scalable, resilient, and highly available Experience and knowledge of service management frameworks (ITIL - Incident, Problem, Change and SLAs). Demonstrable experience of producing and rapidly delivering minimum viable solutions, results focused with ability to prioritize the most impactful work. Working experience of regulatory frameworks such as GDPR, DORA and their organizational impact. Support audit and regulatory compliance efforts related to data protection. 
You and your role A day as a Backup and Recovery Engineer is all about keeping the organisation's data safe and recoverable, whether it lives in the cloud or on prem. You'll spend your time making sure backup systems are running smoothly, spotting issues before they become problems and jumping in to fix things fast when they do. You'll work closely with Architects, SREs, Delivery Managers and Product Managers, so there's plenty of collaboration as you help shape and run the backup services everyone relies on. Some days you'll be involved in designing or improving how we back things up across public cloud and traditional infrastructure. Other days are more hands on, checking alerts, dealing with backup failures, looking into suspicious activity or helping colleagues understand how the systems work. When major incidents happen, you take the lead in getting everything back to a healthy state, working with security teams if there's anything unusual going on, like potential ransomware or unauthorised access. You'll move between Linux and Windows environments depending on what needs doing. Overall, it's a mix of problem solving, teamwork, technical know how and keeping the department's data safe and recoverable every single day. Details. Wages. Perks. Location: You'll join us in one of our brilliant digital hubs in Blackpool, Leeds, Manchester or Newcastle, whichever is more convenient for you. Hybrid Working: We work a hybrid model - you'll spend some time working at home and some time collaborating face to face in a hub. Pay: We offer competitive pay of up to £ 52,442 Pension: You'll get a brilliant civil service pension with employer contributions worth 28.97% , worth over £12,000 per year. Holidays: A generous leave package starting at 26 days rising to 31 days over time. You can also take up to 3 extra days off a month on flexitime. You'll also get all the usual public holidays. 
We have a broad benefits package built around your work-life balance, which includes: Flexible working including flexible hours and flex-friendly policies; Time off volunteering and charitable giving; Bring your authentic self to work with 'I Can Be Me in DWP'; Discounts and savings on shopping, fun days out and more; Interest-free loans to buy a bike or a season ticket, so it's even easier for you to get to work and start making a difference; Professional development, coaching, mentoring and career progression opportunities. And we have an award-winning environment and culture: DWP have been recognised as 2024 Diversity Employer of the Year at the Computing Women in Tech Excellence awards; Diverse and Inclusive Leadership at Digital Leaders Awards 2024; Commended as Best Place to Work in Digital category in the Computing Digital Technology Leaders awards 2025; Recognised as one of the Best Public Sector Employers at 2025 Women In Tech Employer Awards. Process: We know your time is valuable, so our application and selection process is just two stages: Apply: complete your application on Civil Service Jobs. There'll be full instructions when you click through. Interview: a single stage interview online. CLICK APPLY for more information and to start your application.
01/04/2026
Full time
Wiz Admin
Infoplus Technologies UK Ltd
Wiz Admin JD: Role Purpose The Cloud Security (Wiz Admin) is responsible for administering, operating, and optimising Aviva's Wiz Cloud Security Posture Management (CSPM/CNAPP) platform. This role ensures continuous visibility, governance, and risk reduction across Aviva's multi-cloud environments (AWS, Azure, GCP). The administrator will drive operational excellence, support engineering teams, integrate Wiz into enterprise tooling, and maintain policy compliance and posture improvement. Key Responsibilities Platform Administration & Operations Own day-to-day administration of the Wiz platform across all cloud environments. Maintain Wiz connectors, least-privilege roles, integration points, and scanning configurations. Ensure onboarding/offboarding of cloud accounts, subscriptions, and K8s clusters. Monitor platform health, ingestion coverage, API integrations, and license utilisation. Cloud Posture Management Review, tune, and maintain security policies, controls, and baselines (eg, CIS, NIST, ISO). Validate and enhance attack path analysis, identity risk detection, and data exposure mapping. Prioritise findings using impact-based and exploit-path-based logic. Partner with Cloud Platform teams to ensure guardrails remain aligned with Wiz detections. Shift-Left Enablement Work with DevOps/SRE teams to embed Wiz in CI/CD pipelines for IaC scanning. Run onboarding sessions for teams on using Wiz Issues, Projects, and Policy-as-Code. Validate false positives/negatives and fine-tune policy gates for Terraform, ARM/Bicep, and CloudFormation. Incident & Risk Handling Support Cloud Security, SOC, and IR teams during investigations involving publicly exposed, exploitable, or high-risk cloud assets. Provide expert analysis on Wiz findings and attack paths; propose remediation and compensating controls. Contribute to post-incident reviews, root-cause analysis, and long-term posture improvements. 
Integrations & Automation Maintain integrations with Jira/ADO, SIEM/SOAR, Slack/Teams, and CMDB/GRC. Automate workflows for enrichment, prioritisation, ticketing, and reporting. Partner with Engineering to build auto-remediation playbooks for safe-to-fix classes (eg, public S3, permissive IAM). Governance, Reporting & Compliance Produce monthly security posture reports for leadership and Risk/Compliance teams. Track KPIs (coverage, MTTR, SLA adherence, risk trends). Support external and internal audit requests using Wiz's evidence and compliance modules. Manage exceptions/waivers and ensure they are reviewed and retired on schedule. Core Technical Skills Strong understanding of AWS, Azure, and GCP security controls and architecture. Hands-on experience with cloud IAM, network security, logging/monitoring, and workload security. Familiarity with Kubernetes security and container image scanning. Experience operating cloud security platforms (Wiz preferred; alternatives: Prisma, Lacework, Defender for Cloud). Working knowledge of Infrastructure-as-Code (Terraform strongly preferred). Understanding of identity and entitlements management (CIEM). Ability to analyse cloud attack paths and map misconfigurations to real exploitable risk. Nice-to-Have Skills Experience integrating security tools into CI/CD pipelines (Azure DevOps, GitHub, GitLab). Knowledge of SAST/DAST/Secret scanning tools. Exposure to SRE or Cloud Platform engineering. Soft Skills Strong communication skills, able to simplify complex findings for engineering teams. Problem-solving mindset with a bias for automation and scalability. Ability to work cross-functionally with Security, Cloud Platform, DevOps, Risk, and Audit. Comfortable with influencing teams without formal authority.
01/04/2026
Contractor
83Zero Ltd
Senior Site Reliability Engineer
83Zero Ltd Wokingham, Berkshire
Senior Site Reliability Engineer - Active SC Required! Up to £75,000 + benefits Wokingham - Hybrid (UK-based) We're seeking a Senior Site Reliability Engineer to play a key role in designing and operating highly reliable, scalable systems in a fast-paced environment. You'll act as a technical leader within the team, driving best practices across reliability engineering, automation, and system performance. What you'll be doing: Designing and improving system reliability, scalability, and observability Leading incident management and driving root cause analysis Building and maintaining robust CI/CD pipelines and automation frameworks Partnering with development teams to embed SRE principles into the SDLC Mentoring junior engineers and promoting engineering best practices What we're looking for: Strong experience in SRE, DevOps, or platform engineering roles Deep understanding of cloud infrastructure (AWS, Azure, or GCP) Hands-on experience with Kubernetes and containerised environments Strong scripting/programming skills (Python, Go, or similar) Experience with monitoring, alerting, and observability tooling Proven ability to troubleshoot complex distributed systems Why apply? Opportunity to influence technical direction and best practices Work on large-scale, mission-critical systems Leadership exposure with clear progression to principal level
01/04/2026
Full time
Morson Edge
IT Manager
Morson Edge Manchester, Lancashire
IT Manager (CDN, AWS & SRE Focus) Manchester (Hybrid - 2 days in office) Up to £80,000 + Benefits Permanent, Full-Time The Opportunity Morson Edge are looking for an experienced IT Manager to lead and evolve a high-performing infrastructure and reliability function. This is a key leadership role where you'll shape strategy, improve system resilience, and drive best practices across CDN, AWS cloud environments, and Site Reliability Engineering (SRE). You'll work at the intersection of infrastructure, performance, and reliability, ensuring systems are scalable, secure, and always available. What You'll Be Doing Lead, mentor, and develop a team of engineers across cloud infrastructure and SRE Own and optimise AWS environments, ensuring scalability, cost-efficiency, and security Manage and enhance CDN performance and delivery strategies Drive adoption of SRE principles including SLIs, SLOs, and error budgets Improve system observability through monitoring, logging, and alerting Collaborate with engineering and product teams to support high-availability services Oversee incident management, root cause analysis, and continuous improvement Define and implement infrastructure best practices and automation What We're Looking For Proven experience in an IT Manager/Infrastructure Manager/SRE Lead role Strong expertise in AWS (EC2, Lambda, CloudFront, VPC, etc.) 
Solid understanding of Content Delivery Networks (CDN) and performance optimisation Experience implementing or working within SRE frameworks Knowledge of Infrastructure as Code (eg, Terraform, CloudFormation) Strong background in monitoring tools (eg, Prometheus, Grafana, Datadog) Excellent leadership and stakeholder management skills Nice to Have Experience with containerisation (Docker, Kubernetes) Exposure to DevOps culture and CI/CD pipelines Security and compliance awareness in cloud environments What's in It for You Salary up to £80,000 Hybrid working (2 days per week in Manchester office) Pension scheme Training and development opportunities A chance to shape and lead a modern, cloud-first infrastructure function
01/04/2026
Full time
Experis IT
Site Reliability Engineer (SRE) Engineer
Experis IT Workington, Cumbria
Site Reliability Engineer (SRE) Engineer Rate: up to £550 per day - Umbrella only Clearance Required: SC Duration: 7 months Location: Wokingham (with 2 days/week in office) SC Clearance: active Key Responsibilities Lead and drive platform-first initiatives to improve scalability, reliability, and performance. Design, build, and maintain resilient infrastructure supporting distributed systems. Implement monitoring and alerting systems to ensure high availability and performance. Collaborate with engineering teams to enhance system reliability and mitigate risks. Develop and maintain CI/CD pipelines for seamless deployment and release management. Continuously evaluate and recommend improvements to platform infrastructure and processes. Ensure compliance with security standards, governance policies, and regulatory requirements. Key Skills Proven expertise in software development and engineering for large-scale distributed systems. Strong proficiency in programming languages such as Golang, Java, or Python. Extensive experience with cloud infrastructure providers (AWS, Azure, or GCP). Deep knowledge of container orchestration platforms like Kubernetes. Exceptional problem-solving skills and a passion for building scalable, secure solutions. Excellent communication skills to collaborate with cross-functional teams. All profiles will be reviewed against the required skills and experience. Due to the high number of applications we will only be able to respond to successful applicants in the first instance. We thank you for your interest and the time taken to apply!
01/04/2026
Contractor
Moorepay
Site Reliability Engineer (CloudOps)
Moorepay Manchester, Lancashire
The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation's growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments. This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale. In this position you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, the SRE will identify priorities to remove friction from engineering teams, streamline processes, and enhance operational excellence. This role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions. Key Responsibilities Reliability, Performance & Security: Design and implement strategies to improve system reliability, availability, and security. Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment. Conduct regular security reviews and collaborate with security teams to address vulnerabilities. CI/CD Management: Own and optimise Continuous Integration and Continuous Deployment pipelines. 
Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows. Ensure secure, efficient, and automated deployment processes across environments. Monitoring & Observability: Implement and maintain monitoring solutions for infrastructure and applications. Develop dashboards and alerting systems to ensure proactive incident and security event management. Evaluate and integrate new observability tools as needed. Automation & Tooling: Automate repetitive tasks to improve efficiency and reduce human error. Build and maintain internal tools that support engineering productivity and security compliance. Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates. Cloud Infrastructure Management: Manage and optimise services across AWS and Azure environments. Ensure scalability, resilience, and security of service-based architectures. Implement cost management strategies to optimise cloud spend without compromising performance or security. Incident Response & Root Cause Analysis: Lead incident response efforts, including security incidents, and conduct post-mortem reviews. Drive continuous improvement through lessons learned and preventive measures. Skills & Experience Proven experience in AWS and Azure cloud environments. Strong background in CI/CD tools (e.g., Azure DevOps, Pipelines, GitHub Actions, Jenkins). Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog). Proficiency in scripting and automation (Python, Bash, PowerShell). Familiarity with containerisation and orchestration (Docker, Kubernetes). Solid understanding of networking, security, and cost optimisation in cloud environments. Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks. A problem-solver with a proactive mindset. Comfortable working in fast-paced, evolving environments. Strong communicator who can bridge gaps between operations, development, and security teams. 
Passionate about automation, scalability, cost efficiency, and security.
01/04/2026
Full time
Moorepay
Principal Software Solutions Architect
Moorepay Manchester, Lancashire
Moorepay is transforming. We are a trusted leader in UK Payroll and HR solutions, but we aren't resting on our history. We are embarking on a major digital transformation to redefine how businesses manage their most important asset: their people. As the Principal Software Solutions Architect, you'll be the technical authority responsible for defining, governing, and evolving the end-to-end architecture of our "AI First" platform, ensuring architectural consistency, secure-by-design principles, and long-term scalability across all engineering squads. Working closely with the Engineering Manager, Cloud & Platform Engineering Lead, and Product leadership, this role shapes our architectural strategy, drives technical excellence, and provides deep guidance to multiple autonomous squads as we scale towards high-performing, cloud-native teams. The Architect balances hands-on solution design, strategic planning, technical oversight, and stakeholder collaboration to keep the platform robust, secure, and ready for future growth. This role defines the architectural backbone that enables the entire engineering organisation to scale effectively. As we transition to multiple autonomous squads, you will ensure our systems remain leading-edge, secure, resilient, and consistent, enabling rapid product delivery while maintaining high standards of engineering excellence. You will leave an enduring impact on the platform's foundations, influencing everything from service boundaries to reliability strategies and cloud platform design. This is a full-time, permanent role working on a hybrid basis with 3 days per week in Manchester. Key Responsibilities: Team Leadership & Scaling Define and maintain the technical architecture vision and roadmap across all squads. Ensure alignment of architecture with business goals, engineering strategy, and long-term scalability. Drive system-wide architectural decisions, providing clear technical direction for squads. 
Evaluate emerging technologies and propose solutions that improve scalability, performance, and developer productivity. Mentor senior engineers and influence technical leaders across the organisation. Secure-by-Design & Compliance Embed secure-by-design principles into architectural decisions. Ensure threat modelling is performed for new features and major changes. Champion secure coding standards and integration of security testing into the delivery pipeline. Collaborate with security and compliance stakeholders to ensure solutions meet regulatory and governance requirements. Promote design patterns that minimise risk across distributed systems. Solution Design & Governance Own the end-to-end architectural design for major platform components and new product capabilities, with a focus on AI First. Work closely with Engineering Manager and Engineering Team Leads to ensure solutions are consistent, secure, and scalable. Lead architecture reviews and ensure adherence to design standards, technical patterns, and best practices. Produce solution blueprints, reference architectures, and technical documentation. Validate that all solutions support operational excellence, reliability, and maintainability. Cloud, Infrastructure, and Platform Architecture Define scalable service-based architectures leveraging cloud-native patterns. Work with the Lead SRE to ensure architectural designs account for: Observability (metrics, logs, tracing) Reliability (SLIs, SLOs, failover) CI/CD automation Infrastructure as code and environment design Drive optimisation of compute, storage, and network resources across cloud platforms (Azure/AWS). Engineering Collaboration & Technical Enablement Partner with Engineering Manager to ensure squads have clear architectural guidance. Support teams in breaking down complex technical problems into executable, scalable solutions. Provide architectural input into backlog refinement, release planning, and prioritisation. 
Act as the primary facilitator for cross-team architectural decision-making. Communicate architectural decisions, trade-offs, and risks to both technical and non-technical stakeholders. Continuous Improvement & Technology Standards Define and maintain engineering standards, reusable patterns, and architectural principles. Champion continuous improvement across code quality, security, performance, and operational readiness. Foster a culture of technical excellence, experimentation, and innovation. Skills & Experience Essential: Proven experience as a Principal Architect, Solutions Architect, or Senior Engineer leading architectural decisions in complex systems. Strong understanding of AI technologies such as agents and models for both accelerated design & delivery as well as delivery of product capabilities. Strong background in cloud-native architectures (microservices, event-driven, distributed systems). Deep understanding of secure-by-design principles, threat modelling, cryptography basics, and modern security practices. Experience with API design, integration patterns, and domain-driven design (DDD) and Event Driven Design. Ability to influence without authority and collaborate effectively across engineering, SRE, product, and leadership teams. Exceptional communication skills, capable of simplifying complex technical topics for diverse stakeholders. Extensive experience with modern programming platforms and frameworks (e.g., Node.js, C# .NET, React). Strong grounding in cloud platforms (AWS/Azure), including networking, identity, observability, and cost optimisation. Desirable: Experience designing solutions in regulated or compliance-driven industries. Background in DevOps, platform engineering, or SRE practices. Experience scaling architectures to support high-growth environments. Certification in cloud or architecture frameworks (AWS SA Pro, Azure Architect Expert, TOGAF, etc.).
01/04/2026
Full time
CBSbutler Holdings Limited trading as CBSbutler
Senior Site Reliability Engineer (SRE)
CBSbutler Holdings Limited trading as CBSbutler
Senior Site Reliability Engineer (SRE)
Remote
12-month contract (high chance of extension)

Job Description
Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwide. As a Senior SRE, you'll shape the architecture, improve platform-wide resiliency, and ensure services stay performant, scalable, and secure. This isn't just about maintaining a single system; you'll influence reliability across multiple services, driving improvements that touch the entire ecosystem.

Key Responsibilities
  • Lead incident response and troubleshooting for production systems, resolving high-severity issues and driving post-incident improvements.
  • Influence architecture to improve platform-wide reliability, resiliency, and operational efficiency, ensuring services remain available under heavy load.
  • Drive containerisation best practices and manage Kubernetes-based workloads at scale.
  • Build and maintain event-driven architectures that scale globally while ensuring fault-tolerance and high availability.
  • Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (Terraform, CloudFormation, Ansible, CDK).
  • Collaborate with engineering, product, and security teams to define SLOs, SLIs, and error budgets across services.
  • Provide mentorship, advocate SRE best practices, and ensure teams are empowered to deliver resilient, reliable systems.

Experience / Must-Have Skills
  • Extensive experience in AWS and AWS-managed services (EC2, Lambda, S3, VPC, CloudWatch, CloudTrail, IAM, EKS, Service Catalog, multi-account environments).
  • Strong Kubernetes / container orchestration experience, including EKS, OpenShift, Docker, and service mesh.
  • Deep understanding of networking fundamentals: DNS, VPCs, routing, load balancing, TCP/IP, firewall policies.
  • Proven track record in incident response and troubleshooting at scale.
  • Hands-on experience with infrastructure automation and CI/CD pipelines.
  • Experience designing event-driven architectures and resilient systems.
  • High level of autonomy, able to influence platform-wide decisions and architect for reliability across services.
  • Ability and desire to mentor junior staff.
  • Bonus: experience in gaming, interactive entertainment, or other high-traffic, global-scale platforms.

If you are interested in this role, please feel free to submit your CV.
31/03/2026
Contractor
Cambridge University Press & Assessment
Principal Developer Team Lead
Cambridge University Press & Assessment
Principal Developer Team Lead Salary: £51,400 - £68,800 Location: Cambridge/Hybrid Contract: Permanent This Principal Developer Team Lead position offers a pivotal opportunity to shape the technical future of a world-renowned academic organisation. You'll spearhead the migration of enterprise systems to cutting-edge cloud-native AWS architectures, while balancing hands-on technical leadership with people management responsibilities. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge. About the role We're seeking a hands-on Principal Developer Team Lead to drive the technical transformation of our Exam Technology Organisation as we migrate legacy enterprise applications to modern, cloud-native architectures on AWS. You'll balance technical leadership with people management, leading a team of 4-8 developers while establishing the foundations for our future technology stack. Your initial focus will be on two strategic priorities: Evolving our SRE function - Building the DevOps infrastructure, automation, and tooling that enables Site Reliability Engineering practices across development and operations teams. Advancing our AI development practice - Establishing standards, frameworks, and best practices for responsibly integrating AI capabilities into our education platforms. What You'll Do Technical Leadership: Lead migration of legacy applications to cloud-native AWS architectures. Build DevOps automation to support SRE practices. Establish AI/ML development standards and frameworks. Set observability, monitoring, and incident response standards. Promote best practices in web, event-driven, and cloud-native technologies. Provide technical expertise and oversee code reviews. People Leadership: Manage and mentor a team of 4-8 developers, providing coaching and development plans and identifying training needs in AI/ML and SRE. 
Support recruitment and foster a culture of continual improvement and wellbeing. Delivery & Collaboration Deliver software in agile squads Collaborate with architects, SREs, product owners, and infrastructure teams Liaise with stakeholders to identify education sector needs Plan and estimate migrations and feature delivery Coordinate with service management, security, and AWS experts About you Essential   experience Degree or equivalent Proven technical team leadership Skilled in two or more modern programming languages Experience with AWS cloud and infrastructure DevOps skills: automation, CI/CD, infrastructure-as-code Understanding of SRE and observability Experience in web-apps and modern frameworks Strong communicator with technical and non-technical audiences Technical Expertise CI/CD pipelines, automation frameworks, and developer tooling Observability tools, monitoring, logging, and alerting systems Responsible AI practices and governance Event-driven architecture and microservices patterns Software design patterns and scalability best practices Security principles in cloud environments Leadership Qualities Ability to set technical standards and provide thought leadership Experience balancing people management with hands-on contribution Strong mentoring and coaching skills Collaborative approach that builds trust across teams Passion for continuous learning in AI/ML and DevOps Promotes inclusion and continuous improvement You'll be instrumental in our digital transformation, establishing the foundations for reliable, innovative systems that serve millions of learners, teachers, and researchers worldwide. By evolving our SRE function and advancing our AI practice, you'll empower teams to deliver high-performance solutions while responsibly harnessing cutting-edge technologies. If you would like to know more about this opportunity and what will make you successful, please see the full job description attached to the bottom of this vacancy on our careers site. 
Rewards and benefits We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible   rewards package , featuring family-friendly and planet-friendly benefits including: 28 days annual leave plus bank holidays Private medical and Permanent Health Insurance Discretionary annual bonus Group personal pension scheme Life assurance up to 4 x annual salary Green travel schemes We are a hybrid working organisation, and we offer a range of flexible working options from day one. We expect most hybrid-working colleagues to spend 40-60% of their time at their dedicated office or location. We will also consider other work arrangements if you wish to work more flexibly or require adjustments due to a disability. Ready to pursue your potential? Apply now. We review applications on an ongoing basis, with a closing date for all applications being 18 February  2026. If you are shortlisted and progressed through the stages, you can expect:       A 40-minute screening call with the Hiring Manager.  First stage interview via MS Teams or in person. You will be provided with a brief to complete a role related task which will need to be returned by email in advance of your interview.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the   gov.uk   website for guidance to understand your own eligibility based on the role you are applying for.   Why join us Joining us is your opportunity to pursue potential. You'll belong to a collaborative team that's exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. 
Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it's safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background. We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities.
04/02/2026
Full time
Cambridge University Press & Assessment
Site Reliability Engineer Team Lead
Cambridge University Press & Assessment Cambridge/Hybrid (with 2-3 days per week in office)
Job Title: English Technology Platform SRE Team Lead Salary: £68,600 - £91,700 Location: Cambridge/Hybrid (with 2-3 days per week in office) Contract: Permanent Hours: Full time Are you ready to shape the future of technology platforms at the heart of Cambridge's academic excellence? Join us as our English Technology Platform SRE Team Lead and help drive innovation, reliability, and intelligent automation in a world-class environment. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge. About the role The SRE Team Lead will lead a mature Site Reliability Engineering function within the Platform Operations Team, working closely with Platform Support and Engineering teams. This role demands strong thought leadership, technical depth, and strategic direction for the discipline, with a particular emphasis on leveraging AI-driven operations (AIOps) and FinOps practices to optimise reliability, performance, and cloud spend. Although this is a hands-on technical role, the SRE Team Lead will also manage a small team of SREs, providing clear direction and ensuring consistent, data-driven, AI-enhanced service delivery across the platforms while working collaboratively with existing support and engineering groups. Apply core SRE and DevOps principles (culture, automation, testing, measurement, and continuous improvement) to build and optimise pipelines focused on rapid, reliable software delivery. Integrate AIOps capabilities, such as automated anomaly detection and intelligent alerting, to further enhance operational excellence. Work with Solutions Architecture, Development, and QA teams to automate processes wherever possible, creating and improving stable CI/CD pipelines for both software and infrastructure. Develop tools that enable rapid provisioning of environments and resources across all teams, incorporating AI-assisted automation where beneficial. 
Use automation, observability, and monitoring tools to improve site reliability and proactively identify issues. Support development teams with troubleshooting, particularly in infrastructure, networking, and multi-tier application design. Serve as a subject matter expert for cloud services, especially AWS PaaS, while applying FinOps practices to ensure cloud cost transparency, optimisation, and efficient resource usage. Create and maintain robust technical documentation for the infrastructure of the English platforms, including operational runbooks enhanced with predictive and AI-supported insights. Stay engaged with developments in the SRE, DevOps, AIOps, and FinOps communities, continually introducing new practices and technologies to improve reliability, performance, automation, and cloud cost efficiency. This position has been classified as a hybrid role, requiring the selected candidate to typically spend 40-60% of their time collaborating and connecting face-to-face at their dedicated location. Aside from our hybrid principles, other flexible working requests will be considered from the first day of employment, including other work arrangements should you require adjustments due to a disability or long-term health condition. About you A passion for site reliability engineering, with the drive to understand, anticipate, and counter platform-related issues before they become problems, and to stay up to date with the latest technological trends and developments. Great communication skills, allowing effective collaboration across technical leadership and various business stakeholders, with the ability to present ideas and strategies clearly and persuasively. Demonstrable soft skills in motivating, inspiring and leading a team (direct line management is not part of the role's remit). Educated to degree level or equivalent, with a minimum of 5 years' proven experience in a systems administration or blended DevOps role. 
Experience implementing technologies such as Terraform, GitHub Actions, and containerisation/orchestration tools, e.g. Kubernetes and Docker. Expertise in monitoring tools like New Relic, Grafana, Alert Manager and Site24x7. Extensive knowledge of cloud computing infrastructure, especially Amazon Web Services (EKS, ECS, RDS, Route53 etc.). Excellent troubleshooting, debugging, communication and documentation skills. Experience of working within an Agile product development environment. For a detailed job description, please refer to the link at the bottom of the advert on our careers site. We are a Disability Confident (DC) employer that is committed to equality and inclusion, ensuring our recruitment process is accessible to all. The DC scheme's Offer of an Interview commitment applies to applicants who opt in, disclose a disability or a long-term health condition, and best meet the minimum criteria for the role. In instances where interviewing all qualifying candidates is not practicable, we prioritise those who best meet the minimum criteria, as we would for applicants who do not have a disability or long-term health condition. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the gov.uk website for guidance to understand your own eligibility based on the role you are applying for. Rewards and benefits We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible rewards package, featuring family-friendly and planet-friendly benefits including: 28 days annual leave plus bank holidays; Private medical and Permanent Health Insurance; Discretionary annual bonus; Group personal pension scheme; Life assurance up to 4 x annual salary; Green travel schemes. Ready to pursue your potential? Apply now. 
We aim to support candidates by making our interview process clear and transparent. The closing date for all applications will be 4th February. We will review applications on an ongoing basis, and shortlisted candidates can expect interviews to take place shortly after it closes. If you are shortlisted and progressed through the stages, you can expect:  A 15-minute screening call with the Hiring Manager. Final stage virtual interview via MS Teams.  If you require any reasonable adjustments during the recruitment process due to a disability or a long-term health condition, there will be an opportunity for you to inform us via the online application form. We will do our best to accommodate your needs.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. We are committed to an equitable recruitment process. As such, applications must be submitted via our official online application procedure. Please refrain from sending your CV directly to our recruiters. If you experience technical difficulties or require additional support with submitting your online application, contact the Recruiter.  Why join us  Joining us is your opportunity to pursue potential. You will belong to a collaborative team that is exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it is safe to speak up, and where we learn continuously to improve together. 
We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background.  We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities. If you are ready to take the next step in your Cambridge journey, we welcome your application. Together, we continue to shape a culture where everyone feels empowered to succeed and motivated to make a difference— for ourselves, for each other, and for learners worldwide.
21/01/2026
Full time
Triumph Consultants Ltd
Senior Front-End Developer (VW06/10)
Triumph Consultants Ltd Leeds, Yorkshire
Senior Front End Developer Hybrid working in Leeds or Newcastle - 3 days per week Competitive Market Rate We are seeking a Senior Front End Developer to join a team of expert Front End developers building accessible, user-focused digital services at scale. You'll work with Node.js, Express, React, and templating engines to deliver high-quality, GDS-compliant interfaces that make a real impact. What you'll do Lead the design and development of reusable Front End patterns and components. Build accessible, high-performing UIs that work across devices and browsers. Work with modern web stacks - HTML, CSS, JavaScript (client & server), Node.js, Express, TypeScript, React. Support CI/CD pipelines, Docker deployments, and end-to-end testing (TDD/BDD). Mentor and coach developers, shaping best practice and standards across teams. Collaborate across the wider engineering and design community to drive innovation. What we're looking for Proven experience building large-scale, high-traffic systems. Expertise in modern Front End frameworks and open-source technologies. Strong understanding of accessibility (WCAG) and progressive enhancement. Skilled in testing frameworks like Jest or Mocha. Experience integrating with APIs, databases (SQL/NoSQL), caching tools (Redis), and cloud platforms (AWS or Azure). Confident leading teams, setting direction, and promoting a collaborative, agile culture. How to Apply Quote the Job Title and Reference Number in your application. Submit your CV in Word format. Applications are reviewed on a rolling basis-early submission is recommended. We will also add your details to our mail out lists. Please note you may receive details of roles outside of your immediate vicinity, as many candidates are able to relocate temporarily for work. Please disregard any such emails that are not of interest and let us know if you would rather not receive such mailouts and/or if you wish us to delete your details and prefer to apply direct to our advertised roles. 
If you do not hear from us within three working days, unfortunately your application has not been shortlisted on this occasion. Thank you for your interest in working with us.
07/10/2025
Contractor
The Bridge IT Recruitment
Head of IT Security and Platform Engineering (Hybrid) Newcastle - To £115k+ Bens
The Bridge IT Recruitment Newcastle Upon Tyne, Tyne And Wear
My client, a global organisation based in Newcastle city centre, is seeking an experienced Head of Security and Platform Engineering to start ASAP. This pivotal role takes the lead in delivering breakthrough improvements in reliability and performance across technology platforms, ensuring our systems consistently exceed expectations. As the leading force behind our cyber security agenda, you will champion a step change in modern security controls, introducing cutting-edge measures that protect the business. You will lead four core technology towers and inspire teams to set bold targets, measure progress, and celebrate success as we raise the bar for platform resilience, scalability, and security. Key Responsibilities: Strategic Leadership & Governance Define and drive the vision, strategy, and roadmaps for Platform towers, aligned with business objectives and risk appetite. Oversee integration and collaboration across the four core platform towers: Digital Workspace Services (DWS), Support and System Reliability Engineering (SSRE), Platform and Cloud Engineering (PaCE), and Security & Network Operations (SNOPs). Establish and socialise the Cyber Security Strategy and Roadmap, ensuring alignment with enterprise resilience and regulatory requirements. Cyber Security Leadership Shape the cyber security vision and build a corresponding technical roadmap that delivers world-class security controls across cloud infrastructure, networks, endpoints, identity & access management, application security, and threat detection. Collaborate closely with the SNOPs Lead to adapt the SNOPs roadmap priorities in line with shifts in the industry, the evolving threat landscape and regulatory requirements. Ensure effective 24/7 security operations (inc. security incident management). Collaborate closely with the Enterprise Resilience function (1st Line of Defence) to ensure integrated risk management and incident response. 
Promote stakeholder engagement and cross-functional collaboration to embed a culture of security awareness and ownership across the organisation. Operational Oversight Ensure high availability, performance, and security of all technology systems and infrastructure. Monitor and improve service levels, incident resolution times, and system reliability metrics. Lead cross-functional coordination for escalations, major incidents, and service continuity planning. Team Leadership & Development Provide leadership and direction to platform tower leads. Foster a culture of continuous improvement, collaboration, and innovation across all teams. Support recruitment, onboarding, and capability development to meet evolving technology needs. Technology Platform Delivery Oversee the delivery and lifecycle management of: Microsoft 365 and collaboration platforms; Cloud platforms (design, automation, cost optimisation); Network and security operations (compliance, threat management); Monitoring, observability, and backup/recovery systems. Ensure alignment with architectural standards and regulatory requirements (e.g., DORA, Cyber Essentials Plus). Stakeholder Engagement Act as the escalation point for unresolved issues across platform towers. Collaborate with product teams, business units, and external vendors to ensure service excellence and alignment with user needs. Represent Technology in all relevant Information Security, Risk and project Committees, ensuring visibility of, accountability for, and robust management of cyber security risks. Represent Security and Platforms in governance forums such as the Architectural Review Board (ARB). Essential Skills Proven leadership in managing cyber security and cross-functional technology teams in a complex, global environment. Deep understanding of IT infrastructure, cloud platforms (e.g., Azure), and enterprise collaboration tools (e.g., Microsoft 365). 
Strong grasp of ITIL-based service management, including incident, change, and problem management. Expertise in security and compliance frameworks, including DORA and Cyber Essentials Plus. Prior hands-on experience in delivering security solutions within enterprise environments. Knowledge of disaster recovery, business continuity, and vulnerability management. Excellent communication, stakeholder management, and vendor negotiation skills. Qualifications Bachelor's degree in Computer Science, Information Systems, or a related field (Master's preferred). ITIL Foundation certification (Intermediate or Expert level desirable). Relevant cloud certifications (e.g., Microsoft Certified: Azure Solutions Architect, AWS Certified Solutions Architect). Experience 10+ years in IT leadership roles, with at least 5 years managing platform or infrastructure services. Demonstrated success in leading digital transformation or cloud migration initiatives. Experience working in regulated environments with a strong focus on security and compliance. The role is hybrid, with 3 office days a week in a central Newcastle location with great transport links by train, car or bus. Apply now for immediate consideration.
07/10/2025
Full time
Scope AT Limited
AVP Infrastructure Cloud Support - AWS, Terraform, Python, DevOps, SRE - Permanent
Scope AT Limited
AVP Infrastructure Cloud Support - AWS, Terraform, Python, DevOps, SRE - Permanent Job purpose This role supports the AWS public cloud infrastructure and the implementation of Infrastructure as Code using Terraform. The role will work closely with the SRE and Engineering teams to ensure that the Cloud environment has sufficient observability and is appropriately managed. What you will be doing: Responsible for ensuring the Production service is prioritized, with all service incidents, problems and requests for cloud hosted services responded to and actioned. Responsible for maintaining the reliability and security of the Cloud Hosted environments. Improve Observability and Telemetry in the Cloud Hosted environments, utilizing SRE methodology to provide SLAs, SLOs and SLIs. Ensure risks within the Cloud hosted environment are documented and regularly reviewed, and that identified operational risk issues are captured with appropriate actions tracked to agreed timelines. Define and implement standards and procedures to adhere to current best practice and drive continual service improvement. Responsible for ensuring Security standards are implemented and maintained in the Cloud hosted environment, including delivery of upgrades and security updates to minimise risk and ensure stability for all cloud hosted services. Responsible for maintaining service resilience for all cloud hosted services, including backup and disaster recovery processes. Where necessary, plan and conduct quarterly DR tests for all cloud hosted services, ensuring any findings are captured and addressed promptly. What we're looking for: Must have strong technical operational skills in supporting AWS Cloud Hosted environments and at least 3 years in an Infrastructure support role. Strong understanding of Infrastructure as Code technologies, ideally including Terraform and Ansible. 
Operational risk and control management processes, including an understanding of Security best practice and how to apply this safely within a Production environment. Asset management and life cycle (EOS/EOL) process management. Planning and leading disaster recovery fail-overs of IT systems and services. Preferably experience of working in a regulated financial services/banking organization. Able to understand and use AWS including an understanding of AWS services, security and networking. Knowledge of at least 1 programming language, preferably Python. Knowledge of CI/CD specifically relating to Cloud Hosted environments. Including an understanding of some of the Infrastructure as Code tools GIT, Terraform, Ansible, Jenkins. Permanent Role - Hybrid working (Central London based) - Candidate must be eligible to work in the UK By applying to this job you are sending us your CV, which may contain personal information. Please refer to our Privacy Notice to understand how we process this information. In short, in order to supply you with work finding services, we will hold and process your personal data, and only with your express permission we will share this personal data with a client (or a third party working on behalf of the client) by email or by upload to the Client/third parties vendor management system. By giving us permission to send your CV to a client, this constitutes permission to share the personal data that would be necessary to consider your application, interview you (Phone/video/face to face) and if successful hire you. Scope AT acts as an employment agency for Permanent Recruitment and an employment business for the supply of temporary workers. By applying for this job you accept the Terms and Conditions, Data Protection Policy, Privacy Notice and Disclaimers which can be found at our website.
06/10/2025
Full time

© 2008-2026 IT Job Board