it job board logo
  • Home
  • Find IT Jobs
  • Register CV
  • Career Advice
  • Contact us
  • Employers
    • Register as Employer
    • Pricing Plans
  • Recruiting? Post a job
  • Sign in
  • Sign up
  • Home
  • Find IT Jobs
  • Register CV
  • Career Advice
  • Contact us
  • Employers
    • Register as Employer
    • Pricing Plans
Sorry, that job is no longer available. Here are some results that may be similar to the job you were looking for.

10 jobs found

Email me jobs like this
Refine Search
Current Search
site reliability engineering sre manager
Morson Edge
IT Manager
Morson Edge Manchester, Lancashire
IT Manager (CDN, AWS & SRE Focus) Manchester (Hybrid - 2 days in office) Up to £80,000 + Benefits Permanent, Full-Time The Opportunity Morson Edge are are looking for an experienced IT Manager to lead and evolve a highperforming infrastructure and reliability function. This is a key leadership role where you'll shape strategy, improve system resilience, and drive best practices across CDN, AWS cloud environments, and Site Reliability Engineering (SRE) . You'll work at the intersection of infrastructure, performance, and reliability-ensuring systems are scalable, secure, and always available. What You'll Be Doing Lead, mentor, and develop a team of engineers across cloud infrastructure and SRE Own and optimise AWS environments , ensuring scalability, cost-efficiency, and security Manage and enhance CDN performance and delivery strategies Drive adoption of SRE principles including SLIs, SLOs, and error budgets Improve system observability through monitoring, logging, and alerting Collaborate with engineering and product teams to support high-availability services Oversee incident management, root cause analysis, and continuous improvement Define and implement infrastructure best practices and automation What We're Looking For Proven experience in an IT Manager/Infrastructure Manager/SRE Lead role Strong expertise in AWS (EC2, Lambda, CloudFront, VPC, etc.) Solid understanding of Content Delivery Networks (CDN) and performance optimisation Experience implementing or working within SRE frameworks Knowledge of Infrastructure as Code (eg, Terraform, CloudFormation) Strong background in monitoring tools (eg, Prometheus, Grafana, Datadog) Excellent leadership and stakeholder management skills Nice to Have Experience with containerisation (Docker, Kubernetes) Exposure to DevOps culture and CI/CD pipelines Security and compliance awareness in cloud environments What's in It for You Salary up to £80,000 Hybrid working (2 days per week in Manchester office) Pension scheme Training and development opportunities A chance to shape and lead a modern, cloud-first infrastructure function
01/04/2026
Full time
IT Manager (CDN, AWS & SRE Focus) Manchester (Hybrid - 2 days in office) Up to £80,000 + Benefits Permanent, Full-Time The Opportunity Morson Edge are are looking for an experienced IT Manager to lead and evolve a highperforming infrastructure and reliability function. This is a key leadership role where you'll shape strategy, improve system resilience, and drive best practices across CDN, AWS cloud environments, and Site Reliability Engineering (SRE) . You'll work at the intersection of infrastructure, performance, and reliability-ensuring systems are scalable, secure, and always available. What You'll Be Doing Lead, mentor, and develop a team of engineers across cloud infrastructure and SRE Own and optimise AWS environments , ensuring scalability, cost-efficiency, and security Manage and enhance CDN performance and delivery strategies Drive adoption of SRE principles including SLIs, SLOs, and error budgets Improve system observability through monitoring, logging, and alerting Collaborate with engineering and product teams to support high-availability services Oversee incident management, root cause analysis, and continuous improvement Define and implement infrastructure best practices and automation What We're Looking For Proven experience in an IT Manager/Infrastructure Manager/SRE Lead role Strong expertise in AWS (EC2, Lambda, CloudFront, VPC, etc.) Solid understanding of Content Delivery Networks (CDN) and performance optimisation Experience implementing or working within SRE frameworks Knowledge of Infrastructure as Code (eg, Terraform, CloudFormation) Strong background in monitoring tools (eg, Prometheus, Grafana, Datadog) Excellent leadership and stakeholder management skills Nice to Have Experience with containerisation (Docker, Kubernetes) Exposure to DevOps culture and CI/CD pipelines Security and compliance awareness in cloud environments What's in It for You Salary up to £80,000 Hybrid working (2 days per week in Manchester office) Pension scheme Training and development opportunities A chance to shape and lead a modern, cloud-first infrastructure function
Moorepay
Site Reliability Engineer (CloudOps)
Moorepay Manchester, Lancashire
The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation's growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments. This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale. In this position you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, the SRE will identify priorities to remove friction from engineering teams, streamline processes, and enhance operational excellence. This role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions. Key Responsibilities Reliability, Performance & Security: Design and implement strategies to improve system reliability, availability, and security. Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment. Conduct regular security reviews and collaborate with security teams to address vulnerabilities. CI/CD Management: Own and optimise Continuous Integration and Continuous Deployment pipelines. Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows. Ensure secure, efficient, and automated deployment processes across environments. Monitoring & Observability: Implement and maintain monitoring solutions for infrastructure and applications. Develop dashboards and alerting systems to ensure proactive incident and security event management. Evaluate and integrate new observability tools as needed. Automation & Tooling: Automate repetitive tasks to improve efficiency and reduce human error. Build and maintain internal tools that support engineering productivity and security compliance. Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates. Cloud Infrastructure Management: Manage and optimise services across AWS and Azure environments. Ensure scalability, resilience, and security of service-based architectures. Implement cost management strategies to optimise cloud spend without compromising performance or security. Incident Response & Root Cause Analysis: Lead incident response efforts, including security incidents, and conduct post-mortem reviews. Drive continuous improvement through lessons learned and preventive measures. Skills & Experience Proven experience in AWS and Azure cloud environments. Strong background in CI/CD tools (e.g., Azure DevOps, Pipelines, GitHub Actions, Jenkins). Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog). Proficiency in scripting and automation (Python, Bash, PowerShell). Familiarity with containerisation and orchestration (Docker, Kubernetes). Solid understanding of networking, security, and cost optimisation in cloud environments. Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks. A problem-solver with a proactive mindset. Comfortable working in fast-paced, evolving environments. Strong communicator who can bridge gaps between operations, development, and security teams. Passionate about automation, scalability, cost efficiency, and security.
01/04/2026
Full time
The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation's growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments. This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale. In this position you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, the SRE will identify priorities to remove friction from engineering teams, streamline processes, and enhance operational excellence. This role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions. Key Responsibilities Reliability, Performance & Security: Design and implement strategies to improve system reliability, availability, and security. Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment. Conduct regular security reviews and collaborate with security teams to address vulnerabilities. CI/CD Management: Own and optimise Continuous Integration and Continuous Deployment pipelines. Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows. Ensure secure, efficient, and automated deployment processes across environments. Monitoring & Observability: Implement and maintain monitoring solutions for infrastructure and applications. Develop dashboards and alerting systems to ensure proactive incident and security event management. Evaluate and integrate new observability tools as needed. Automation & Tooling: Automate repetitive tasks to improve efficiency and reduce human error. Build and maintain internal tools that support engineering productivity and security compliance. Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates. Cloud Infrastructure Management: Manage and optimise services across AWS and Azure environments. Ensure scalability, resilience, and security of service-based architectures. Implement cost management strategies to optimise cloud spend without compromising performance or security. Incident Response & Root Cause Analysis: Lead incident response efforts, including security incidents, and conduct post-mortem reviews. Drive continuous improvement through lessons learned and preventive measures. Skills & Experience Proven experience in AWS and Azure cloud environments. Strong background in CI/CD tools (e.g., Azure DevOps, Pipelines, GitHub Actions, Jenkins). Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog). Proficiency in scripting and automation (Python, Bash, PowerShell). Familiarity with containerisation and orchestration (Docker, Kubernetes). Solid understanding of networking, security, and cost optimisation in cloud environments. Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks. A problem-solver with a proactive mindset. Comfortable working in fast-paced, evolving environments. Strong communicator who can bridge gaps between operations, development, and security teams. Passionate about automation, scalability, cost efficiency, and security.
Randstad Technologies
SRE - Site Reliability Engineer
Randstad Technologies
Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : £55 Per Hour - £62 Per Hour Inside IR35 Job Overview We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Responsibilities Design, deploy and scale observability platforms Manage and scale Prometheus monitoring systems Deploy and maintain large Elasticsearch clusters Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks Automate infrastructure using Terraform and Ansible Develop tools and scripts using Python, Go, Ruby or Bash Work with Linux systems (Debian/Ubuntu) Participate in on-call rotation Improve system reliability, performance and scalability Required Skills 5+ years experience in Site Reliability Engineering / DevOps Strong Linux systems experience Observability and Monitoring tools experience Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana) Kafka Terraform / Infrastructure as Code Ansible / Configuration Management Programming experience (Python, Go, Ruby or Bash) Distributed systems and cloud infrastructure experience This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it khushboo. Co. uk Randstad Technologies is acting as an Employment Business in relation to this vacancy.
01/04/2026
Contractor
Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : £55 Per Hour - £62 Per Hour Inside IR35 Job Overview We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Responsibilities Design, deploy and scale observability platforms Manage and scale Prometheus monitoring systems Deploy and maintain large Elasticsearch clusters Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks Automate infrastructure using Terraform and Ansible Develop tools and scripts using Python, Go, Ruby or Bash Work with Linux systems (Debian/Ubuntu) Participate in on-call rotation Improve system reliability, performance and scalability Required Skills 5+ years experience in Site Reliability Engineering / DevOps Strong Linux systems experience Observability and Monitoring tools experience Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana) Kafka Terraform / Infrastructure as Code Ansible / Configuration Management Programming experience (Python, Go, Ruby or Bash) Distributed systems and cloud infrastructure experience This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it khushboo. Co. uk Randstad Technologies is acting as an Employment Business in relation to this vacancy.
Randstad Technologies Recruitment
SRE - Site Reliability Engineer
Randstad Technologies Recruitment
Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : 55 Per Hour - 62 Per Hour Inside IR35 Job Overview We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Responsibilities Design, deploy and scale observability platforms Manage and scale Prometheus monitoring systems Deploy and maintain large Elasticsearch clusters Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks Automate infrastructure using Terraform and Ansible Develop tools and scripts using Python, Go, Ruby or Bash Work with Linux systems (Debian/Ubuntu) Participate in on-call rotation Improve system reliability, performance and scalability Required Skills 5+ years experience in Site Reliability Engineering / DevOps Strong Linux systems experience Observability and Monitoring tools experience Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana) Kafka Terraform / Infrastructure as Code Ansible / Configuration Management Programming experience (Python, Go, Ruby or Bash) Distributed systems and cloud infrastructure experience This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it khushboo. Co. uk Randstad Technologies is acting as an Employment Business in relation to this vacancy.
31/03/2026
Contractor
Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : 55 Per Hour - 62 Per Hour Inside IR35 Job Overview We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Responsibilities Design, deploy and scale observability platforms Manage and scale Prometheus monitoring systems Deploy and maintain large Elasticsearch clusters Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks Automate infrastructure using Terraform and Ansible Develop tools and scripts using Python, Go, Ruby or Bash Work with Linux systems (Debian/Ubuntu) Participate in on-call rotation Improve system reliability, performance and scalability Required Skills 5+ years experience in Site Reliability Engineering / DevOps Strong Linux systems experience Observability and Monitoring tools experience Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana) Kafka Terraform / Infrastructure as Code Ansible / Configuration Management Programming experience (Python, Go, Ruby or Bash) Distributed systems and cloud infrastructure experience This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it khushboo. Co. uk Randstad Technologies is acting as an Employment Business in relation to this vacancy.
Cambridge University Press & Assessment
Principal Developer Team Lead
Cambridge University Press & Assessment
Principal Developer Team Lead Salary:   £51,400 - £68,800 Location:   Cambridge/Hybrid Contract:   Permanent This Principal Developer Team Lead position offers a pivotal opportunity to shape the technical future of a world-renowned academic organisation. You'll spearhead the migration of enterprise systems to cutting-edge cloud-native AWS architectures, while balancing hands-on technical leadership with people management responsibilities. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge.  About the role We're seeking a hands-on Principal Developer Team Lead to drive the technical transformation of our Exam Technology Organisation as we migrate legacy enterprise applications to modern, cloud-native architectures on AWS. You'll balance technical leadership with people management, leading a team of 4-8 developers while establishing the foundations for our future technology stack. Your initial focus will be on two strategic priorities: Evolving our SRE function   - Building the DevOps infrastructure, automation, and tooling that enables Site Reliability Engineering practices across development and operations teams Advancing our AI development practice   - Establishing standards, frameworks, and best practices for responsibly integrating AI capabilities into our education platforms. What You'll Do Technical Leadership Lead migration of legacy applications to cloud-native AWS architectures Build DevOps automation to support SRE practices Establish AI/ML development standards and frameworks Set observability, monitoring, and incident response standards Promote best practices in web, event-driven, and cloud-native technologies Provide technical expertise and oversee code reviews People Leadership Manage and mentor a team of 4–8 developers, providing coaching, development plan Identifying training needs in AI/ML and SRE. Support recruitment and foster a culture of continual improvement and wellbeing. Delivery & Collaboration Deliver software in agile squads Collaborate with architects, SREs, product owners, and infrastructure teams Liaise with stakeholders to identify education sector needs Plan and estimate migrations and feature delivery Coordinate with service management, security, and AWS experts About you Essential   experience Degree or equivalent Proven technical team leadership Skilled in two or more modern programming languages Experience with AWS cloud and infrastructure DevOps skills: automation, CI/CD, infrastructure-as-code Understanding of SRE and observability Experience in web-apps and modern frameworks Strong communicator with technical and non-technical audiences Technical Expertise CI/CD pipelines, automation frameworks, and developer tooling Observability tools, monitoring, logging, and alerting systems Responsible AI practices and governance Event-driven architecture and microservices patterns Software design patterns and scalability best practices Security principles in cloud environments Leadership Qualities Ability to set technical standards and provide thought leadership Experience balancing people management with hands-on contribution Strong mentoring and coaching skills Collaborative approach that builds trust across teams Passion for continuous learning in AI/ML and DevOps Promotes inclusion and continuous improvement You'll be instrumental in our digital transformation, establishing the foundations for reliable, innovative systems that serve millions of learners, teachers, and researchers worldwide. By evolving our SRE function and advancing our AI practice, you'll empower teams to deliver high-performance solutions while responsibly harnessing cutting-edge technologies. If you would like to know more about this opportunity and what will make you successful, please see the full job description attached to the bottom of this vacancy on our careers site. Rewards and benefits We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible   rewards package , featuring family-friendly and planet-friendly benefits including: 28 days annual leave plus bank holidays Private medical and Permanent Health Insurance Discretionary annual bonus Group personal pension scheme Life assurance up to 4 x annual salary Green travel schemes We are a hybrid working organisation, and we offer a range of flexible working options from day one. We expect most hybrid-working colleagues to spend 40-60% of their time at their dedicated office or location. We will also consider other work arrangements if you wish to work more flexibly or require adjustments due to a disability. Ready to pursue your potential? Apply now. We review applications on an ongoing basis, with a closing date for all applications being 18 February  2026. If you are shortlisted and progressed through the stages, you can expect:       A 40-minute screening call with the Hiring Manager.  First stage interview via MS Teams or in person. You will be provided with a brief to complete a role related task which will need to be returned by email in advance of your interview.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the   gov.uk   website for guidance to understand your own eligibility based on the role you are applying for.   Why join us Joining us is your opportunity to pursue potential. You'll belong to a collaborative team that's exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it's safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background. We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities.
04/02/2026
Full time
Principal Developer Team Lead Salary:   £51,400 - £68,800 Location:   Cambridge/Hybrid Contract:   Permanent This Principal Developer Team Lead position offers a pivotal opportunity to shape the technical future of a world-renowned academic organisation. You'll spearhead the migration of enterprise systems to cutting-edge cloud-native AWS architectures, while balancing hands-on technical leadership with people management responsibilities. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge.  About the role We're seeking a hands-on Principal Developer Team Lead to drive the technical transformation of our Exam Technology Organisation as we migrate legacy enterprise applications to modern, cloud-native architectures on AWS. You'll balance technical leadership with people management, leading a team of 4-8 developers while establishing the foundations for our future technology stack. Your initial focus will be on two strategic priorities: Evolving our SRE function   - Building the DevOps infrastructure, automation, and tooling that enables Site Reliability Engineering practices across development and operations teams Advancing our AI development practice   - Establishing standards, frameworks, and best practices for responsibly integrating AI capabilities into our education platforms. What You'll Do Technical Leadership Lead migration of legacy applications to cloud-native AWS architectures Build DevOps automation to support SRE practices Establish AI/ML development standards and frameworks Set observability, monitoring, and incident response standards Promote best practices in web, event-driven, and cloud-native technologies Provide technical expertise and oversee code reviews People Leadership Manage and mentor a team of 4–8 developers, providing coaching, development plan Identifying training needs in AI/ML and SRE. Support recruitment and foster a culture of continual improvement and wellbeing. Delivery & Collaboration Deliver software in agile squads Collaborate with architects, SREs, product owners, and infrastructure teams Liaise with stakeholders to identify education sector needs Plan and estimate migrations and feature delivery Coordinate with service management, security, and AWS experts About you Essential   experience Degree or equivalent Proven technical team leadership Skilled in two or more modern programming languages Experience with AWS cloud and infrastructure DevOps skills: automation, CI/CD, infrastructure-as-code Understanding of SRE and observability Experience in web-apps and modern frameworks Strong communicator with technical and non-technical audiences Technical Expertise CI/CD pipelines, automation frameworks, and developer tooling Observability tools, monitoring, logging, and alerting systems Responsible AI practices and governance Event-driven architecture and microservices patterns Software design patterns and scalability best practices Security principles in cloud environments Leadership Qualities Ability to set technical standards and provide thought leadership Experience balancing people management with hands-on contribution Strong mentoring and coaching skills Collaborative approach that builds trust across teams Passion for continuous learning in AI/ML and DevOps Promotes inclusion and continuous improvement You'll be instrumental in our digital transformation, establishing the foundations for reliable, innovative systems that serve millions of learners, teachers, and researchers worldwide. By evolving our SRE function and advancing our AI practice, you'll empower teams to deliver high-performance solutions while responsibly harnessing cutting-edge technologies. If you would like to know more about this opportunity and what will make you successful, please see the full job description attached to the bottom of this vacancy on our careers site. Rewards and benefits We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible   rewards package , featuring family-friendly and planet-friendly benefits including: 28 days annual leave plus bank holidays Private medical and Permanent Health Insurance Discretionary annual bonus Group personal pension scheme Life assurance up to 4 x annual salary Green travel schemes We are a hybrid working organisation, and we offer a range of flexible working options from day one. We expect most hybrid-working colleagues to spend 40-60% of their time at their dedicated office or location. We will also consider other work arrangements if you wish to work more flexibly or require adjustments due to a disability. Ready to pursue your potential? Apply now. We review applications on an ongoing basis, with a closing date for all applications being 18 February  2026. If you are shortlisted and progressed through the stages, you can expect:       A 40-minute screening call with the Hiring Manager.  First stage interview via MS Teams or in person. You will be provided with a brief to complete a role related task which will need to be returned by email in advance of your interview.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the   gov.uk   website for guidance to understand your own eligibility based on the role you are applying for.   Why join us Joining us is your opportunity to pursue potential. You'll belong to a collaborative team that's exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it's safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background. We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities.
Cambridge University Press & Assessment
Site Reliability Engineer Team Lead
Cambridge University Press & Assessment Cambridge/Hybrid (with 2-3 days per week in office)
Job Title:  English Technology Platform SRE Team Lead Salary:  £68,600 - £91,700 Location:  Cambridge/Hybrid (with 2-3 days per week in office) Contract:  Permanent  Hours:  Full time Are you ready to shape the future of technology platforms at the heart of Cambridge's academic excellence? Join us as our English Technology Platform SRE Team Lead and help drive innovation, reliability, and intelligent automation in a world-class environment. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge.  About the role   The SRE Team Lead will lead a mature Site Reliability Engineering function within the Platform Operations Team, working closely with Platform Support and Engineering teams. This role demands strong thought leadership, technical depth, and strategic direction for the discipline, with a particular emphasis on leveraging AI-driven operations (AIOps) and FinOps practices to optimise reliability, performance, and cloud spend. Although this is a hands-on technical role, the SRE Team Lead will also manage a small team of SRE, providing clear direction and ensuring consistent, data-driven, AI-enhanced service delivery across the platforms while working collaboratively with existing support and engineering groups. Apply core SRE and DevOps principles—culture, automation, testing, measurement, and continuous improvement—to build and optimise pipelines focused on rapid, reliable software delivery. Integrate AIOps capabilities, such as automated anomaly detection and intelligent alerting, to further enhance operational excellence. Work with Solutions Architecture, Development, and QA teams to automate processes wherever possible, creating and improving stable CI/CD pipelines for both software and infrastructure. Develop tools that enable rapid provisioning of environments and resources across all teams, incorporating AI-assisted automation where beneficial. Use automation, observability, and monitoring tools to improve site reliability and proactively identify issues. Support development teams with troubleshooting, particularly in infrastructure, networking, and multi-tier application design. Serve as a subject matter expert for cloud services—especially AWS PaaS—while applying FinOps practices to ensure cloud cost transparency, optimisation, and efficient resource usage. Create and maintain robust technical documentation for the infrastructure of the English platforms, including operational runbooks enhanced with predictive and AI-supported insights. Stay engaged with developments in the SRE, DevOps, AIOps, and FinOps communities, continually introducing new practices and technologies to improve reliability, performance, automation, and cloud cost efficiency   This position has been classified as a hybrid role, requiring the selected candidate to typically spend 40-60% of their time collaborating and connecting face-to-face at their dedicated location. Aside from our hybrid principles, other flexible working requests will be considered from the first day of employment, including other work arrangements should you require adjustments due to a disability or long-term health condition.    About you A passion for Site reliability engineering and driven to understand, anticipate, and counter platform related issues before they become problems and staying up to date with the latest technological trends and developments Great communication allowing effective collaboration across technical leadership and various business stakeholders with the ability to present ideas and strategies clearly and persuasively. Demonstratable soft skills in motivating, inspiring and leading a team (direct line management is not part of the roles remit) Educated to degree level or equivalent and with a minimum of 5 years proven experience in a systems administration or dev-ops blended role. Experience implementing technologies such as Terraform, Github Actions & Containerization/Orchestration e.g. Kubernetes & Docker Expertise in Monitoring tools like New Relic, Grafana, Alert Manager and site24x7. Have extreme knowledge of cloud computing infrastructure, especially using Amazon Web Services (EKS, ECS, RDS, Route53 etc.) Excellent troubleshooting, debugging, communication and documentation skills Experience of working within an Agile product development environment. For a detailed job description, please refer to the link at the bottom of the advert on our careers site. We are a Disability Confident (DC) employer that is committed to equality and inclusion ensuring our recruitment process is accessible to all. The DC scheme's   Offer of an Interview   commitment applies to applicants who opt in, and disclose a disability or a long-term health condition, and best meet the minimum criteria for the role. In instances where interviewing all qualifying candidates is not practicable, we prioritise those who best meet the minimum criteria, as we would for applicants who do not have a disability or long-term health condition. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the  gov.uk   website for guidance to understand your own eligibility based on the role you are applying for. Rewards and benefits   We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible  rewards package , featuring family-friendly and planet-friendly benefits including:   28 days annual leave plus bank holidays Private medical and Permanent Health Insurance   Discretionary annual bonus   Group personal pension scheme Life assurance up to 4 x annual salary   Green travel schemes     Ready to pursue your potential? Apply now. We aim to support candidates by making our interview process clear and transparent. The closing date for all applications will be 4th February. We will review applications on an ongoing basis, and shortlisted candidates can expect interviews to take place shortly after it closes. If you are shortlisted and progressed through the stages, you can expect:  A 15-minute screening call with the Hiring Manager. Final stage virtual interview via MS Teams.  If you require any reasonable adjustments during the recruitment process due to a disability or a long-term health condition, there will be an opportunity for you to inform us via the online application form. We will do our best to accommodate your needs.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. We are committed to an equitable recruitment process. As such, applications must be submitted via our official online application procedure. Please refrain from sending your CV directly to our recruiters. If you experience technical difficulties or require additional support with submitting your online application, contact the Recruiter.  Why join us  Joining us is your opportunity to pursue potential. You will belong to a collaborative team that is exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it is safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background.  We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities. If you are ready to take the next step in your Cambridge journey, we welcome your application. Together, we continue to shape a culture where everyone feels empowered to succeed and motivated to make a difference— for ourselves, for each other, and for learners worldwide.
21/01/2026
Full time
Job Title:  English Technology Platform SRE Team Lead Salary:  £68,600 - £91,700 Location:  Cambridge/Hybrid (with 2-3 days per week in office) Contract:  Permanent  Hours:  Full time Are you ready to shape the future of technology platforms at the heart of Cambridge's academic excellence? Join us as our English Technology Platform SRE Team Lead and help drive innovation, reliability, and intelligent automation in a world-class environment. We are Cambridge University Press & Assessment, a world-leading academic publisher and assessment organisation and a proud part of the University of Cambridge.  About the role   The SRE Team Lead will lead a mature Site Reliability Engineering function within the Platform Operations Team, working closely with Platform Support and Engineering teams. This role demands strong thought leadership, technical depth, and strategic direction for the discipline, with a particular emphasis on leveraging AI-driven operations (AIOps) and FinOps practices to optimise reliability, performance, and cloud spend. Although this is a hands-on technical role, the SRE Team Lead will also manage a small team of SRE, providing clear direction and ensuring consistent, data-driven, AI-enhanced service delivery across the platforms while working collaboratively with existing support and engineering groups. Apply core SRE and DevOps principles—culture, automation, testing, measurement, and continuous improvement—to build and optimise pipelines focused on rapid, reliable software delivery. Integrate AIOps capabilities, such as automated anomaly detection and intelligent alerting, to further enhance operational excellence. Work with Solutions Architecture, Development, and QA teams to automate processes wherever possible, creating and improving stable CI/CD pipelines for both software and infrastructure. Develop tools that enable rapid provisioning of environments and resources across all teams, incorporating AI-assisted automation where beneficial. Use automation, observability, and monitoring tools to improve site reliability and proactively identify issues. Support development teams with troubleshooting, particularly in infrastructure, networking, and multi-tier application design. Serve as a subject matter expert for cloud services—especially AWS PaaS—while applying FinOps practices to ensure cloud cost transparency, optimisation, and efficient resource usage. Create and maintain robust technical documentation for the infrastructure of the English platforms, including operational runbooks enhanced with predictive and AI-supported insights. Stay engaged with developments in the SRE, DevOps, AIOps, and FinOps communities, continually introducing new practices and technologies to improve reliability, performance, automation, and cloud cost efficiency   This position has been classified as a hybrid role, requiring the selected candidate to typically spend 40-60% of their time collaborating and connecting face-to-face at their dedicated location. Aside from our hybrid principles, other flexible working requests will be considered from the first day of employment, including other work arrangements should you require adjustments due to a disability or long-term health condition.    About you A passion for Site reliability engineering and driven to understand, anticipate, and counter platform related issues before they become problems and staying up to date with the latest technological trends and developments Great communication allowing effective collaboration across technical leadership and various business stakeholders with the ability to present ideas and strategies clearly and persuasively. Demonstratable soft skills in motivating, inspiring and leading a team (direct line management is not part of the roles remit) Educated to degree level or equivalent and with a minimum of 5 years proven experience in a systems administration or dev-ops blended role. Experience implementing technologies such as Terraform, Github Actions & Containerization/Orchestration e.g. Kubernetes & Docker Expertise in Monitoring tools like New Relic, Grafana, Alert Manager and site24x7. Have extreme knowledge of cloud computing infrastructure, especially using Amazon Web Services (EKS, ECS, RDS, Route53 etc.) Excellent troubleshooting, debugging, communication and documentation skills Experience of working within an Agile product development environment. For a detailed job description, please refer to the link at the bottom of the advert on our careers site. We are a Disability Confident (DC) employer that is committed to equality and inclusion ensuring our recruitment process is accessible to all. The DC scheme's   Offer of an Interview   commitment applies to applicants who opt in, and disclose a disability or a long-term health condition, and best meet the minimum criteria for the role. In instances where interviewing all qualifying candidates is not practicable, we prioritise those who best meet the minimum criteria, as we would for applicants who do not have a disability or long-term health condition. Cambridge University Press & Assessment is an approved UK employer for the sponsorship of eligible roles and applicants under the Skilled Worker visa route. Please refer to the  gov.uk   website for guidance to understand your own eligibility based on the role you are applying for. Rewards and benefits   We will support you to be at your best in work and to live well outside of it. In addition to competitive salaries, we offer a world-class, flexible  rewards package , featuring family-friendly and planet-friendly benefits including:   28 days annual leave plus bank holidays Private medical and Permanent Health Insurance   Discretionary annual bonus   Group personal pension scheme Life assurance up to 4 x annual salary   Green travel schemes     Ready to pursue your potential? Apply now. We aim to support candidates by making our interview process clear and transparent. The closing date for all applications will be 4th February. We will review applications on an ongoing basis, and shortlisted candidates can expect interviews to take place shortly after it closes. If you are shortlisted and progressed through the stages, you can expect:  A 15-minute screening call with the Hiring Manager. Final stage virtual interview via MS Teams.  If you require any reasonable adjustments during the recruitment process due to a disability or a long-term health condition, there will be an opportunity for you to inform us via the online application form. We will do our best to accommodate your needs.  Please note that successful applicants will be subject to satisfactory background checks including DBS due to working in a regulated industry. We are committed to an equitable recruitment process. As such, applications must be submitted via our official online application procedure. Please refrain from sending your CV directly to our recruiters. If you experience technical difficulties or require additional support with submitting your online application, contact the Recruiter.  Why join us  Joining us is your opportunity to pursue potential. You will belong to a collaborative team that is exploring new and better ways to serve students, teachers and researchers across the globe – for the benefit of individuals, society and the world. Sharing our mission will inspire your own growth, development and progress, in an environment which embraces difference, change and aspiration. Cambridge University Press & Assessment is committed to being a place where anyone can enjoy a successful career, where it is safe to speak up, and where we learn continuously to improve together. We welcome applications from all candidates, regardless of demographic characteristics (age, disability, educational attainment, ethnicity, gender, marital status, neurodiversity, religion, sex, gender identity and sexual identity), cultural, or social class/background.  We believe better outcomes come through diversity of thought, background and approach. We welcome applications from people from all backgrounds and communities, actively seeking to employ people from a wide range of different communities. If you are ready to take the next step in your Cambridge journey, we welcome your application. Together, we continue to shape a culture where everyone feels empowered to succeed and motivated to make a difference— for ourselves, for each other, and for learners worldwide.
ARM (Advanced Resource Managers)
Senior Site Reliability Engineer
ARM (Advanced Resource Managers)
Senior Site Reliability Engineer 6 months Remote £Negotiable - INSIDE IR35 Tech Stack Multiple Platforms and Applications AWS and Azure - Cloud Mainframe skills would be handy Latest applications on Cloud Dev Ops skills would be helpful Attitude of being part of the team and owning the outcomes Advocate - to change the culture to SRE Disclaimer: This vacancy is being advertised by either Advanced Resource Managers Limited, Advanced Resource Managers IT Limited or Advanced Resource Managers Engineering Limited ("ARM"). ARM is a specialist talent acquisition and management consultancy. We provide technical contingency recruitment and a portfolio of more complex resource solutions. Our specialist recruitment divisions cover the entire technical arena, including some of the most economically and strategically important industries in the UK and the world today. We will never send your CV without your permission. Where the role is marked as Outside IR35 in the advertisement this is subject to receipt of a final Status Determination Statement from the end Client and may be subject to change.
06/10/2025
Contractor
Senior Site Reliability Engineer 6 months Remote £Negotiable - INSIDE IR35 Tech Stack Multiple Platforms and Applications AWS and Azure - Cloud Mainframe skills would be handy Latest applications on Cloud Dev Ops skills would be helpful Attitude of being part of the team and owning the outcomes Advocate - to change the culture to SRE Disclaimer: This vacancy is being advertised by either Advanced Resource Managers Limited, Advanced Resource Managers IT Limited or Advanced Resource Managers Engineering Limited ("ARM"). ARM is a specialist talent acquisition and management consultancy. We provide technical contingency recruitment and a portfolio of more complex resource solutions. Our specialist recruitment divisions cover the entire technical arena, including some of the most economically and strategically important industries in the UK and the world today. We will never send your CV without your permission. Where the role is marked as Outside IR35 in the advertisement this is subject to receipt of a final Status Determination Statement from the end Client and may be subject to change.
Department for International Trade
Senior Site Reliability Engineer
Department for International Trade
Contents Location About the job Benefits Things you need to know Apply and further information Location Belfast, Cardiff, Darlington, Edinburgh, London About the job Summary Join a team at the heart of the global economy! We create digital services, data tools and technology for businesses to prosper around the world. Have a look at our video ! Our Digital, Data and Technology team develops and operates tools, services, and platforms that enable the UK government to provide world leading support to businesses in the UK and overseas. Youll get to constantly push boundaries in an environment free of heavy legacy, driven by curiosity, social purpose, diversity of thought, entrepreneurship, and the aspiration to offer an incredible experience to all our users. Find out more on our blog, Digital Trade. Job description Can we rely on you to make us more reliable? The Department for International Trade (DIT) helps businesses export and invest, and we need Site Reliability Engineers (SRE) to make sure our internet services work as users expect . Responsibilities As SRE you will work to give development teams the tools for their job, including application performance monitoring, exception, log and metrics aggregation, dashboards, and declarative CD/CI pipelines. Youll evangelise product teams about service-level indicators, objectives, and error budgets, and negotiate them. Youll help build and scale our global product platform. Our tech stack includes: Amazon Web Services Azure Jenkins Terraform Kubernetes Elasticsearch Python PostgreSQL Sentry Redis Jenkins Essential Skills and Experience You should be able to demonstrate essential skills and experience of: Demonstrable experience and fluency in one or more programming languages, writing clean and effective code. Ability to build code defined, reliable and well tested infrastructure on top of cloud computing systems. Experience in designing, analysing, and troubleshooting distributed systems. Knowledge of Unix fundamentals and TCP/IP networking. Ability to see user impact in the infrastructure changes. Desirable Skills and Experience While not essential, it would be ideal if you have demonstrable skills and experience of: PaaS,Kubernetes,Django,oauth2/saml2 integrations in Python. Cloud experience with either Google Cloud, Amazon Web Services or Azure. Experience coding infrastructure (i.e., Terraform). Experience in defining and measuring Service Level Objectives. Experience in observability driven development. Experience in prototyping through reuseof existingOpen Sourcecomponents. Benefits Learning and development tailored to your role An environment with flexible working options A culture encouraging inclusion and diversity A Civil Service pension with an average employer contribution of 27% Things you need to know Security Successful candidates must pass a disclosure and barring security check. Successful candidates must meet the security requirements before they can be appointed. The level of security needed is security check . See our vetting charter . People working with government assets must complete basic personnel security standard checks. Selection process details We are closely monitoring the situation regarding the coronavirus, and will be following central Government advice as it is issued. There is therefore a risk that recruitment to this post may be subject to change at short notice. In addition, where appropriate, you may be invited to attend a video interview. Please continue to follow the application process as normal and ensure that you check your emails regularly as all updates from us will be sent to you this way. Assessment and Interview As part of the application process you will be asked to upload a CV which outlines your experience, skills and fit for the role. At the sift stage for this role, Inspire People will assess you against the essential criteria listed above to compile a longlist of applications. If you are progressed through to this stage, you will be asked to complete a short, pre-recorded video interview with Inspire People or provide written answers to questions. These applications will then be sifted by DIT hiring managers. At the interview stage for this role, we will assess your technical/specialist experience, outlined in the above role description, testing your ability through relevant assessments/presentations and ask you questions around Behaviours and Technical skills, which are part of the Civil Service Success Profiles . The technical element within the interview, where you will be asked a series of questions to demonstrate your specific professional skills and knowledge related directly to the job role and context, will assess against capabilities which are outlined under DevOps engineer within the DDaT framework which can be found here: . These Technical Skills include: Availability and capacity management Development process optimisations Information security Modern standards approach Programming and build (software engineering) Prototyping Service support Systems design Systems integration User focus You will also be assessed against the Behaviours of: Making Effective Decisions Communicating and influencing Developing Self and Others Changing and Improving Offer Stage Appointments may be made to candidates in merit order based on location preferences. The salary we will offer is determined using interview performance. Scores at interview translate to proficiency levels and an associated salary. Once a successful candidate has a proficiency level and is part of the capability framework, they will be given opportunities to self-assess to progress through the pay scale within their grade during their time at DIT. For further explanation of proficiency levels and more information about DDaT click here. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. Candidates who pass the bar at interview but are not the highest scoring will be held on a 12-month reserve list for future appointments. Candidates who are judged to be a near miss at interview may be offered a post at the grade below the one advertised. If successful and transferring from another Government Department a criminal record check may be carried out. Harmonised terms and conditions are attached. Please take time to read the document to determine how these may affect you. Please note the successful candidate will be expected to remain in post for a minimum of 18 months before being released for another role. Any move to the Department for International Trade from another employer will mean you can no longer access childcare vouchers. This includes moves between government departments. You may however be eligible for other government schemes, including Tax Free Childcare. Determine your eligibility at New entrants are expected to join on the minimum of the pay band. Reasonable adjustment If a person with disabilities is put at a substantial disadvantage compared to a non-disabled person, we have a duty to make reasonable changes to our processes. If you need a change to be made so that you can make your application, you should contact the DDaT Recruitment team before the closing date to discuss your needs. Our recruitment process is underpinned by the principle of appointment on the basis of fair and open competition and appointment on merit, as outlined in the Civil Service Commissioners Recruitment Principles. If you feel your application has not been treated in accordance with these principles and you wish to make a complaint, you should in the first instance contact DIT by email: If you are not satisfied with the response you receive from the Department, you can contact the Civil Service Commission: Click here to visit Civil Service Commission. If you are experiencing accessibility problems with any attachments on this advert, please contact the email address in the 'Contact point for applicants' section. For further information and to apply please click the link to direct you to the advertisers website. Further Information This role requires SC clearance, a condition of which is to have been present in the UK for 3 out of the past 5 years. For more information on security clearance, the Civil Service Code, our recruitment principles, and our complaints procedure, click here. Feedback will only be provided if you attend an interview or assessment. Nationality requirements This job is broadly open to the following groups: ..... click apply for full job details
24/09/2022
Full time
Contents Location About the job Benefits Things you need to know Apply and further information Location Belfast, Cardiff, Darlington, Edinburgh, London About the job Summary Join a team at the heart of the global economy! We create digital services, data tools and technology for businesses to prosper around the world. Have a look at our video ! Our Digital, Data and Technology team develops and operates tools, services, and platforms that enable the UK government to provide world leading support to businesses in the UK and overseas. Youll get to constantly push boundaries in an environment free of heavy legacy, driven by curiosity, social purpose, diversity of thought, entrepreneurship, and the aspiration to offer an incredible experience to all our users. Find out more on our blog, Digital Trade. Job description Can we rely on you to make us more reliable? The Department for International Trade (DIT) helps businesses export and invest, and we need Site Reliability Engineers (SRE) to make sure our internet services work as users expect . Responsibilities As SRE you will work to give development teams the tools for their job, including application performance monitoring, exception, log and metrics aggregation, dashboards, and declarative CD/CI pipelines. Youll evangelise product teams about service-level indicators, objectives, and error budgets, and negotiate them. Youll help build and scale our global product platform. Our tech stack includes: Amazon Web Services Azure Jenkins Terraform Kubernetes Elasticsearch Python PostgreSQL Sentry Redis Jenkins Essential Skills and Experience You should be able to demonstrate essential skills and experience of: Demonstrable experience and fluency in one or more programming languages, writing clean and effective code. Ability to build code defined, reliable and well tested infrastructure on top of cloud computing systems. Experience in designing, analysing, and troubleshooting distributed systems. Knowledge of Unix fundamentals and TCP/IP networking. Ability to see user impact in the infrastructure changes. Desirable Skills and Experience While not essential, it would be ideal if you have demonstrable skills and experience of: PaaS,Kubernetes,Django,oauth2/saml2 integrations in Python. Cloud experience with either Google Cloud, Amazon Web Services or Azure. Experience coding infrastructure (i.e., Terraform). Experience in defining and measuring Service Level Objectives. Experience in observability driven development. Experience in prototyping through reuseof existingOpen Sourcecomponents. Benefits Learning and development tailored to your role An environment with flexible working options A culture encouraging inclusion and diversity A Civil Service pension with an average employer contribution of 27% Things you need to know Security Successful candidates must pass a disclosure and barring security check. Successful candidates must meet the security requirements before they can be appointed. The level of security needed is security check . See our vetting charter . People working with government assets must complete basic personnel security standard checks. Selection process details We are closely monitoring the situation regarding the coronavirus, and will be following central Government advice as it is issued. There is therefore a risk that recruitment to this post may be subject to change at short notice. In addition, where appropriate, you may be invited to attend a video interview. Please continue to follow the application process as normal and ensure that you check your emails regularly as all updates from us will be sent to you this way. Assessment and Interview As part of the application process you will be asked to upload a CV which outlines your experience, skills and fit for the role. At the sift stage for this role, Inspire People will assess you against the essential criteria listed above to compile a longlist of applications. If you are progressed through to this stage, you will be asked to complete a short, pre-recorded video interview with Inspire People or provide written answers to questions. These applications will then be sifted by DIT hiring managers. At the interview stage for this role, we will assess your technical/specialist experience, outlined in the above role description, testing your ability through relevant assessments/presentations and ask you questions around Behaviours and Technical skills, which are part of the Civil Service Success Profiles . The technical element within the interview, where you will be asked a series of questions to demonstrate your specific professional skills and knowledge related directly to the job role and context, will assess against capabilities which are outlined under DevOps engineer within the DDaT framework which can be found here: . These Technical Skills include: Availability and capacity management Development process optimisations Information security Modern standards approach Programming and build (software engineering) Prototyping Service support Systems design Systems integration User focus You will also be assessed against the Behaviours of: Making Effective Decisions Communicating and influencing Developing Self and Others Changing and Improving Offer Stage Appointments may be made to candidates in merit order based on location preferences. The salary we will offer is determined using interview performance. Scores at interview translate to proficiency levels and an associated salary. Once a successful candidate has a proficiency level and is part of the capability framework, they will be given opportunities to self-assess to progress through the pay scale within their grade during their time at DIT. For further explanation of proficiency levels and more information about DDaT click here. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. Candidates who pass the bar at interview but are not the highest scoring will be held on a 12-month reserve list for future appointments. Candidates who are judged to be a near miss at interview may be offered a post at the grade below the one advertised. If successful and transferring from another Government Department a criminal record check may be carried out. Harmonised terms and conditions are attached. Please take time to read the document to determine how these may affect you. Please note the successful candidate will be expected to remain in post for a minimum of 18 months before being released for another role. Any move to the Department for International Trade from another employer will mean you can no longer access childcare vouchers. This includes moves between government departments. You may however be eligible for other government schemes, including Tax Free Childcare. Determine your eligibility at New entrants are expected to join on the minimum of the pay band. Reasonable adjustment If a person with disabilities is put at a substantial disadvantage compared to a non-disabled person, we have a duty to make reasonable changes to our processes. If you need a change to be made so that you can make your application, you should contact the DDaT Recruitment team before the closing date to discuss your needs. Our recruitment process is underpinned by the principle of appointment on the basis of fair and open competition and appointment on merit, as outlined in the Civil Service Commissioners Recruitment Principles. If you feel your application has not been treated in accordance with these principles and you wish to make a complaint, you should in the first instance contact DIT by email: If you are not satisfied with the response you receive from the Department, you can contact the Civil Service Commission: Click here to visit Civil Service Commission. If you are experiencing accessibility problems with any attachments on this advert, please contact the email address in the 'Contact point for applicants' section. For further information and to apply please click the link to direct you to the advertisers website. Further Information This role requires SC clearance, a condition of which is to have been present in the UK for 3 out of the past 5 years. For more information on security clearance, the Civil Service Code, our recruitment principles, and our complaints procedure, click here. Feedback will only be provided if you attend an interview or assessment. Nationality requirements This job is broadly open to the following groups: ..... click apply for full job details
Department for International Trade
Lead site Reliability Engineer
Department for International Trade
Contents Location About the job Benefits Things you need to know Apply and further information Location Belfast, Cardiff, Darlington, Edinburgh, London About the job Summary Join a team at the heart of the global economy! We create digital services, data tools and technology for businesses to prosper around the world. Have a look at our video ! Our Digital, Data and Technology team develops and operates tools, services, and platforms that enable the UK government to provide world leading support to businesses in the UK and overseas. Youll get to constantly push boundaries in an environment free of heavy legacy, driven by curiosity, social purpose, diversity of thought, entrepreneurship, and the aspiration to offer an incredible experience to all our users. Find out more on our blog, Digital Trade. Job description As our Lead Site Reliability Engineer, you will lead a team of site reliability engineers who are committed to delivering excellent services and continual improvement. You will drive adaption of best practices and be responsible for managingour platform hosting. You will influence our future hosting strategy, helping to develop the team's roadmap of work and lead the support of several services offerings including CI/CD, Account Management, Containerisation, Network Connectivity, Cloud Cost Optimisation, Service Performance and Availability. Working with development teams to create reusable components, enabling service delivery at pace, you will automate the oversight of systems at scale, covering a hybrid cloud environment, including but not limited to AWS, GovUK PaaS, & Azure. Responsibilities In your day-to-day role, you will: Set the SRE teams technical direction, working with the technology leadership team. Provide technical leadership & guidance to the SRE team through coaching and mentoring. Lead the sharing of knowledge and good practice to develop the teams capability. Identify and lead on modernization initiatives through continuous improvement. Give development teams the tools for their job, including infrastructure, APM, exception, log aggregation, dashboards, and declarative CD/CI pipelines. Collaboratively develop the future hosting strategy. Actively lead the support of service offerings (Account Management, Security, CI/CD, Automation, Containerization, Service Performance & cloud Infrastructure). Ensure security, stability, and capacity are embedded in services deployments. Champion the adoption of emerging technology to automate tasks & deployments. Solve complex issues using root cause analysis, progressing opportunities to improve reliability, security, capability of infrastructure, application, and site services. Essential Skills and Experience Youll have demonstrable skills and experience of: Team leadership: managing workload, coaching, and mentoring technical staff. Setting the direction for technical teams, and liaising with colleagues to establish requirements and identify, propose, initiate work. Identifying good practices, sharing experiences, and championing your team's agenda, acting as the voice of the team. Working with cloud technology and the use of orchestration tools, developing infrastructure as code on top of cloud computing services. Deploying & managing CI/CD pipelines. Identification of process optimization opportunities and leading teams to deliver service improvements. Experience of information security, designing, quality-reviewing and quality-assurance solutions with security controls embedded. Designing and reviewing systems, selecting appropriate design standards, methods and tools and ensure they are applied effectively. Benefits Learning and development tailored to your role An environment with flexible working options A culture encouraging inclusion and diversity A Civil Service pension with an average employer contribution of 27% Things you need to know Security Successful candidates must pass a disclosure and barring security check. Successful candidates must meet the security requirements before they can be appointed. The level of security needed is security check . See our vetting charter . People working with government assets must complete basic personnel security standard checks. Selection process details We are closely monitoring the situation regarding the coronavirus, and will be following central Government advice as it is issued. There is therefore a risk that recruitment to this post may be subject to change at short notice. In addition, where appropriate, you may be invited to attend a video interview. Please continue to follow the application process as normal and ensure that you check your emails regularly as all updates from us will be sent to you this way. Assessment and Interview As part of the application process you will be asked to upload a CV which outlines your experience, skills and fit for the role. At the sift stage for this role, Inspire People will assess you against the essential criteria listed above to compile a long list of applications. If you are progressed through to this stage, you will be asked to complete a short, pre-recorded video interview with Inspire People or provide written answers to questions. These applications will then be sifted by DIT hiring managers. Initial sifting will take place the week commencing 26th September, with CV submissions to DIT on the 30th September. Interviews will take place the week commencing 10th October. Please note that these dates are indicative and may be subject to change. At the interview stage for this role, we will assess your technical/specialist experience, outlined in the above role description, testing your ability through relevant assessments/presentations and ask you questions around Behaviours and Technical skills, which are part of the Civil Service Success Profiles . The technical element within the interview, where you will be asked a series of questions to demonstrate your specific professional skills and knowledge related directly to the job role and context, will assess against these Technical Skills: Availability and capacity management Information security Modern standards approach Programming and build (software engineering) Prototyping Service support Systems design Systems integration User focus You will also be assessed against the Behaviours of: Communicating and Influencing Developing Self and Others Changing and Improving Making Effective Decisions Offer Stage Appointments may be made to candidates in merit order based on location preferences. The salary we will offer is determined using interview performance. Scores at interview translate to proficiency levels and an associated salary. Once a successful candidate has a proficiency level and is part of the capability framework, they will be given opportunities to self-assess to progress through the pay scale within their grade during their time at DIT. For further explanation of proficiency levels and more information about DDaT click here. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. Candidates who pass the bar at interview but are not the highest scoring will be held on a 12-month reserve list for future appointments. Candidates who are judged to be a near miss at interview may be offered a post at the grade below the one advertised. If successful and transferring from another Government Department a criminal record check may be carried out. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. Harmonised terms and conditions are attached. Please take time to read the document to determine how these may affect you. Please note the successful candidate will be expected to remain in post for a minimum of 18 months before being released for another role. Any move to the Department for International Trade from another employer will mean you can no longer access childcare vouchers. This includes moves between government departments. You may however be eligible for other government schemes, including Tax Free Childcare. Determine your eligibility at New entrants are expected to join on the minimum of the pay band. Reasonable adjustment If a person with disabilities is put at a substantial disadvantage compared to a non-disabled person, we have a duty to make reasonable changes to our processes. If you need a change to be made so that you can make your application, you should contact the DDaT Recruitment team before the closing date to discuss your needs. ..... click apply for full job details
23/09/2022
Full time
Contents Location About the job Benefits Things you need to know Apply and further information Location Belfast, Cardiff, Darlington, Edinburgh, London About the job Summary Join a team at the heart of the global economy! We create digital services, data tools and technology for businesses to prosper around the world. Have a look at our video ! Our Digital, Data and Technology team develops and operates tools, services, and platforms that enable the UK government to provide world leading support to businesses in the UK and overseas. Youll get to constantly push boundaries in an environment free of heavy legacy, driven by curiosity, social purpose, diversity of thought, entrepreneurship, and the aspiration to offer an incredible experience to all our users. Find out more on our blog, Digital Trade. Job description As our Lead Site Reliability Engineer, you will lead a team of site reliability engineers who are committed to delivering excellent services and continual improvement. You will drive adaption of best practices and be responsible for managingour platform hosting. You will influence our future hosting strategy, helping to develop the team's roadmap of work and lead the support of several services offerings including CI/CD, Account Management, Containerisation, Network Connectivity, Cloud Cost Optimisation, Service Performance and Availability. Working with development teams to create reusable components, enabling service delivery at pace, you will automate the oversight of systems at scale, covering a hybrid cloud environment, including but not limited to AWS, GovUK PaaS, & Azure. Responsibilities In your day-to-day role, you will: Set the SRE teams technical direction, working with the technology leadership team. Provide technical leadership & guidance to the SRE team through coaching and mentoring. Lead the sharing of knowledge and good practice to develop the teams capability. Identify and lead on modernization initiatives through continuous improvement. Give development teams the tools for their job, including infrastructure, APM, exception, log aggregation, dashboards, and declarative CD/CI pipelines. Collaboratively develop the future hosting strategy. Actively lead the support of service offerings (Account Management, Security, CI/CD, Automation, Containerization, Service Performance & cloud Infrastructure). Ensure security, stability, and capacity are embedded in services deployments. Champion the adoption of emerging technology to automate tasks & deployments. Solve complex issues using root cause analysis, progressing opportunities to improve reliability, security, capability of infrastructure, application, and site services. Essential Skills and Experience Youll have demonstrable skills and experience of: Team leadership: managing workload, coaching, and mentoring technical staff. Setting the direction for technical teams, and liaising with colleagues to establish requirements and identify, propose, initiate work. Identifying good practices, sharing experiences, and championing your team's agenda, acting as the voice of the team. Working with cloud technology and the use of orchestration tools, developing infrastructure as code on top of cloud computing services. Deploying & managing CI/CD pipelines. Identification of process optimization opportunities and leading teams to deliver service improvements. Experience of information security, designing, quality-reviewing and quality-assurance solutions with security controls embedded. Designing and reviewing systems, selecting appropriate design standards, methods and tools and ensure they are applied effectively. Benefits Learning and development tailored to your role An environment with flexible working options A culture encouraging inclusion and diversity A Civil Service pension with an average employer contribution of 27% Things you need to know Security Successful candidates must pass a disclosure and barring security check. Successful candidates must meet the security requirements before they can be appointed. The level of security needed is security check . See our vetting charter . People working with government assets must complete basic personnel security standard checks. Selection process details We are closely monitoring the situation regarding the coronavirus, and will be following central Government advice as it is issued. There is therefore a risk that recruitment to this post may be subject to change at short notice. In addition, where appropriate, you may be invited to attend a video interview. Please continue to follow the application process as normal and ensure that you check your emails regularly as all updates from us will be sent to you this way. Assessment and Interview As part of the application process you will be asked to upload a CV which outlines your experience, skills and fit for the role. At the sift stage for this role, Inspire People will assess you against the essential criteria listed above to compile a long list of applications. If you are progressed through to this stage, you will be asked to complete a short, pre-recorded video interview with Inspire People or provide written answers to questions. These applications will then be sifted by DIT hiring managers. Initial sifting will take place the week commencing 26th September, with CV submissions to DIT on the 30th September. Interviews will take place the week commencing 10th October. Please note that these dates are indicative and may be subject to change. At the interview stage for this role, we will assess your technical/specialist experience, outlined in the above role description, testing your ability through relevant assessments/presentations and ask you questions around Behaviours and Technical skills, which are part of the Civil Service Success Profiles . The technical element within the interview, where you will be asked a series of questions to demonstrate your specific professional skills and knowledge related directly to the job role and context, will assess against these Technical Skills: Availability and capacity management Information security Modern standards approach Programming and build (software engineering) Prototyping Service support Systems design Systems integration User focus You will also be assessed against the Behaviours of: Communicating and Influencing Developing Self and Others Changing and Improving Making Effective Decisions Offer Stage Appointments may be made to candidates in merit order based on location preferences. The salary we will offer is determined using interview performance. Scores at interview translate to proficiency levels and an associated salary. Once a successful candidate has a proficiency level and is part of the capability framework, they will be given opportunities to self-assess to progress through the pay scale within their grade during their time at DIT. For further explanation of proficiency levels and more information about DDaT click here. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. Candidates who pass the bar at interview but are not the highest scoring will be held on a 12-month reserve list for future appointments. Candidates who are judged to be a near miss at interview may be offered a post at the grade below the one advertised. If successful and transferring from another Government Department a criminal record check may be carried out. The Department for International Trade embraces and values diversity in all forms. We welcome and pride ourselves on the positive impact diversity has on the work we do, and we promote equality of opportunity throughout the organisation. Harmonised terms and conditions are attached. Please take time to read the document to determine how these may affect you. Please note the successful candidate will be expected to remain in post for a minimum of 18 months before being released for another role. Any move to the Department for International Trade from another employer will mean you can no longer access childcare vouchers. This includes moves between government departments. You may however be eligible for other government schemes, including Tax Free Childcare. Determine your eligibility at New entrants are expected to join on the minimum of the pay band. Reasonable adjustment If a person with disabilities is put at a substantial disadvantage compared to a non-disabled person, we have a duty to make reasonable changes to our processes. If you need a change to be made so that you can make your application, you should contact the DDaT Recruitment team before the closing date to discuss your needs. ..... click apply for full job details
Lorien
Senior Azure Site Reliability Engineer
Lorien Huddersfield, Yorkshire
Senior Azure Site Reliability Engineer £80,000 - £100,000 + Bonus + Benefits UK Wide A business who builds cutting-edge, "super-greenfield" platforms for world-leading software companies spanning across the Banking, Government, and Investment Management sectors is looking for an experienced Azure SRE to join the team. Currently sitting at 40 employees, this company possess a world-class engineering team, where collaboration is key, and quality of the products they build for customers comes first. Your Main Role: You will work with the DevOps and Operations teams to conquer engineering best practices, meet release deadlines, and push yourselves to the limit to build incredible platforms within the Microsoft stack. Your profile: Extensive knowledge of the Azure services Strong experience working with Azure DevOps. Deploying a variety of Azure services; desirably experience in a couple of the following: Front Door, Azure Firewall, Functions, App services, Azure SQL, Redis Cache. Experience with configuration management tools and infrastructure as code; Terraform, ARM templates, Cloud Formation, Deployment manager, YAML or PowerShell DSC. Previous leadership experience Remote or Hybrid? Fully Remote Senior Azure Site Reliability Engineer £80,000 - £100,000 + Bonus + Benefits UK Wide IND_PC1 Carbon60, Lorien, SRG - the Impellam Group STEM Portfolio is acting as an Employment Business in relation to this vacancy.
21/09/2022
Full time
Senior Azure Site Reliability Engineer £80,000 - £100,000 + Bonus + Benefits UK Wide A business who builds cutting-edge, "super-greenfield" platforms for world-leading software companies spanning across the Banking, Government, and Investment Management sectors is looking for an experienced Azure SRE to join the team. Currently sitting at 40 employees, this company possess a world-class engineering team, where collaboration is key, and quality of the products they build for customers comes first. Your Main Role: You will work with the DevOps and Operations teams to conquer engineering best practices, meet release deadlines, and push yourselves to the limit to build incredible platforms within the Microsoft stack. Your profile: Extensive knowledge of the Azure services Strong experience working with Azure DevOps. Deploying a variety of Azure services; desirably experience in a couple of the following: Front Door, Azure Firewall, Functions, App services, Azure SQL, Redis Cache. Experience with configuration management tools and infrastructure as code; Terraform, ARM templates, Cloud Formation, Deployment manager, YAML or PowerShell DSC. Previous leadership experience Remote or Hybrid? Fully Remote Senior Azure Site Reliability Engineer £80,000 - £100,000 + Bonus + Benefits UK Wide IND_PC1 Carbon60, Lorien, SRG - the Impellam Group STEM Portfolio is acting as an Employment Business in relation to this vacancy.

Modal Window

  • Home
  • Contact
  • About Us
  • FAQs
  • Terms & Conditions
  • Privacy
  • Employer
  • Post a Job
  • Search Resumes
  • Sign in
  • Job Seeker
  • Find Jobs
  • Create Resume
  • Sign in
  • IT blog
  • Facebook
  • Twitter
  • LinkedIn
  • Youtube
© 2008-2026 IT Job Board