Browse IT Jobs | IT Job Board

Senior SRE

Dormont Manufacturing Co

Our Story So Far: Since our founding in 2019, Pigment has become one of the fastest-growing SaaS companies in the world today. Our product, a highly efficient Enterprise Performance Management (EPM) platform is helping companies achieve their financial goals by quickly responding to dynamic factors in their respective markets including Tech, Retail, CPG & Financial Services. In less than 5 years, Pigment has grown to over 450 employees across offices in New York, Toronto, London, Paris and soon San Mateo, and attracted a total of $393M in investment from some of the top Venture Capital firms globally. We serve companies including Unilever, Deliveroo, Gong and Brex to name a few! The opportunity We are looking for a Senior SRE profile who will design and implement the Pigment infrastructure for tomorrow. Pigment is a technically challenging platform. It calculates and synchronizes large datasets that must be updated in real-time. Data comes from client models that can be up to millions of rows. This data can be pivoted, transformed, and aggregated on-demand through our formula engine while rendering live on our front end. Here are the challenges to be tackled as an SRE : Define and build the infrastructure needed to answer our performance challenges, automate it, and make it scalable. In particular, ensure that the infrastructure scales in and out according to the platform usage. Secure high availability and redundancy of the Pigment platform. Ensure that the platform's performance and correctness are monitored accordingly, spread observability best practices across the engineering team. Participate in incident response. Accompany Pigment geographical expansion as the company grows and we sign clients overseas. Work with our security team on the implementation of their roadmap when related to infrastructure and development pipelines (code repository, credentials management, ) Continuously chase inefficiencies within our development practices and pipelines. Seek improvement and automation wherever possible. Drive change across the software engineering team. In addition to SRE responsibilities, we expect you to contribute to software development activities. We believe SRE engineers should collaborate closely with software engineers, fostering a shared understanding of day-to-day challenges, rather than operating in separate teams. Last but not least, you will not be alone! You will be part of an SRE team. Our Engineering team Our Engineering team is responsible for developing our SaaS platform and building a comprehensive and user-friendly product. Pigment engineers participate in the entire application development lifecycle, focusing on design, coding, and keeping the production platform up and running. They can be specialized, but there is no strict separation between the infrastructure, backend, and the frontend. We value user-centricity and pragmatism: we choose the most relevant tools for the problem we have to solve, understanding the strengths and constraints of each technology. Our engineering culture also values curiosity, humility, trust, ownership and team spirit. Technical stack: Kubernetes (GKE, hosted on Google Cloud Platform): hosting all our infrastructure components except some PostgreSQL hosts, hosted on Google Cloud SQL Terraform: all our infrastructure is managed via this infrastructure as code (creation of GKE clusters, Google Cloud SQL databases, GCS buckets, IAM permissions ) Databases: PostgreSQL (CloudSQL and CloudNativePG), SingleStore, ElasticSearch RabbitMQ: queuing system Temporal.io: workflow automation platform Others: ArgoCD, Istio, Vault, Github, Docker (for local development) Backend: Microservices written in C# ASP.NET Core 8 (running exclusively on Linux) & Golang Frontend: React + Typescript, Jest, Cypress, Vite Who you are More than knowledge of a specific cloud provider, language, or automation framework, we are looking for great engineering skills: the ability to translate product requirements into an elegant and simple architecture, and then make sure our product runs well on it. We are also looking for engineers who understand the product and the customer's needs in detail and can suggest innovative ideas: in the end, it's all about delivering value to end-users. In any case, you have: Experience as a software engineer & DevOps / SRE Experience with a container orchestration platform (Kubernetes is a plus) Experience with a public cloud provider, GCP is a plus Proven experience in software developments with languages such as C#, Java, C++, Golang, Rust, JavaScript, Python, or Ruby (this list is not exhaustive). Experience with observability tools (e.g. Datadog, Prometheus, ELK, Jaeger ) Great team spirit with a problem-solving attitude. A good dose of humility and the willingness to grow (no matter your seniority!). Fluency in English Pigment is an equal opportunity employer. We believe diversity is a strength and fosters innovation. We are committed to enabling everyone to feel included and valued at the workplace. All qualified applicants will receive consideration for employment without regard to age, color, family, gender identity, marital status, national origin, physical or mental disability, sex (including pregnancy), sexual orientation, social origin, or any other characteristic protected by applicable laws. We may process your personal data in accordance with our HR Data Protection Notice.

09/06/2026

Full time

Our Story So Far: Since our founding in 2019, Pigment has become one of the fastest-growing SaaS companies in the world today. Our product, a highly efficient Enterprise Performance Management (EPM) platform is helping companies achieve their financial goals by quickly responding to dynamic factors in their respective markets including Tech, Retail, CPG & Financial Services. In less than 5 years, Pigment has grown to over 450 employees across offices in New York, Toronto, London, Paris and soon San Mateo, and attracted a total of $393M in investment from some of the top Venture Capital firms globally. We serve companies including Unilever, Deliveroo, Gong and Brex to name a few! The opportunity We are looking for a Senior SRE profile who will design and implement the Pigment infrastructure for tomorrow. Pigment is a technically challenging platform. It calculates and synchronizes large datasets that must be updated in real-time. Data comes from client models that can be up to millions of rows. This data can be pivoted, transformed, and aggregated on-demand through our formula engine while rendering live on our front end. Here are the challenges to be tackled as an SRE : Define and build the infrastructure needed to answer our performance challenges, automate it, and make it scalable. In particular, ensure that the infrastructure scales in and out according to the platform usage. Secure high availability and redundancy of the Pigment platform. Ensure that the platform's performance and correctness are monitored accordingly, spread observability best practices across the engineering team. Participate in incident response. Accompany Pigment geographical expansion as the company grows and we sign clients overseas. Work with our security team on the implementation of their roadmap when related to infrastructure and development pipelines (code repository, credentials management, ) Continuously chase inefficiencies within our development practices and pipelines. Seek improvement and automation wherever possible. Drive change across the software engineering team. In addition to SRE responsibilities, we expect you to contribute to software development activities. We believe SRE engineers should collaborate closely with software engineers, fostering a shared understanding of day-to-day challenges, rather than operating in separate teams. Last but not least, you will not be alone! You will be part of an SRE team. Our Engineering team Our Engineering team is responsible for developing our SaaS platform and building a comprehensive and user-friendly product. Pigment engineers participate in the entire application development lifecycle, focusing on design, coding, and keeping the production platform up and running. They can be specialized, but there is no strict separation between the infrastructure, backend, and the frontend. We value user-centricity and pragmatism: we choose the most relevant tools for the problem we have to solve, understanding the strengths and constraints of each technology. Our engineering culture also values curiosity, humility, trust, ownership and team spirit. Technical stack: Kubernetes (GKE, hosted on Google Cloud Platform): hosting all our infrastructure components except some PostgreSQL hosts, hosted on Google Cloud SQL Terraform: all our infrastructure is managed via this infrastructure as code (creation of GKE clusters, Google Cloud SQL databases, GCS buckets, IAM permissions ) Databases: PostgreSQL (CloudSQL and CloudNativePG), SingleStore, ElasticSearch RabbitMQ: queuing system Temporal.io: workflow automation platform Others: ArgoCD, Istio, Vault, Github, Docker (for local development) Backend: Microservices written in C# ASP.NET Core 8 (running exclusively on Linux) & Golang Frontend: React + Typescript, Jest, Cypress, Vite Who you are More than knowledge of a specific cloud provider, language, or automation framework, we are looking for great engineering skills: the ability to translate product requirements into an elegant and simple architecture, and then make sure our product runs well on it. We are also looking for engineers who understand the product and the customer's needs in detail and can suggest innovative ideas: in the end, it's all about delivering value to end-users. In any case, you have: Experience as a software engineer & DevOps / SRE Experience with a container orchestration platform (Kubernetes is a plus) Experience with a public cloud provider, GCP is a plus Proven experience in software developments with languages such as C#, Java, C++, Golang, Rust, JavaScript, Python, or Ruby (this list is not exhaustive). Experience with observability tools (e.g. Datadog, Prometheus, ELK, Jaeger ) Great team spirit with a problem-solving attitude. A good dose of humility and the willingness to grow (no matter your seniority!). Fluency in English Pigment is an equal opportunity employer. We believe diversity is a strength and fosters innovation. We are committed to enabling everyone to feel included and valued at the workplace. All qualified applicants will receive consideration for employment without regard to age, color, family, gender identity, marital status, national origin, physical or mental disability, sex (including pregnancy), sexual orientation, social origin, or any other characteristic protected by applicable laws. We may process your personal data in accordance with our HR Data Protection Notice.

Site Reliability Engineer

VIQU IT

Senior Site Reliability Engineer (AWS / CDK / TypeScript) Remote First Occasional travel to Leeds £40,000 - £50,000 + benefits No Sponsorship Available VIQU have partnered with a major UK technology-led organisation undergoing a significant transformation following a large-scale business merger. As part of a wider move away from contractor-heavy delivery, they are investing heavily in permanent engineering talent and building out a high-performing cloud and platform function. They are looking for a Site Reliability Engineer to help improve the reliability, scalability and automation of their AWS estate. This is a hands-on engineering role working across cloud infrastructure, observability, CI/CD and platform tooling, helping development teams deliver faster and more reliably. You ll be joining a collaborative engineering environment with the opportunity to influence platform standards, improve operational resilience and support modern DevOps and SRE practices across the business. Key responsibilities: Build, maintain and improve scalable AWS infrastructure. Develop and manage Infrastructure as Code using AWS CDK. Support CI/CD pipelines and deployment automation. Improve monitoring, logging and observability across distributed systems. Support incident management, root cause analysis and platform reliability improvements. Work closely with engineering and architecture teams to improve operational performance and security. Contribute to cloud best practice, automation and platform engineering standards. Key requirements: Strong experience in a Site Reliability Engineering, DevOps or Platform Engineering role. Strong AWS experience within production environments. Experience with AWS CDK (TypeScript preferred). Strong TypeScript experience. Experience with CI/CD tooling such as Jenkins or GitLab CI. Containerisation experience with Docker, Kubernetes, EKS or ECS. Experience with observability tooling such as Prometheus, Grafana, AppDynamics or OpenSearch. Experience with scripting or development using Python, TypeScript or Java. Understanding of cloud security and reliability best practices. AWS certifications are desirable but not essential. Apply now to speak with VIQU IT in confidence. Or contact Aaron Chiverton on (url removed) . Know someone great? Refer them and receive up to £1,000 if successful (terms apply). For more exciting roles and opportunities, follow us on IT Recruitment.

08/06/2026

Full time

Senior Site Reliability Engineer (AWS / CDK / TypeScript) Remote First Occasional travel to Leeds £40,000 - £50,000 + benefits No Sponsorship Available VIQU have partnered with a major UK technology-led organisation undergoing a significant transformation following a large-scale business merger. As part of a wider move away from contractor-heavy delivery, they are investing heavily in permanent engineering talent and building out a high-performing cloud and platform function. They are looking for a Site Reliability Engineer to help improve the reliability, scalability and automation of their AWS estate. This is a hands-on engineering role working across cloud infrastructure, observability, CI/CD and platform tooling, helping development teams deliver faster and more reliably. You ll be joining a collaborative engineering environment with the opportunity to influence platform standards, improve operational resilience and support modern DevOps and SRE practices across the business. Key responsibilities: Build, maintain and improve scalable AWS infrastructure. Develop and manage Infrastructure as Code using AWS CDK. Support CI/CD pipelines and deployment automation. Improve monitoring, logging and observability across distributed systems. Support incident management, root cause analysis and platform reliability improvements. Work closely with engineering and architecture teams to improve operational performance and security. Contribute to cloud best practice, automation and platform engineering standards. Key requirements: Strong experience in a Site Reliability Engineering, DevOps or Platform Engineering role. Strong AWS experience within production environments. Experience with AWS CDK (TypeScript preferred). Strong TypeScript experience. Experience with CI/CD tooling such as Jenkins or GitLab CI. Containerisation experience with Docker, Kubernetes, EKS or ECS. Experience with observability tooling such as Prometheus, Grafana, AppDynamics or OpenSearch. Experience with scripting or development using Python, TypeScript or Java. Understanding of cloud security and reliability best practices. AWS certifications are desirable but not essential. Apply now to speak with VIQU IT in confidence. Or contact Aaron Chiverton on (url removed) . Know someone great? Refer them and receive up to £1,000 if successful (terms apply). For more exciting roles and opportunities, follow us on IT Recruitment.

Senior DevOps Cloud Engineer

Target Cardiff, South Glamorgan

Senior DevOps Cloud Engineer Permanent Hybrid (flexible) Up to £50,000 At Target Group, we build secure, scalable technology that powers critical services - and we're looking for a Senior DevOps Cloud Engineer to help lead the next phase of our cloud and platform journey. This is an exciting opportunity for a hands on technical leader who thrives in AWS, DevOps automation, cloud networking and security. If you enjoy shaping engineering standards, mentoring others and building resilient platforms that enable modern delivery at scale, we'd love to hear from you. What can you expect? As a Senior DevOps Cloud Engineer, you'll be a technical leader responsible for designing, building, securing and optimising cloud infrastructure and networking on the Amazon Web Services (AWS) platform. You'll lead by example through hands on engineering while providing technical direction, standards and guidance across our DevOps and cloud engineering community. From shaping highly scalable, resilient and cost efficient platforms to influencing tooling, automation and operational excellence, you'll play a key role in how we build for the future. Working closely with DevOps, Security, SRE, platform and delivery teams, you'll help shape secure by design architectures, drive best practice in automation and reliability, and support a collaborative engineering culture focused on continuous improvement. What you'll be doing Acting as a senior technical authority for DevOps, cloud engineering and AWS platform design Leading the design, deployment and optimisation of secure, scalable and highly available AWS architectures Defining engineering standards, patterns and best practices for infrastructure, networking, security and automation Providing technical leadership for AWS networking, including VPC design, Transit Gateway, Route 53, load balancing and hybrid connectivity Championing Infrastructure as Code, CI/CD and GitOps approaches using tools such as Terraform, Terragrunt, CloudFormation, GitHub Actions, Jenkins or GitLab CI Embedding security, observability, compliance and reliability into cloud-native and hybrid platforms Mentoring engineers, guiding architectural decisions and helping raise capability across the wider engineering community What we're looking for We're looking for someone who combines deep technical expertise with strong leadership and collaboration skills - someone who enjoys solving complex infrastructure challenges and supporting others to do their best work. Extensive hands on AWS experience across compute, networking, storage and security Deep expertise in AWS networking, including VPCs, Transit Gateway, Route 53, ALB/NLB and PrivateLink Strong experience with Infrastructure as Code using Terraform, Terragrunt and/or CloudFormation Proven experience with CI/CD, GitOps, containerisation and Kubernetes/EKS, plus strong scripting skills in Python, Bash or PowerShell Desirable (but not essential): Experience supporting large scale data or analytics platforms Knowledge of Zero Trust and identity centric security models Experience with SIEM/SOC tooling and cloud threat detection capabilities Exposure to AWS Control Tower, Organisations and multi account governance AWS certifications such as Solutions Architect, SysOps, Security Specialty or FinOps related accreditation A degree in Computer Science, IT, Cloud Computing or a related discipline Why join Target? We're proud to offer a competitive and flexible benefits package, designed to support your wellbeing, lifestyle and career growth: Core Benefits Competitive salary of up to £50,000 per annum depending on skills and experience 30 days holiday plus bank holidays - from day one Hybrid working policy Defined Contribution Pension Scheme (employer matched up to 6%) Company paid Private Medical Insurance (benefit in kind) Group Life Assurance Group Income Protection Discretionary annual bonus scheme Annual pay review Flexible & Lifestyle Benefits My Flex benefits platform - access to a wide range of voluntary benefits Technology Buying Scheme (salary sacrifice) Gym Flex - discounted gym and health club memberships Dental Insurance Critical Illness Cover Health Cash Plan Cycle to Work scheme Tastecard / Coffee Club Employee Discount Scheme across hundreds of retailers Wellbeing & Support Wisdom Wellbeing - confidential health and wellbeing support, including EAP Free flu vaccinations and eye tests, plus contributions towards glasses Recognition Scheme celebrating successes across the business Free mortgage advice and support Charitable payroll giving Access to a GP 24 hours a day, 7 days a week, 365 days a year through GP24 Everest Funeral Concierge Free Bereavement and Probate Advice and Support Enhanced parental leave Life at Target You'll be joining a team that genuinely: Celebrates success through our My Recognition portal Invests in your development with regular feedback and support Cares about wellbeing as much as delivery Encourages curiosity, innovation and best practice We're committed to creating a Diverse & Inclusive culture through the execution of our D&I strategy, community relationships, our people & leaders. Grow your future with us!

08/06/2026

Full time

Senior DevOps Cloud Engineer Permanent Hybrid (flexible) Up to £50,000 At Target Group, we build secure, scalable technology that powers critical services - and we're looking for a Senior DevOps Cloud Engineer to help lead the next phase of our cloud and platform journey. This is an exciting opportunity for a hands on technical leader who thrives in AWS, DevOps automation, cloud networking and security. If you enjoy shaping engineering standards, mentoring others and building resilient platforms that enable modern delivery at scale, we'd love to hear from you. What can you expect? As a Senior DevOps Cloud Engineer, you'll be a technical leader responsible for designing, building, securing and optimising cloud infrastructure and networking on the Amazon Web Services (AWS) platform. You'll lead by example through hands on engineering while providing technical direction, standards and guidance across our DevOps and cloud engineering community. From shaping highly scalable, resilient and cost efficient platforms to influencing tooling, automation and operational excellence, you'll play a key role in how we build for the future. Working closely with DevOps, Security, SRE, platform and delivery teams, you'll help shape secure by design architectures, drive best practice in automation and reliability, and support a collaborative engineering culture focused on continuous improvement. What you'll be doing Acting as a senior technical authority for DevOps, cloud engineering and AWS platform design Leading the design, deployment and optimisation of secure, scalable and highly available AWS architectures Defining engineering standards, patterns and best practices for infrastructure, networking, security and automation Providing technical leadership for AWS networking, including VPC design, Transit Gateway, Route 53, load balancing and hybrid connectivity Championing Infrastructure as Code, CI/CD and GitOps approaches using tools such as Terraform, Terragrunt, CloudFormation, GitHub Actions, Jenkins or GitLab CI Embedding security, observability, compliance and reliability into cloud-native and hybrid platforms Mentoring engineers, guiding architectural decisions and helping raise capability across the wider engineering community What we're looking for We're looking for someone who combines deep technical expertise with strong leadership and collaboration skills - someone who enjoys solving complex infrastructure challenges and supporting others to do their best work. Extensive hands on AWS experience across compute, networking, storage and security Deep expertise in AWS networking, including VPCs, Transit Gateway, Route 53, ALB/NLB and PrivateLink Strong experience with Infrastructure as Code using Terraform, Terragrunt and/or CloudFormation Proven experience with CI/CD, GitOps, containerisation and Kubernetes/EKS, plus strong scripting skills in Python, Bash or PowerShell Desirable (but not essential): Experience supporting large scale data or analytics platforms Knowledge of Zero Trust and identity centric security models Experience with SIEM/SOC tooling and cloud threat detection capabilities Exposure to AWS Control Tower, Organisations and multi account governance AWS certifications such as Solutions Architect, SysOps, Security Specialty or FinOps related accreditation A degree in Computer Science, IT, Cloud Computing or a related discipline Why join Target? We're proud to offer a competitive and flexible benefits package, designed to support your wellbeing, lifestyle and career growth: Core Benefits Competitive salary of up to £50,000 per annum depending on skills and experience 30 days holiday plus bank holidays - from day one Hybrid working policy Defined Contribution Pension Scheme (employer matched up to 6%) Company paid Private Medical Insurance (benefit in kind) Group Life Assurance Group Income Protection Discretionary annual bonus scheme Annual pay review Flexible & Lifestyle Benefits My Flex benefits platform - access to a wide range of voluntary benefits Technology Buying Scheme (salary sacrifice) Gym Flex - discounted gym and health club memberships Dental Insurance Critical Illness Cover Health Cash Plan Cycle to Work scheme Tastecard / Coffee Club Employee Discount Scheme across hundreds of retailers Wellbeing & Support Wisdom Wellbeing - confidential health and wellbeing support, including EAP Free flu vaccinations and eye tests, plus contributions towards glasses Recognition Scheme celebrating successes across the business Free mortgage advice and support Charitable payroll giving Access to a GP 24 hours a day, 7 days a week, 365 days a year through GP24 Everest Funeral Concierge Free Bereavement and Probate Advice and Support Enhanced parental leave Life at Target You'll be joining a team that genuinely: Celebrates success through our My Recognition portal Invests in your development with regular feedback and support Cares about wellbeing as much as delivery Encourages curiosity, innovation and best practice We're committed to creating a Diverse & Inclusive culture through the execution of our D&I strategy, community relationships, our people & leaders. Grow your future with us!

Senior .Net Engineer - Argos (SV)

Sainsbury's Supermarkets Ltd Coventry, Warwickshire

Salary: Competitive Plus Benefits Location: Coventry Store Support Centre - Ansty Park and Home, Coventry, CV7 9RD Contract type: Permanent Business area: Sainsbury's Tech Closing date: 14 June 2026 Requisition ID: About the Role We're looking for a talented.NET Engineerto help design, build and deliver high performing technology that improves customer experience, drives business efficiency and reduces operating costs. You'll bring curiosity, analytical thinking and a willingness to challenge how things are done - always looking for ways to improve engineering practices within your team. What You'll Do As a .NET Engineer, you will: Design and build large scale, high performance services using the latest technologies and engineering standards. Work with technologies such asReact.js,SQL,MongoDB,Kubernetes,Docker, serverless functions and event driven architecture. Implement and enhance cloud-based solutions acrossAWSandGoogle Cloud Platform. Contribute to the technical roadmap, shaping the long term architecture and engineering strategy. Support internal frameworks and services that improve capability across the wider organisation. Share knowledge, contribute to best practices and enable continuous improvement within the engineering community. Who You Are You're a proactive, driven engineer who is passionate about modern engineering practices, cloud technology and delivering high quality solutions. You promote agile ways of working and enjoy solving complex problems in a collaborative environment. To be successful in this role, you will need: Technical Skills Strong experience in.NET developmentand associated engineering principles. Proficiency withrelational and non relational databases(e.g., SQL, MongoDB). Experience working withcloud platforms(AWS and/or Google Cloud Platform). Knowledge ofReact.jsand modern frontend development standards. Experience withDocker,Kubernetes, and container orchestration. Understanding ofserverless functionsandevent driven architecture. Solid understanding ofsoftware design,security principles, andDevSecOpspractices. Hands on experience withCI/CD pipelinesand infrastructure as code. Ways of Working Ability to collaborate effectively across multidisciplinary teams. Self driven mindset with a passion for continuous learning and innovation. What's in It for You Colleague discount across Sainsbury's, Argos and Habitat Pension plan Access to a wide range of discounts (gyms, restaurants, retail and more)

08/06/2026

Full time

Salary: Competitive Plus Benefits Location: Coventry Store Support Centre - Ansty Park and Home, Coventry, CV7 9RD Contract type: Permanent Business area: Sainsbury's Tech Closing date: 14 June 2026 Requisition ID: About the Role We're looking for a talented.NET Engineerto help design, build and deliver high performing technology that improves customer experience, drives business efficiency and reduces operating costs. You'll bring curiosity, analytical thinking and a willingness to challenge how things are done - always looking for ways to improve engineering practices within your team. What You'll Do As a .NET Engineer, you will: Design and build large scale, high performance services using the latest technologies and engineering standards. Work with technologies such asReact.js,SQL,MongoDB,Kubernetes,Docker, serverless functions and event driven architecture. Implement and enhance cloud-based solutions acrossAWSandGoogle Cloud Platform. Contribute to the technical roadmap, shaping the long term architecture and engineering strategy. Support internal frameworks and services that improve capability across the wider organisation. Share knowledge, contribute to best practices and enable continuous improvement within the engineering community. Who You Are You're a proactive, driven engineer who is passionate about modern engineering practices, cloud technology and delivering high quality solutions. You promote agile ways of working and enjoy solving complex problems in a collaborative environment. To be successful in this role, you will need: Technical Skills Strong experience in.NET developmentand associated engineering principles. Proficiency withrelational and non relational databases(e.g., SQL, MongoDB). Experience working withcloud platforms(AWS and/or Google Cloud Platform). Knowledge ofReact.jsand modern frontend development standards. Experience withDocker,Kubernetes, and container orchestration. Understanding ofserverless functionsandevent driven architecture. Solid understanding ofsoftware design,security principles, andDevSecOpspractices. Hands on experience withCI/CD pipelinesand infrastructure as code. Ways of Working Ability to collaborate effectively across multidisciplinary teams. Self driven mindset with a passion for continuous learning and innovation. What's in It for You Colleague discount across Sainsbury's, Argos and Habitat Pension plan Access to a wide range of discounts (gyms, restaurants, retail and more)

Senior Site Reliability Engineer

iManage City, Belfast

Senior Site Reliability Engineer - iManage SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams - SRE teams are anchored to iManage offices across the globe. Tuesdays and Thursdays are dedicated to in office collaboration, rapid innovation, and developing a sense of belonging at iManage. Mondays and Fridays are reserved for focus time to get things done. Have the best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage means You are an engineer, a builder, and a systems thinker. You'll create middleware and platform guardrails that empower developers to innovate quickly and reliably. You combine deep technical judgment with empathy to eliminate customer pain, especially when working with enthusiastic teams stewarding the world's most privileged data. You uplift those around you, act as a subject matter expert, mentor others, and drive change. You chase contributing factors over root causes, value code over documentation, and documentation over process. You'll engage in and often lead architectural discussions, reduce toil, and deliver scalable, resilient platforms that support our customers and organization. As a Senior SRE, you'll help scale our cloud platform, collaborate across teams to promote standardization and resiliency, and participate in on call rotations. You'll become a key voice in observability, change management, and service scalability, providing guidance during complex technical decisions and high impact events. iManage is experiencing explosive growth in its flagship cloud product. We're seeking senior software and systems engineers specializing in reliability and platform services to join our transformative cloud journey. This requires rethinking technical decisions with a beginner's mindset and a focus on resilience and sustainability. If you write code, think in systems, embrace complexity and automation, and are passionate about service resilience and scalability - we want to talk to you. sRE Responsibilities Eliminate TOIL through automation and software development. Partner cross functionally with application teams and internal stakeholders. Create a modern, cloud native platform that is resilient, cost effective, and secure by default. Scale cloud infrastructure to support our Kubernetes based ecosystem. Maintain the freshness and utility of platform services. Improve the security posture of our products. Design automation, orchestration, observability, and disaster readiness into our products. Participate in production support and on call rotations, providing senior level guidance during critical events. Lead incident management and post incident retrospectives, coaching teams in these practices. Qualifications Experience writing design documents, postmortems, and refactoring application code. Built automation to reduce operational burden or developed internal SaaS tools. Ability to advocate for SRE principles (e.g., SLOs vs SLAs) and introduce them effectively. Experience in public cloud or hosted datacenter environments (Azure and AKS preferred). A passion for collaborative teamwork and influencing reliability best practices across teams. Bonus Points Hands on experience with Linux server stacks (Ubuntu/Debian preferred). Knowledge of cloud provisioning platforms (Terraform preferred). Exposure to configuration management tools (Chef preferred). Experience with containerization/clustering technologies (Docker preferred). Familiarity with observability and alerting tools (Prometheus/Grafana or ELK/EFK). Practical experience with CI/CD pipelines and rollout strategies. A bachelor's degree (or equivalent experience) in Computer Engineering or related field. Proficiency in one or more programming languages (e.g., Java, Python, Golang). Familiarity with scripting languages (e.g., PowerShell, Bash, Python, Ruby). Benefits Creating an inclusive environment where you're encouraged to help shape the culture. Market leading salary determined through a fair and consistent process, equitable for all employees. Annual performance based bonus. Enhanced parental leave (20 weeks for primary and 10 weeks for secondary caregiver at 100% pay). Matching pension contribution (up to 6%). Private medical insurance and cash plan. Group life cover, income protection, and critical illness protection. Flexible time off policy, 25 days of annual leave with additional flexibility. Wellness days each year to prioritize mental health and well being. Access to RethinkCare, a global behavioral health platform. We welcome those who come with a growth mindset and a hunger for learning; if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway. iManage is committed to providing an excellent candidate experience and will never ask you to engage in recruitment activity via text and exclusively communicate from emails using domain. If you have any concerns or questions about communications you have received, please send them to so our team members can review. iManage provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

08/06/2026

Full time

Senior Site Reliability Engineer - iManage SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams - SRE teams are anchored to iManage offices across the globe. Tuesdays and Thursdays are dedicated to in office collaboration, rapid innovation, and developing a sense of belonging at iManage. Mondays and Fridays are reserved for focus time to get things done. Have the best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage means You are an engineer, a builder, and a systems thinker. You'll create middleware and platform guardrails that empower developers to innovate quickly and reliably. You combine deep technical judgment with empathy to eliminate customer pain, especially when working with enthusiastic teams stewarding the world's most privileged data. You uplift those around you, act as a subject matter expert, mentor others, and drive change. You chase contributing factors over root causes, value code over documentation, and documentation over process. You'll engage in and often lead architectural discussions, reduce toil, and deliver scalable, resilient platforms that support our customers and organization. As a Senior SRE, you'll help scale our cloud platform, collaborate across teams to promote standardization and resiliency, and participate in on call rotations. You'll become a key voice in observability, change management, and service scalability, providing guidance during complex technical decisions and high impact events. iManage is experiencing explosive growth in its flagship cloud product. We're seeking senior software and systems engineers specializing in reliability and platform services to join our transformative cloud journey. This requires rethinking technical decisions with a beginner's mindset and a focus on resilience and sustainability. If you write code, think in systems, embrace complexity and automation, and are passionate about service resilience and scalability - we want to talk to you. sRE Responsibilities Eliminate TOIL through automation and software development. Partner cross functionally with application teams and internal stakeholders. Create a modern, cloud native platform that is resilient, cost effective, and secure by default. Scale cloud infrastructure to support our Kubernetes based ecosystem. Maintain the freshness and utility of platform services. Improve the security posture of our products. Design automation, orchestration, observability, and disaster readiness into our products. Participate in production support and on call rotations, providing senior level guidance during critical events. Lead incident management and post incident retrospectives, coaching teams in these practices. Qualifications Experience writing design documents, postmortems, and refactoring application code. Built automation to reduce operational burden or developed internal SaaS tools. Ability to advocate for SRE principles (e.g., SLOs vs SLAs) and introduce them effectively. Experience in public cloud or hosted datacenter environments (Azure and AKS preferred). A passion for collaborative teamwork and influencing reliability best practices across teams. Bonus Points Hands on experience with Linux server stacks (Ubuntu/Debian preferred). Knowledge of cloud provisioning platforms (Terraform preferred). Exposure to configuration management tools (Chef preferred). Experience with containerization/clustering technologies (Docker preferred). Familiarity with observability and alerting tools (Prometheus/Grafana or ELK/EFK). Practical experience with CI/CD pipelines and rollout strategies. A bachelor's degree (or equivalent experience) in Computer Engineering or related field. Proficiency in one or more programming languages (e.g., Java, Python, Golang). Familiarity with scripting languages (e.g., PowerShell, Bash, Python, Ruby). Benefits Creating an inclusive environment where you're encouraged to help shape the culture. Market leading salary determined through a fair and consistent process, equitable for all employees. Annual performance based bonus. Enhanced parental leave (20 weeks for primary and 10 weeks for secondary caregiver at 100% pay). Matching pension contribution (up to 6%). Private medical insurance and cash plan. Group life cover, income protection, and critical illness protection. Flexible time off policy, 25 days of annual leave with additional flexibility. Wellness days each year to prioritize mental health and well being. Access to RethinkCare, a global behavioral health platform. We welcome those who come with a growth mindset and a hunger for learning; if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway. iManage is committed to providing an excellent candidate experience and will never ask you to engage in recruitment activity via text and exclusively communicate from emails using domain. If you have any concerns or questions about communications you have received, please send them to so our team members can review. iManage provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

DataOps Engineer

Dormont Manufacturing Co

CoreWeave is The Essential Cloud for AI . Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. We're proud to be a Living Wage accredited Employer. What You'll Do: The Monolith AI Platform Engineering Team at CoreWeave is responsible for building and scaling the data and workflow backbone that powers the world's most advanced engineering simulation and AI workflows - our ambition is to become the super intelligent AI test lab for the engineering industry, helping customers ship science, faster. From high throughput data ingestion and feature pipelines to model training and real time inference, our platform delivers the performant, reliable, and trustworthy data foundation trusted by the world's largest engineering companies. The Senior DataOps Engineer II will own and drive all things data observability and operations across our client estate - building the practices, tooling, and culture that make Monolith's data flows debuggable, auditable, and safe to evolve. You'll sit at the intersection of platform engineering, data engineering, and reliability, implementing end to end lineage and DataOps practices while mentoring data producers and consumers on how to manage data as a first class product. You'll partner closely with Monolith's Product, Engineering and forward deployed teams, as well as with CoreWeave's infrastructure and AI platform groups, to turn fragmented, real world engineering data into well governed, observable, and operationally robust pipelines powering our SaaS platform and client specific deployments. About the Role: We're seeking an Senior DataOps Engineer II who can act as the hands on owner for Monolith's data observability and operational surface: from batch and streaming pipelines running on our platform, through to the lineage, quality, and runbooks that keep customer environments healthy. You'll define and roll out DataOps practices (CI/CD, infra as code, data SLOs, incident response) across the Monolith estate, implement end to end data lineage and observability, and serve as the go to mentor for engineering teams and client facing colleagues on best practice data management. In this role, you will: Own Monolith's Data Observability & Operations Surface Design and implement the end to end observability stack for data workloads (metrics, logs, traces, and data quality signals) across batch and streaming pipelines. Define and maintain operational SLOs/SLAs for critical data flows powering training, inference, and analytics, and ensure they are measurable and actionable. Build dashboards, alerts, and runbooks that allow engineers and on call responders to quickly detect, triage, and remediate data incidents. Standardise "golden paths" for how teams instrument pipelines, expose health signals, and respond to data related failures. Implement Data Lineage, Quality & Governance Deploy and maintain end to end data lineage for key domains - from client sources through transformations to features, models, and downstream analytics so teams can debug, audit, and reason about change. Define and roll out data quality checks (schema, freshness, completeness, distribution, drift) and ensure failures integrate cleanly into alerting and incident workflows. Partner with Security, Compliance, and customer facing teams to encode data governance requirements (e.g., retention, residency, access controls) into our pipelines and tooling. Help shape metadata models and catalog conventions so that producers and consumers can reliably discover, understand, and use shared datasets. Enable DataOps Practices Across Teams Establish CI/CD patterns for data pipelines and related infrastructure, including testing strategies, promotion workflows, and change management guardrails. Drive adoption of infra as code for data infrastructure (e.g., pipeline orchestration, storage, observability components), reducing manual drift across environments. Define and continuously improve DataOps processes - incident response, post incident review, change review, on call rotations - with a focus on learning rather than blame. Evaluate and integrate best of breed DataOps and observability tooling where it accelerates our teams, balancing build vs. buy pragmatically. Partner Across Monolith, CoreWeave & Clients Work with Monolith platform, data, agent, and reliability teams to expose observability and lineage as shared services and patterns other engineers can build on. Collaborate with CoreWeave infrastructure and AI platform teams to leverage underlying storage, compute, networking, and observability in service of robust data flows. Serve as a technical escalation point for forward deployed and customer facing engineers when data issues cross service boundaries or require deeper architectural insight. Mentor data producers (product teams, integrations, forward deployed engineers) and data consumers (data scientists, analysts, client engineers) on resilient schemas, contracts, and operational practices. Who You Are: Experience & Level Typically 5-6+ years of experience in DataOps, Data Engineering, DevOps/SRE for data platforms, or similar roles, including end to end ownership of production data pipelines and their operations. Proven track record of operating at Senior IC scope: leading cross team initiatives, introducing new practices/tooling, and improving reliability at the platform level. DataOps, Pipelines & Tooling Strong hands on experience designing, deploying, and operating data pipelines in production (batch and/or streaming), including failure modes, retries, and backfills. Practical experience with data orchestration and ETL/ELT tooling (e.g., Airflow, Dagster, dbt, Temporal, or similar) and comfort evaluating and integrating new tools where appropriate. Solid SQL and/or Spark skills and experience with at least one major analytical database or warehouse; familiarity with time series / telemetry data is a plus. Observability, Lineage & Data Quality Extensive experience implementing data observability - metrics, logging, tracing, dashboards, and alerting - for data centric workloads. Hands on work with data quality frameworks and/or observability platforms to monitor freshness, completeness, schema changes, and anomalies. Experience deploying and using data lineage or metadata/catalog solutions, and applying them to debugging, compliance, and change impact analysis. Platform, Infrastructure & Automation Comfortable working in containerised, cloud native environments (Kubernetes plus at least one major cloud provider); experience with GPU or compute intensive workloads is a bonus. Strong automation mindset: infra as code, CI/CD, and configuration management for data infrastructure and observability components. Proficient in Python for building tooling, pipeline glue, and platform integrations; additional languages are a plus. Collaboration, Mentorship & Communication Clear communicator who can explain complex data flows and failure modes to both deeply technical and non specialist audiences. Experience mentoring engineers and data practitioners on better data management, observability, and operational hygiene - through documentation, examples, reviews, and office hours. Comfortable working in a fast moving, high ambiguity environment where we balance rapid iteration with the safety and reliability demanded by enterprise engineering clients. Preferred: Experience in ML/AI platforms or MLOps environments where data pipelines power experimentation, training, and inference at scale. Background with test, simulation, or time series data (e.g., physical test benches, battery labs, automotive/aerospace R&D). Familiarity with feature stores, experiment tracking, or model registries and their interaction with upstream data pipelines. Prior work in multi tenant SaaS platforms, especially those with strong compliance, observability, and uptime requirements. Experience supporting or partnering closely with forward deployed / professional services teams in complex customer environments. Wondering if you're a good fit? We believe in investing in our people, and value candidates who bring diverse experiences - even if you don't tick every single box. Here are a few qualities we've found compatible with our team. If some of this sounds like you, we'd love to talk: Data obsessed operator - You care deeply about making data systems observable, predictable, and easy to reason about, not just "working most of the time." Systems thinker - You enjoy mapping complex data flows across services, understanding failure modes, and designing for graceful degradation and rapid recovery. Pragmatic - You know when to build the ideal abstraction and when to ship the smallest change that meaningfully reduces risk or toil. Collaborative mentor . click apply for full job details

08/06/2026

Full time

CoreWeave is The Essential Cloud for AI . Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. We're proud to be a Living Wage accredited Employer. What You'll Do: The Monolith AI Platform Engineering Team at CoreWeave is responsible for building and scaling the data and workflow backbone that powers the world's most advanced engineering simulation and AI workflows - our ambition is to become the super intelligent AI test lab for the engineering industry, helping customers ship science, faster. From high throughput data ingestion and feature pipelines to model training and real time inference, our platform delivers the performant, reliable, and trustworthy data foundation trusted by the world's largest engineering companies. The Senior DataOps Engineer II will own and drive all things data observability and operations across our client estate - building the practices, tooling, and culture that make Monolith's data flows debuggable, auditable, and safe to evolve. You'll sit at the intersection of platform engineering, data engineering, and reliability, implementing end to end lineage and DataOps practices while mentoring data producers and consumers on how to manage data as a first class product. You'll partner closely with Monolith's Product, Engineering and forward deployed teams, as well as with CoreWeave's infrastructure and AI platform groups, to turn fragmented, real world engineering data into well governed, observable, and operationally robust pipelines powering our SaaS platform and client specific deployments. About the Role: We're seeking an Senior DataOps Engineer II who can act as the hands on owner for Monolith's data observability and operational surface: from batch and streaming pipelines running on our platform, through to the lineage, quality, and runbooks that keep customer environments healthy. You'll define and roll out DataOps practices (CI/CD, infra as code, data SLOs, incident response) across the Monolith estate, implement end to end data lineage and observability, and serve as the go to mentor for engineering teams and client facing colleagues on best practice data management. In this role, you will: Own Monolith's Data Observability & Operations Surface Design and implement the end to end observability stack for data workloads (metrics, logs, traces, and data quality signals) across batch and streaming pipelines. Define and maintain operational SLOs/SLAs for critical data flows powering training, inference, and analytics, and ensure they are measurable and actionable. Build dashboards, alerts, and runbooks that allow engineers and on call responders to quickly detect, triage, and remediate data incidents. Standardise "golden paths" for how teams instrument pipelines, expose health signals, and respond to data related failures. Implement Data Lineage, Quality & Governance Deploy and maintain end to end data lineage for key domains - from client sources through transformations to features, models, and downstream analytics so teams can debug, audit, and reason about change. Define and roll out data quality checks (schema, freshness, completeness, distribution, drift) and ensure failures integrate cleanly into alerting and incident workflows. Partner with Security, Compliance, and customer facing teams to encode data governance requirements (e.g., retention, residency, access controls) into our pipelines and tooling. Help shape metadata models and catalog conventions so that producers and consumers can reliably discover, understand, and use shared datasets. Enable DataOps Practices Across Teams Establish CI/CD patterns for data pipelines and related infrastructure, including testing strategies, promotion workflows, and change management guardrails. Drive adoption of infra as code for data infrastructure (e.g., pipeline orchestration, storage, observability components), reducing manual drift across environments. Define and continuously improve DataOps processes - incident response, post incident review, change review, on call rotations - with a focus on learning rather than blame. Evaluate and integrate best of breed DataOps and observability tooling where it accelerates our teams, balancing build vs. buy pragmatically. Partner Across Monolith, CoreWeave & Clients Work with Monolith platform, data, agent, and reliability teams to expose observability and lineage as shared services and patterns other engineers can build on. Collaborate with CoreWeave infrastructure and AI platform teams to leverage underlying storage, compute, networking, and observability in service of robust data flows. Serve as a technical escalation point for forward deployed and customer facing engineers when data issues cross service boundaries or require deeper architectural insight. Mentor data producers (product teams, integrations, forward deployed engineers) and data consumers (data scientists, analysts, client engineers) on resilient schemas, contracts, and operational practices. Who You Are: Experience & Level Typically 5-6+ years of experience in DataOps, Data Engineering, DevOps/SRE for data platforms, or similar roles, including end to end ownership of production data pipelines and their operations. Proven track record of operating at Senior IC scope: leading cross team initiatives, introducing new practices/tooling, and improving reliability at the platform level. DataOps, Pipelines & Tooling Strong hands on experience designing, deploying, and operating data pipelines in production (batch and/or streaming), including failure modes, retries, and backfills. Practical experience with data orchestration and ETL/ELT tooling (e.g., Airflow, Dagster, dbt, Temporal, or similar) and comfort evaluating and integrating new tools where appropriate. Solid SQL and/or Spark skills and experience with at least one major analytical database or warehouse; familiarity with time series / telemetry data is a plus. Observability, Lineage & Data Quality Extensive experience implementing data observability - metrics, logging, tracing, dashboards, and alerting - for data centric workloads. Hands on work with data quality frameworks and/or observability platforms to monitor freshness, completeness, schema changes, and anomalies. Experience deploying and using data lineage or metadata/catalog solutions, and applying them to debugging, compliance, and change impact analysis. Platform, Infrastructure & Automation Comfortable working in containerised, cloud native environments (Kubernetes plus at least one major cloud provider); experience with GPU or compute intensive workloads is a bonus. Strong automation mindset: infra as code, CI/CD, and configuration management for data infrastructure and observability components. Proficient in Python for building tooling, pipeline glue, and platform integrations; additional languages are a plus. Collaboration, Mentorship & Communication Clear communicator who can explain complex data flows and failure modes to both deeply technical and non specialist audiences. Experience mentoring engineers and data practitioners on better data management, observability, and operational hygiene - through documentation, examples, reviews, and office hours. Comfortable working in a fast moving, high ambiguity environment where we balance rapid iteration with the safety and reliability demanded by enterprise engineering clients. Preferred: Experience in ML/AI platforms or MLOps environments where data pipelines power experimentation, training, and inference at scale. Background with test, simulation, or time series data (e.g., physical test benches, battery labs, automotive/aerospace R&D). Familiarity with feature stores, experiment tracking, or model registries and their interaction with upstream data pipelines. Prior work in multi tenant SaaS platforms, especially those with strong compliance, observability, and uptime requirements. Experience supporting or partnering closely with forward deployed / professional services teams in complex customer environments. Wondering if you're a good fit? We believe in investing in our people, and value candidates who bring diverse experiences - even if you don't tick every single box. Here are a few qualities we've found compatible with our team. If some of this sounds like you, we'd love to talk: Data obsessed operator - You care deeply about making data systems observable, predictable, and easy to reason about, not just "working most of the time." Systems thinker - You enjoy mapping complex data flows across services, understanding failure modes, and designing for graceful degradation and rapid recovery. Pragmatic - You know when to build the ideal abstraction and when to ship the smallest change that meaningfully reduces risk or toil. Collaborative mentor . click apply for full job details

Senior Site Reliability Engineer

iManage

Senior Site Reliability Engineer - iManage SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams - SRE teams are anchored to iManage offices across the globe. Tuesdays and Thursdays are dedicated to in office collaboration, rapid innovation, and developing a sense of belonging at iManage. Mondays and Fridays are reserved for focus time to get things done. Have the best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage means You are an engineer, a builder, and a systems thinker. You'll create middleware and platform guardrails that empower developers to innovate quickly and reliably. You combine deep technical judgment with empathy to eliminate customer pain, especially when working with enthusiastic teams stewarding the world's most privileged data. You uplift those around you, act as a subject matter expert, mentor others, and drive change. You chase contributing factors over root causes, value code over documentation, and documentation over process. You'll engage in and often lead architectural discussions, reduce toil, and deliver scalable, resilient platforms that support our customers and organization. As a Senior SRE, you'll help scale our cloud platform, collaborate across teams to promote standardization and resiliency, and participate in on call rotations. You'll become a key voice in observability, change management, and service scalability, providing guidance during complex technical decisions and high impact events. iManage is experiencing explosive growth in its flagship cloud product. We're seeking senior software and systems engineers specializing in reliability and platform services to join our transformative cloud journey. This requires rethinking technical decisions with a beginner's mindset and a focus on resilience and sustainability. If you write code, think in systems, embrace complexity and automation, and are passionate about service resilience and scalability - we want to talk to you. sRE Responsibilities Eliminate TOIL through automation and software development. Partner cross functionally with application teams and internal stakeholders. Create a modern, cloud native platform that is resilient, cost effective, and secure by default. Scale cloud infrastructure to support our Kubernetes based ecosystem. Maintain the freshness and utility of platform services. Improve the security posture of our products. Design automation, orchestration, observability, and disaster readiness into our products. Participate in production support and on call rotations, providing senior level guidance during critical events. Lead incident management and post incident retrospectives, coaching teams in these practices. Qualifications Experience writing design documents, postmortems, and refactoring application code. Built automation to reduce operational burden or developed internal SaaS tools. Ability to advocate for SRE principles (e.g., SLOs vs SLAs) and introduce them effectively. Experience in public cloud or hosted datacenter environments (Azure and AKS preferred). A passion for collaborative teamwork and influencing reliability best practices across teams. Bonus Points Hands on experience with Linux server stacks (Ubuntu/Debian preferred). Knowledge of cloud provisioning platforms (Terraform preferred). Exposure to configuration management tools (Chef preferred). Experience with containerization/clustering technologies (Docker preferred). Familiarity with observability and alerting tools (Prometheus/Grafana or ELK/EFK). Practical experience with CI/CD pipelines and rollout strategies. A bachelor's degree (or equivalent experience) in Computer Engineering or related field. Proficiency in one or more programming languages (e.g., Java, Python, Golang). Familiarity with scripting languages (e.g., PowerShell, Bash, Python, Ruby). Benefits Creating an inclusive environment where you're encouraged to help shape the culture. Market leading salary determined through a fair and consistent process, equitable for all employees. Annual performance based bonus. Enhanced parental leave (20 weeks for primary and 10 weeks for secondary caregiver at 100% pay). Matching pension contribution (up to 6%). Private medical insurance and cash plan. Group life cover, income protection, and critical illness protection. Flexible time off policy, 25 days of annual leave with additional flexibility. Wellness days each year to prioritize mental health and well being. Access to RethinkCare, a global behavioral health platform. We welcome those who come with a growth mindset and a hunger for learning; if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway. iManage is committed to providing an excellent candidate experience and will never ask you to engage in recruitment activity via text and exclusively communicate from emails using domain. If you have any concerns or questions about communications you have received, please send them to so our team members can review. iManage provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

08/06/2026

Full time

Senior Site Reliability Engineer - iManage SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams - SRE teams are anchored to iManage offices across the globe. Tuesdays and Thursdays are dedicated to in office collaboration, rapid innovation, and developing a sense of belonging at iManage. Mondays and Fridays are reserved for focus time to get things done. Have the best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Senior Site Reliability Engineer at iManage means You are an engineer, a builder, and a systems thinker. You'll create middleware and platform guardrails that empower developers to innovate quickly and reliably. You combine deep technical judgment with empathy to eliminate customer pain, especially when working with enthusiastic teams stewarding the world's most privileged data. You uplift those around you, act as a subject matter expert, mentor others, and drive change. You chase contributing factors over root causes, value code over documentation, and documentation over process. You'll engage in and often lead architectural discussions, reduce toil, and deliver scalable, resilient platforms that support our customers and organization. As a Senior SRE, you'll help scale our cloud platform, collaborate across teams to promote standardization and resiliency, and participate in on call rotations. You'll become a key voice in observability, change management, and service scalability, providing guidance during complex technical decisions and high impact events. iManage is experiencing explosive growth in its flagship cloud product. We're seeking senior software and systems engineers specializing in reliability and platform services to join our transformative cloud journey. This requires rethinking technical decisions with a beginner's mindset and a focus on resilience and sustainability. If you write code, think in systems, embrace complexity and automation, and are passionate about service resilience and scalability - we want to talk to you. sRE Responsibilities Eliminate TOIL through automation and software development. Partner cross functionally with application teams and internal stakeholders. Create a modern, cloud native platform that is resilient, cost effective, and secure by default. Scale cloud infrastructure to support our Kubernetes based ecosystem. Maintain the freshness and utility of platform services. Improve the security posture of our products. Design automation, orchestration, observability, and disaster readiness into our products. Participate in production support and on call rotations, providing senior level guidance during critical events. Lead incident management and post incident retrospectives, coaching teams in these practices. Qualifications Experience writing design documents, postmortems, and refactoring application code. Built automation to reduce operational burden or developed internal SaaS tools. Ability to advocate for SRE principles (e.g., SLOs vs SLAs) and introduce them effectively. Experience in public cloud or hosted datacenter environments (Azure and AKS preferred). A passion for collaborative teamwork and influencing reliability best practices across teams. Bonus Points Hands on experience with Linux server stacks (Ubuntu/Debian preferred). Knowledge of cloud provisioning platforms (Terraform preferred). Exposure to configuration management tools (Chef preferred). Experience with containerization/clustering technologies (Docker preferred). Familiarity with observability and alerting tools (Prometheus/Grafana or ELK/EFK). Practical experience with CI/CD pipelines and rollout strategies. A bachelor's degree (or equivalent experience) in Computer Engineering or related field. Proficiency in one or more programming languages (e.g., Java, Python, Golang). Familiarity with scripting languages (e.g., PowerShell, Bash, Python, Ruby). Benefits Creating an inclusive environment where you're encouraged to help shape the culture. Market leading salary determined through a fair and consistent process, equitable for all employees. Annual performance based bonus. Enhanced parental leave (20 weeks for primary and 10 weeks for secondary caregiver at 100% pay). Matching pension contribution (up to 6%). Private medical insurance and cash plan. Group life cover, income protection, and critical illness protection. Flexible time off policy, 25 days of annual leave with additional flexibility. Wellness days each year to prioritize mental health and well being. Access to RethinkCare, a global behavioral health platform. We welcome those who come with a growth mindset and a hunger for learning; if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway. iManage is committed to providing an excellent candidate experience and will never ask you to engage in recruitment activity via text and exclusively communicate from emails using domain. If you have any concerns or questions about communications you have received, please send them to so our team members can review. iManage provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Senior or Staff Software Engineer, SRE/ Platform Team

OneSignal, Inc.

Senior or Staff Software Engineer, SRE/ Platform Team OneSignal is a leading omnichannel customer engagement solution, powering personalized customer journeys across mobile and web push notifications, in-app messaging, SMS, and email. On a mission to democratize customer engagement, we enable businesses to keep their 1.5B monthly active users engaged and up to date by delivering over 1.3T messages a year! 1 in 4 app publishers trust OneSignal to power their customer engagement! And we support companies in 140 countries! Our customers range from startups and small businesses just getting off the ground to established companies such as Live Nation, American Express, Whole Foods, Zynga, and many more. We're Series C, venture-backed by SignalFire, Rakuten Ventures, Y Combinator, HubSpot, and BAM Elevate. We offer remote work as the default option in the United States in California, Colorado, Massachusetts, New York, New Jersey, Oregon, Pennsylvania, Texas, Utah and Washington. As well as in the UK, Singapore, and Canada - with plans to expand the locations we support in the future. Some roles are hybrid roles and will be listed as such. We have offices in San Mateo, CA and London, UK, and offer flex seating options for employees to work together in-person in NY and other areas. Hiring in Singapore is done in partnership with a local EOR, and hiring in Canada is done in partnership with Rippling's EOR. OneSignal has a lot of the great tech startup qualities you'd expect, but we don't stop there. Our massive scale and small team, emphasis on collaboration, and focus on ownership and personal growth make OneSignal a uniquely great place to work. About The Team: We have grown rapidly to where we are today, serving billions of HTTP requests daily. We achieved this scale by writing scale-sensitive components in languages like Rust and Go. This potent combination of high performance with efficient resource utilization has given us an incredible competitive edge. We are seeking a Platform Engineer to join our team and help us scale by managing and developing the next generation of our infrastructure. While we currently maintain a 99.95 % uptime, we are dedicated to sustaining this level of reliability as our product and business expand. In this role, your core responsibility will be software engineering with a specialized focus on operations, infrastructure, and automation. You will develop the systems that power our product, enhance internal services, and provide architectural guidance to product teams to ensure optimal service operability. You will leverage Kubernetes to automate data center functions and create services that streamline database operations. A major aspect of this position is gaining a deep enough understanding of our systems to move beyond manual intervention and build sophisticated software solutions that fully automate these processes. What You'll Do: Optimize and Elevate Performance: Identify bottlenecks in our systems and unleash your creativity to introduce cutting edge optimizations. You'll have the chance to improve the performance of our databases and evaluate innovative storage technologies that will elevate our infrastructure to new heights. Forge Infrastructure as Code: Take the lead in setting up robust infrastructure and configuration as code with Kubernetes and Terraform. You'll be at the forefront of shaping our foundational architecture, ensuring it's both resilient and scalable. Drive Observability and Monitoring: Establish and maintain a state of the art observability and monitoring stack. Your insights will enable us to stay ahead of potential issues, ensuring our services remain reliable and performant. Craft the Golden Path for CI/CD: Define and implement best practices for continuous integration and deployment. Your work will streamline the deployment process for our engineering teams, allowing them to roll out new features swiftly and safely. Collaborate Across Teams: Work closely with engineering teams to architect highly scalable, observable services. Your collaboration will be essential in creating a cohesive and efficient development environment. Be a Key Player in Incident Response: Join the on call rotation and play a crucial role in maintaining our systems' health. Your expertise will be vital in troubleshooting and resolving issues, ensuring our services always meet the highest standards. What you'll bring: At least 8 years of platform experience Experience operating reliable production systems at scale Knowledge of Linux systems internals Desire and ability to automate tasks Experience managing PostgreSQL for high scale throughput systems, or similar experience with other relevant SQL datastores. Operational experience deploying and managing Kubernetes Experience working with Cloud Providers (AWS/GCP/Azure) We value a variety of experiences, so these are not required. It would be an added bonus if you have experience in any of the following: Recently writing Go and/or Rust Working with ScyllaDB The base salary in UK for a Senior Software Engineer full time position is between GBP 100,000 and GBP 125,000, and for the Staff level is GBP 125,000 and GBP 145,000. Your exact starting salary is determined by a number of factors such as your experience, skills, and qualifications. In addition to base salary, we also offer a competitive equity program and comprehensive and inclusive benefits. Qualities we look for: Friendliness & Empathy Accountability & Collaboration Proactiveness & Urgency Growth Mindset & Love of Learning In keeping with our beliefs and goals, no employee or applicant will face discrimination/harassment based on: race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation, gender identity, disability status, or veteran status. Above and beyond discrimination/harassment based on 'protected categories,' we also strive to prevent other, subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place in our workplace. Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on OneSignal. Please inform us if you need assistance completing any forms or otherwise participating in the application and/or interview process.

07/06/2026

Full time

Senior or Staff Software Engineer, SRE/ Platform Team OneSignal is a leading omnichannel customer engagement solution, powering personalized customer journeys across mobile and web push notifications, in-app messaging, SMS, and email. On a mission to democratize customer engagement, we enable businesses to keep their 1.5B monthly active users engaged and up to date by delivering over 1.3T messages a year! 1 in 4 app publishers trust OneSignal to power their customer engagement! And we support companies in 140 countries! Our customers range from startups and small businesses just getting off the ground to established companies such as Live Nation, American Express, Whole Foods, Zynga, and many more. We're Series C, venture-backed by SignalFire, Rakuten Ventures, Y Combinator, HubSpot, and BAM Elevate. We offer remote work as the default option in the United States in California, Colorado, Massachusetts, New York, New Jersey, Oregon, Pennsylvania, Texas, Utah and Washington. As well as in the UK, Singapore, and Canada - with plans to expand the locations we support in the future. Some roles are hybrid roles and will be listed as such. We have offices in San Mateo, CA and London, UK, and offer flex seating options for employees to work together in-person in NY and other areas. Hiring in Singapore is done in partnership with a local EOR, and hiring in Canada is done in partnership with Rippling's EOR. OneSignal has a lot of the great tech startup qualities you'd expect, but we don't stop there. Our massive scale and small team, emphasis on collaboration, and focus on ownership and personal growth make OneSignal a uniquely great place to work. About The Team: We have grown rapidly to where we are today, serving billions of HTTP requests daily. We achieved this scale by writing scale-sensitive components in languages like Rust and Go. This potent combination of high performance with efficient resource utilization has given us an incredible competitive edge. We are seeking a Platform Engineer to join our team and help us scale by managing and developing the next generation of our infrastructure. While we currently maintain a 99.95 % uptime, we are dedicated to sustaining this level of reliability as our product and business expand. In this role, your core responsibility will be software engineering with a specialized focus on operations, infrastructure, and automation. You will develop the systems that power our product, enhance internal services, and provide architectural guidance to product teams to ensure optimal service operability. You will leverage Kubernetes to automate data center functions and create services that streamline database operations. A major aspect of this position is gaining a deep enough understanding of our systems to move beyond manual intervention and build sophisticated software solutions that fully automate these processes. What You'll Do: Optimize and Elevate Performance: Identify bottlenecks in our systems and unleash your creativity to introduce cutting edge optimizations. You'll have the chance to improve the performance of our databases and evaluate innovative storage technologies that will elevate our infrastructure to new heights. Forge Infrastructure as Code: Take the lead in setting up robust infrastructure and configuration as code with Kubernetes and Terraform. You'll be at the forefront of shaping our foundational architecture, ensuring it's both resilient and scalable. Drive Observability and Monitoring: Establish and maintain a state of the art observability and monitoring stack. Your insights will enable us to stay ahead of potential issues, ensuring our services remain reliable and performant. Craft the Golden Path for CI/CD: Define and implement best practices for continuous integration and deployment. Your work will streamline the deployment process for our engineering teams, allowing them to roll out new features swiftly and safely. Collaborate Across Teams: Work closely with engineering teams to architect highly scalable, observable services. Your collaboration will be essential in creating a cohesive and efficient development environment. Be a Key Player in Incident Response: Join the on call rotation and play a crucial role in maintaining our systems' health. Your expertise will be vital in troubleshooting and resolving issues, ensuring our services always meet the highest standards. What you'll bring: At least 8 years of platform experience Experience operating reliable production systems at scale Knowledge of Linux systems internals Desire and ability to automate tasks Experience managing PostgreSQL for high scale throughput systems, or similar experience with other relevant SQL datastores. Operational experience deploying and managing Kubernetes Experience working with Cloud Providers (AWS/GCP/Azure) We value a variety of experiences, so these are not required. It would be an added bonus if you have experience in any of the following: Recently writing Go and/or Rust Working with ScyllaDB The base salary in UK for a Senior Software Engineer full time position is between GBP 100,000 and GBP 125,000, and for the Staff level is GBP 125,000 and GBP 145,000. Your exact starting salary is determined by a number of factors such as your experience, skills, and qualifications. In addition to base salary, we also offer a competitive equity program and comprehensive and inclusive benefits. Qualities we look for: Friendliness & Empathy Accountability & Collaboration Proactiveness & Urgency Growth Mindset & Love of Learning In keeping with our beliefs and goals, no employee or applicant will face discrimination/harassment based on: race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation, gender identity, disability status, or veteran status. Above and beyond discrimination/harassment based on 'protected categories,' we also strive to prevent other, subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place in our workplace. Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on OneSignal. Please inform us if you need assistance completing any forms or otherwise participating in the application and/or interview process.

Senior Site Reliability Engineer (SRE)

The Investigo Group

Role: Senior Site Reliability Engineer (SRE) - Kubernetes / OpenShift Location: Remote - UK (possible paid occasional travel to TIG Secure site locations as required) Job Type: Full-time, Permanent (37.5 hours) Salary: Competitive + benefits + package Security Clearance Requirements Please note that holding a current Security Clearance is not essential at the time of application, but eligibility is required. This role requires the successful candidate to be eligible for Security Check (SC) clearance. To meet this requirement, applicants must: Have the right to work in the UK Have lived in the UK continuously for the past 5 years Not have spent more than 6 months outside the UK in total during that period Be willing to undergo security vetting as part of the onboarding process About You You're an experienced SRE, Platform Engineer or Cloud Engineer with strong hands on experience running Kubernetes in production environments. You're comfortable working across Linux, Kubernetes, cloud native tooling, automation, observability, CI/CD and infrastructure as code. You understand that reliability, security and operational maturity are critical to how modern platforms support engineering teams and customer facing services. You enjoy treating infrastructure as a product, automating repeatable work, improving resilience, and building platforms that other engineers can rely on. You're calm under pressure, methodical during incidents, and able to turn operational challenges into long term improvements. You may have worked in a regulated, secure, government, defence, financial services, telecoms, managed services or cloud native environment, but most importantly you have operated Kubernetes at depth and understand the realities of production ownership. You're a senior individual contributor who can mentor others, influence engineering practice, and provide technical authority without needing formal line management responsibility. About the Role We're looking for a Senior Site Reliability Engineer (SRE) to help operate, harden and mature our production OKD / Kubernetes platforms. This is a hands on engineering role focused on reliability, automation, observability, GitOps, CI/CD and secure platform operations. You'll work across the full stack, from bare metal and virtualisation through to Kubernetes control plane operations, ingress, identity, monitoring, developer platform tooling and application delivery. The role will play a key part in improving the operational maturity of our platform estate, supporting the migration from VMware to KVM, strengthening GitOps and CI/CD practices, and helping ensure our platforms remain secure, scalable and aligned to the needs of regulated customer environments. You'll work closely with platform, application, AI, networking, security, QA and architecture teams to build reliable foundations that enable other engineering teams to deliver safely and at pace. This is not a ticket handling role. It is a senior engineering position where you'll be expected to own problems, drive improvements, and help shape how TIG operates critical cloud native infrastructure. About the Team You'll be joining our Cloud team, working closely with Platform Engineering and wider engineering teams responsible for the foundational platforms on which TIG's services run. This is a great opportunity to join a small, senior technical environment where you can have direct ownership, meaningful influence, and visibility across modern platform engineering, Kubernetes, automation, observability, security and cloud native delivery. Key Responsibilities Operate, harden and extend production OpenShift / OKD / Kubernetes clusters across on premises and hybrid environments. Support the migration from VMware to KVM, helping modernise the underlying compute and storage layer. Own and improve CI/CD processes across the full lifecycle of platform and application components. Work with platform and application engineers to support cloud native delivery using tools such as Helm and Kustomize. Develop and mature GitOps deployment practices using tools such as Argo CD or Flux. Maintain and improve core platform services including identity, ingress, observability, certificate management, service mesh and container registry capabilities. Build and operate observability across logs, metrics, traces, alerting, SLOs and error budgets. Improve platform hardening in line with secure and regulated environment requirements, including network policy, SELinux, image provenance, secret management and audit. Automate repeatable operational tasks using tools such as Ansible, Terraform, Helm, Kustomize, Go, Python or equivalent technologies. Lead incident response activity, support blameless post mortems and drive systemic fixes. Partner with networking and security teams on platform integration, segmentation, load balancing and accreditation evidence. Create and maintain clear technical documentation, runbooks, design notes and operational guidance. Mentor other engineers and act as a senior technical authority across cloud and Kubernetes operations. Participate in an on call rota, with appropriate compensation. Success in This Role Looks Like A more reliable, secure and measurable production Kubernetes estate. Improved platform observability, with meaningful alerting, SLOs and trend data that engineering teams actively use. Progress against the VMware to KVM migration, with a clear and automated path for the underlying infrastructure layer. A mature GitOps approach covering platform and application components, including rollback, drift detection and operational control. Improved CI/CD practices that help teams move at pace while considering security, QA and compliance earlier in the lifecycle. Well documented, supportable and scalable platform services. Stronger incident response, clearer runbooks and post mortems that lead to real operational improvements. Recognition as a technical authority for Kubernetes, cloud and platform operations across the organisation. What We're Looking For We're looking for a Senior Site Reliability Engineer (SRE) with strong experience operating production Kubernetes environments. This role is well suited to someone who combines deep technical capability with strong operational discipline. You'll be comfortable taking ownership of complex platform challenges, improving reliability, and working collaboratively across engineering, security, networking and architecture teams. Essential Experience & Skills Strong experience running production Kubernetes environments, not just consuming or deploying into them. Strong Linux fundamentals, including systemd, networking, storage and performance troubleshooting. Experience with at least one Kubernetes distribution such as OKD, OpenShift, vanilla Kubernetes, Rancher, EKS, AKS or GKE. Solid infrastructure as code experience, including Ansible plus Terraform or equivalent, alongside tools such as Helm and Kustomize. GitOps and CI/CD experience managing full application and component lifecycles, using tools such as Argo CD, Flux, GitHub Actions or similar. Prometheus, Grafana, Elastic Stack / LGTM, OpenTelemetry or similar. Experience working with identity and access technologies such as OIDC, SAML, SCIM or Keycloak. Experience with virtualisation or infrastructure platforms such as KVM, libvirt or VMware. Scripting or tooling experience using Go, Python, shell scripting or similar. Strong troubleshooting, problem solving and analytical skills. Experience working in secure, regulated or enterprise scale environments. Strong communication skills, with the ability to produce clear documentation, runbooks, post mortems and technical guidance. Eligible to hold UK SC clearance. Desirable (Not Essential) Specific OpenShift or OKD experience, including operators, MachineConfig or SCCs. Service mesh experience such as Istio or Linkerd. Policy engine experience such as OPA, Gatekeeper or Kyverno. Cloud native application deployment experience using Helm, Terraform, Kustomize or similar. Storage experience such as Ceph, Longhorn, OpenShift Data Foundation or equivalent. Networking experience including BGP, VXLAN, Palo Alto or Juniper technologies. Software supply chain security experience, including SBOMs, image signing, admission control or tools such as Sigstore. Experience operating AI, ML or GPU enabled platforms. CKA, CKAD, CKS, Red Hat certifications or equivalent. Active or recent UK SC clearance. Recognised open source contributions to the Kubernetes ecosystem. Soft Skills & Behaviours Calm, structured and methodical under pressure. Strong written and verbal communication skills. Collaborative working style across platform, development, QA, security, networking and architecture teams. Strong sense of ownership and accountability. Automation first mindset, with a focus on removing repeatable manual work. Able to influence technical practice through evidence, example and credibility. Pragmatic and solutions focused approach to problem solving. Curious about why systems fail, not just how to bring them back online. . click apply for full job details

07/06/2026

Full time

Role: Senior Site Reliability Engineer (SRE) - Kubernetes / OpenShift Location: Remote - UK (possible paid occasional travel to TIG Secure site locations as required) Job Type: Full-time, Permanent (37.5 hours) Salary: Competitive + benefits + package Security Clearance Requirements Please note that holding a current Security Clearance is not essential at the time of application, but eligibility is required. This role requires the successful candidate to be eligible for Security Check (SC) clearance. To meet this requirement, applicants must: Have the right to work in the UK Have lived in the UK continuously for the past 5 years Not have spent more than 6 months outside the UK in total during that period Be willing to undergo security vetting as part of the onboarding process About You You're an experienced SRE, Platform Engineer or Cloud Engineer with strong hands on experience running Kubernetes in production environments. You're comfortable working across Linux, Kubernetes, cloud native tooling, automation, observability, CI/CD and infrastructure as code. You understand that reliability, security and operational maturity are critical to how modern platforms support engineering teams and customer facing services. You enjoy treating infrastructure as a product, automating repeatable work, improving resilience, and building platforms that other engineers can rely on. You're calm under pressure, methodical during incidents, and able to turn operational challenges into long term improvements. You may have worked in a regulated, secure, government, defence, financial services, telecoms, managed services or cloud native environment, but most importantly you have operated Kubernetes at depth and understand the realities of production ownership. You're a senior individual contributor who can mentor others, influence engineering practice, and provide technical authority without needing formal line management responsibility. About the Role We're looking for a Senior Site Reliability Engineer (SRE) to help operate, harden and mature our production OKD / Kubernetes platforms. This is a hands on engineering role focused on reliability, automation, observability, GitOps, CI/CD and secure platform operations. You'll work across the full stack, from bare metal and virtualisation through to Kubernetes control plane operations, ingress, identity, monitoring, developer platform tooling and application delivery. The role will play a key part in improving the operational maturity of our platform estate, supporting the migration from VMware to KVM, strengthening GitOps and CI/CD practices, and helping ensure our platforms remain secure, scalable and aligned to the needs of regulated customer environments. You'll work closely with platform, application, AI, networking, security, QA and architecture teams to build reliable foundations that enable other engineering teams to deliver safely and at pace. This is not a ticket handling role. It is a senior engineering position where you'll be expected to own problems, drive improvements, and help shape how TIG operates critical cloud native infrastructure. About the Team You'll be joining our Cloud team, working closely with Platform Engineering and wider engineering teams responsible for the foundational platforms on which TIG's services run. This is a great opportunity to join a small, senior technical environment where you can have direct ownership, meaningful influence, and visibility across modern platform engineering, Kubernetes, automation, observability, security and cloud native delivery. Key Responsibilities Operate, harden and extend production OpenShift / OKD / Kubernetes clusters across on premises and hybrid environments. Support the migration from VMware to KVM, helping modernise the underlying compute and storage layer. Own and improve CI/CD processes across the full lifecycle of platform and application components. Work with platform and application engineers to support cloud native delivery using tools such as Helm and Kustomize. Develop and mature GitOps deployment practices using tools such as Argo CD or Flux. Maintain and improve core platform services including identity, ingress, observability, certificate management, service mesh and container registry capabilities. Build and operate observability across logs, metrics, traces, alerting, SLOs and error budgets. Improve platform hardening in line with secure and regulated environment requirements, including network policy, SELinux, image provenance, secret management and audit. Automate repeatable operational tasks using tools such as Ansible, Terraform, Helm, Kustomize, Go, Python or equivalent technologies. Lead incident response activity, support blameless post mortems and drive systemic fixes. Partner with networking and security teams on platform integration, segmentation, load balancing and accreditation evidence. Create and maintain clear technical documentation, runbooks, design notes and operational guidance. Mentor other engineers and act as a senior technical authority across cloud and Kubernetes operations. Participate in an on call rota, with appropriate compensation. Success in This Role Looks Like A more reliable, secure and measurable production Kubernetes estate. Improved platform observability, with meaningful alerting, SLOs and trend data that engineering teams actively use. Progress against the VMware to KVM migration, with a clear and automated path for the underlying infrastructure layer. A mature GitOps approach covering platform and application components, including rollback, drift detection and operational control. Improved CI/CD practices that help teams move at pace while considering security, QA and compliance earlier in the lifecycle. Well documented, supportable and scalable platform services. Stronger incident response, clearer runbooks and post mortems that lead to real operational improvements. Recognition as a technical authority for Kubernetes, cloud and platform operations across the organisation. What We're Looking For We're looking for a Senior Site Reliability Engineer (SRE) with strong experience operating production Kubernetes environments. This role is well suited to someone who combines deep technical capability with strong operational discipline. You'll be comfortable taking ownership of complex platform challenges, improving reliability, and working collaboratively across engineering, security, networking and architecture teams. Essential Experience & Skills Strong experience running production Kubernetes environments, not just consuming or deploying into them. Strong Linux fundamentals, including systemd, networking, storage and performance troubleshooting. Experience with at least one Kubernetes distribution such as OKD, OpenShift, vanilla Kubernetes, Rancher, EKS, AKS or GKE. Solid infrastructure as code experience, including Ansible plus Terraform or equivalent, alongside tools such as Helm and Kustomize. GitOps and CI/CD experience managing full application and component lifecycles, using tools such as Argo CD, Flux, GitHub Actions or similar. Prometheus, Grafana, Elastic Stack / LGTM, OpenTelemetry or similar. Experience working with identity and access technologies such as OIDC, SAML, SCIM or Keycloak. Experience with virtualisation or infrastructure platforms such as KVM, libvirt or VMware. Scripting or tooling experience using Go, Python, shell scripting or similar. Strong troubleshooting, problem solving and analytical skills. Experience working in secure, regulated or enterprise scale environments. Strong communication skills, with the ability to produce clear documentation, runbooks, post mortems and technical guidance. Eligible to hold UK SC clearance. Desirable (Not Essential) Specific OpenShift or OKD experience, including operators, MachineConfig or SCCs. Service mesh experience such as Istio or Linkerd. Policy engine experience such as OPA, Gatekeeper or Kyverno. Cloud native application deployment experience using Helm, Terraform, Kustomize or similar. Storage experience such as Ceph, Longhorn, OpenShift Data Foundation or equivalent. Networking experience including BGP, VXLAN, Palo Alto or Juniper technologies. Software supply chain security experience, including SBOMs, image signing, admission control or tools such as Sigstore. Experience operating AI, ML or GPU enabled platforms. CKA, CKAD, CKS, Red Hat certifications or equivalent. Active or recent UK SC clearance. Recognised open source contributions to the Kubernetes ecosystem. Soft Skills & Behaviours Calm, structured and methodical under pressure. Strong written and verbal communication skills. Collaborative working style across platform, development, QA, security, networking and architecture teams. Strong sense of ownership and accountability. Automation first mindset, with a focus on removing repeatable manual work. Able to influence technical practice through evidence, example and credibility. Pragmatic and solutions focused approach to problem solving. Curious about why systems fail, not just how to bring them back online. . click apply for full job details

Infrastructure Engineer

Synthesia

About the role We're looking for an experienced DevOps Engineer to join our Cloud Infra team at Synthesia. Cloud Infra is a group that enables our Product engineers to build, and deploy Synthesia state of the art technologies. You can expect to work across cloud infrastructure, CI/CD pipelines, observability, and tooling, with autonomy to identify and fix bottlenecks in a fast moving AI company. This is a hands on senior IC role (roughly level 5 scope). You'll be joining a growing team that's shifting from enablement to direct execution, and you'll help shape how we scale our infrastructure over the next year. What you'll do Maintain and scale Kubernetes (EKS) clusters - managing workloads, deployments, and monitoring at production scale. Manage and evolve our AWS (and some GCP) cloud environments, balancing reliability, cost, and velocity. Own and improve our CI/CD systems (GitHub Actions on our self hosted AWS runners). Define and implement Infrastructure as Code using Terraform and Terragrunt. Strengthen observability via Datadog and enable teams to understand their systems in production. Collaborate with Product Engineers to deploy and monitor production services. Drive FinOps practices: vendor management, cost allocation, and financial feedback loops. Contribute to internal tooling, automation, and reporting platforms that improve developer experience. You'll thrive in this role if you have: Deep hands on DevOps / SRE / Platform experience in a SaaS or high traffic product environment. Strong Kubernetes experience - spinning up and managing clusters, not just consuming them. Proven AWS and/or GCP expertise. Proficiency with Terraform / Terragrunt, Linux, and Python scripting. Strong understanding of CI/CD design patterns. Experience with Datadog or similar observability tooling. Comfortable operating autonomously in ambiguous environments. A pragmatic mindset - focusing on scalable, maintainable solutions over theoretical perfection. A bias toward execution and written communication, especially in remote contexts. Bonus points Familiarity with Temporal.io, or workflow orchestration frameworks. Light frontend or tooling development experience (React, Node.js). Previous work supporting AI research or data intensive environments. Other important info This is a remote role from an EU country, UK or Switzerland or hybrid from one of our London, Munich, Copenhagen, or Zurich hubs. This is full time employment only - no contractors possible - usually through OysterHR or a local entity. We only sponsor visas if you are in the UK or some EU countries already.

07/06/2026

Full time

About the role We're looking for an experienced DevOps Engineer to join our Cloud Infra team at Synthesia. Cloud Infra is a group that enables our Product engineers to build, and deploy Synthesia state of the art technologies. You can expect to work across cloud infrastructure, CI/CD pipelines, observability, and tooling, with autonomy to identify and fix bottlenecks in a fast moving AI company. This is a hands on senior IC role (roughly level 5 scope). You'll be joining a growing team that's shifting from enablement to direct execution, and you'll help shape how we scale our infrastructure over the next year. What you'll do Maintain and scale Kubernetes (EKS) clusters - managing workloads, deployments, and monitoring at production scale. Manage and evolve our AWS (and some GCP) cloud environments, balancing reliability, cost, and velocity. Own and improve our CI/CD systems (GitHub Actions on our self hosted AWS runners). Define and implement Infrastructure as Code using Terraform and Terragrunt. Strengthen observability via Datadog and enable teams to understand their systems in production. Collaborate with Product Engineers to deploy and monitor production services. Drive FinOps practices: vendor management, cost allocation, and financial feedback loops. Contribute to internal tooling, automation, and reporting platforms that improve developer experience. You'll thrive in this role if you have: Deep hands on DevOps / SRE / Platform experience in a SaaS or high traffic product environment. Strong Kubernetes experience - spinning up and managing clusters, not just consuming them. Proven AWS and/or GCP expertise. Proficiency with Terraform / Terragrunt, Linux, and Python scripting. Strong understanding of CI/CD design patterns. Experience with Datadog or similar observability tooling. Comfortable operating autonomously in ambiguous environments. A pragmatic mindset - focusing on scalable, maintainable solutions over theoretical perfection. A bias toward execution and written communication, especially in remote contexts. Bonus points Familiarity with Temporal.io, or workflow orchestration frameworks. Light frontend or tooling development experience (React, Node.js). Previous work supporting AI research or data intensive environments. Other important info This is a remote role from an EU country, UK or Switzerland or hybrid from one of our London, Munich, Copenhagen, or Zurich hubs. This is full time employment only - no contractors possible - usually through OysterHR or a local entity. We only sponsor visas if you are in the UK or some EU countries already.

DevOps Engineer IV

Elsevier

DevOps Engineer IVApplylocations: London: Amsterdamtime type: Full timeposted on: Posted Yesterdayjob requisition id: R108966Would you like to be part of an engineering team that develops and maintains our cloud and platform services?Are you motivated by seeing how your work makes a difference to product delivery and customer outcomes? About our Team We are a growing, global team of 2000 entrepreneurial digital technologists. Collaborating in small agile teams, we have the freedom to leverage the latest technologies to build digital solutions. We create products that help scientists make breakthroughs and health professionals impact lives. About the Role As a DevOps Engineer, you will be an integral member of our team and part our global infrastructure group. You will be responsible for building and supporting products across our Life Sciences portfolio. You will help to improve the developer experience, enabling product teams to deliver quality software to customers easily and quickly. Responsibilities: • Working within our DevOps squad to ensure the reliability, scalability, performance and security of our platform and products • Identifying improvements to common tasks and self-service modules to allow teams to move at pace across the SDLC • Designing, building, and maintaining cost effective cloud environments for our organisation • Collaborating with central platform teams to support the on-boarding of products to our platforms • Seeking out and implementing efficiencies to erase toil to allow more focus more on project work and lasting improvements • Acting as a senior escalation point for internal and external customers to provide support for production issues Requirements: • Have experience working in global cross-functional teams to solve problems at scale with innovative solutions • Have experience DevOps and SRE (Site Reliability Engineer) best-practices • Have advanced skills in AWS, Linux, Puppet, Jenkins, Docker, Kubernetes, Terraform and GitHub • Have experience designing highly available and secure distributed systems that scale using AWS, Linux, and Kubernetes • Have experience in standardising and improving the deployment process using common platforms and CI/CD pipelines • Have a solid understanding of how tools like New Relic or Grafana can help observe product performance and identify root causes • Be part of an on-call rotation to provide support for our platform and customers • Enjoy creating efficiency through automation and 'Infrastructure as Code' using modern languages like Python or Go Work in a way that works for you We promote a healthy work/life balance across the organisation. With an average length of service of 9 years, we are confident that we offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals. • Working remotely from home or in our office in a flexible hybrid style • Working flexible hours - flexing the times you work in the day Working with us We are an equal opportunity employer with a commitment to help you succeed. Here, you will find an inclusive, agile, collaborative, innovative and fun environment, where everyone has a part to play. Regardless of the team you join, we promote a diverse environment with co-workers who are passionate about what they do, and how they do it. Working for you At Elsevier, we know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer: • Generous holiday allowance with the option to buy additional days • Health screening, eye care vouchers and private medical benefits • Wellbeing programs • Life assurance • Access to a competitive contributory pension scheme • Long service awards • Save As You Earn share option scheme • Travel Season ticket loan • Maternity, paternity and shared parental leave • Access to emergency care for both the elderly and children • RECARES days, giving you time to support the charities and causes that matter to you • Access to employee resource groups with dedicated time to volunteer • Access to extensive learning and development resources • Access to employee discounts via Perks at Work About Us A global leader in information and analytics, we help researchers and healthcare professionals advance science and improve health outcomes for the benefit of society. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science and research, health education and interactive learning, as well as exceptional healthcare and clinical practice. At Elsevier, your work contributes to the world's grand challenges and a more sustainable future. We harness innovative technologies to support science and healthcare to partner for a better world. Join Us Are you ready to help us progress science and health? Our technology leads to innovation, so join a forward-thinking digital business that is tackling world-scale challenges and align your ambitions with our passion for driving global knowledge-sharing. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here . Please read our Candidate Privacy Policy.We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

06/06/2026

Full time

DevOps Engineer IVApplylocations: London: Amsterdamtime type: Full timeposted on: Posted Yesterdayjob requisition id: R108966Would you like to be part of an engineering team that develops and maintains our cloud and platform services?Are you motivated by seeing how your work makes a difference to product delivery and customer outcomes? About our Team We are a growing, global team of 2000 entrepreneurial digital technologists. Collaborating in small agile teams, we have the freedom to leverage the latest technologies to build digital solutions. We create products that help scientists make breakthroughs and health professionals impact lives. About the Role As a DevOps Engineer, you will be an integral member of our team and part our global infrastructure group. You will be responsible for building and supporting products across our Life Sciences portfolio. You will help to improve the developer experience, enabling product teams to deliver quality software to customers easily and quickly. Responsibilities: • Working within our DevOps squad to ensure the reliability, scalability, performance and security of our platform and products • Identifying improvements to common tasks and self-service modules to allow teams to move at pace across the SDLC • Designing, building, and maintaining cost effective cloud environments for our organisation • Collaborating with central platform teams to support the on-boarding of products to our platforms • Seeking out and implementing efficiencies to erase toil to allow more focus more on project work and lasting improvements • Acting as a senior escalation point for internal and external customers to provide support for production issues Requirements: • Have experience working in global cross-functional teams to solve problems at scale with innovative solutions • Have experience DevOps and SRE (Site Reliability Engineer) best-practices • Have advanced skills in AWS, Linux, Puppet, Jenkins, Docker, Kubernetes, Terraform and GitHub • Have experience designing highly available and secure distributed systems that scale using AWS, Linux, and Kubernetes • Have experience in standardising and improving the deployment process using common platforms and CI/CD pipelines • Have a solid understanding of how tools like New Relic or Grafana can help observe product performance and identify root causes • Be part of an on-call rotation to provide support for our platform and customers • Enjoy creating efficiency through automation and 'Infrastructure as Code' using modern languages like Python or Go Work in a way that works for you We promote a healthy work/life balance across the organisation. With an average length of service of 9 years, we are confident that we offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals. • Working remotely from home or in our office in a flexible hybrid style • Working flexible hours - flexing the times you work in the day Working with us We are an equal opportunity employer with a commitment to help you succeed. Here, you will find an inclusive, agile, collaborative, innovative and fun environment, where everyone has a part to play. Regardless of the team you join, we promote a diverse environment with co-workers who are passionate about what they do, and how they do it. Working for you At Elsevier, we know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer: • Generous holiday allowance with the option to buy additional days • Health screening, eye care vouchers and private medical benefits • Wellbeing programs • Life assurance • Access to a competitive contributory pension scheme • Long service awards • Save As You Earn share option scheme • Travel Season ticket loan • Maternity, paternity and shared parental leave • Access to emergency care for both the elderly and children • RECARES days, giving you time to support the charities and causes that matter to you • Access to employee resource groups with dedicated time to volunteer • Access to extensive learning and development resources • Access to employee discounts via Perks at Work About Us A global leader in information and analytics, we help researchers and healthcare professionals advance science and improve health outcomes for the benefit of society. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science and research, health education and interactive learning, as well as exceptional healthcare and clinical practice. At Elsevier, your work contributes to the world's grand challenges and a more sustainable future. We harness innovative technologies to support science and healthcare to partner for a better world. Join Us Are you ready to help us progress science and health? Our technology leads to innovation, so join a forward-thinking digital business that is tackling world-scale challenges and align your ambitions with our passion for driving global knowledge-sharing. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here . Please read our Candidate Privacy Policy.We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

DevOps Engineer

DevOps projects Manchester, Lancashire

Join 1,800+ DevOps engineers getting weekly alerts for remote and US, EU roles that don't show up on the big boards. Junior to senior. Kubernetes, AWS, Terraform - filtered for your stack. Ensure digital sovereignty for your infrastructure. Get EU static IPs withfull data residency for compliance and peace of mind. Computer and Network Security 11 employees Manchester, UK, GB Est. 2015 CultureAI's innovative Human Risk Management Platform empowers you to identify workforce security risks, coach employees in the moment, and automate fixes. Strengthen resilience against phishing, Hey there, join us in revolutionising the way we understand and mitigate human risk! We're looking for our first DevOps Engineer to join our ever-growing Engineering team. We want this person to be responsible for maintaining the product production and technical infrastructure in an optimised state. The ideal candidate will be well versed in a DevOps role within a growth company with strong AWS, networking, Docker and Linux admin skills. Who are we? At CultureAI, we aim to help organisations prevent cyber breaches. We're looking for passionate, driven people to join our team and help us transform how businesses manage human cyber risk. We're disrupting the security awareness industry with our data driven approach to measuring employee security behaviours and driving personalised security coaching and interventions. Our mission is to make the world more secure. We see direct parallels between how we elevate security through our company and empower it within. We believe in creating a safe space for our employees to express, innovate and educate. Diversity and inclusion are at the core of what we do, helping us drive our security ambitions and making the world a more secure place for all. Day in the life Production-level support of our SaaS product hosted on AWS. Support the development of the product by building the best technical environment possible for the team to work within. Leading on projects to implement new systems and processes to both support the engineering team and maintain technical infrastructure behind all the systems. Building and maintaining the local software development environment used by Engineers alongside our AWS infrastructure to satisfy the development team. What you bring to the team The ideal candidate will have previously created and maintained software development environments utilising AWS technologies. This will require exceptional skills across multiple technical platforms and programming languages. Strong AWS experience across EC2, VPCs, IAM, S3, SQS, RDS (MySQL), ElastiCache (Redis). Advanced Docker skills for both developer and production environments. Implementing and managing production Kubernetes clusters with EKS including scaling and monitoring, Calico for policy management and Helm for packing. Good technical knowledge of networking including DNS, routing, subnets, NAT, network ACLs, firewalls, security groups. Experience working with software engineering environments across both Windows and Mac. Linux system admin and exceptional scripting skills (Bash/Python). CI/CD implementation/management & Git for version control. Beneficial Knowledge SRE tooling such as Datadog and Sentry MS EntraID, Google Workplace, OKTA management DataBricks/Kafka JetBrains TeamCity, YouTrack & Space Infrastructure as Code using Terraform/ AWS cloud formation What do we offer? Private healthcare scheme Share options 30 days holiday (including 3 CultureAI closure days to prioritise rest for our team at the end of the year) + bank holidays Remote work or pet friendly offices, the choice is yours Training opportunities Regular socials (non compulsory and not just going to the pub) Plus many more Next steps Stage 1 - Exploring your goals with our internal Recruiter Stage 2 - Deep dive with our VP of Engineering Stage 3 - Take home Technical Assessment Stage 4 - Panel interview with our VP of Engineering, CTO and Senior Developer Built and hosted in the EU we keep your data safe

06/06/2026

Full time

Join 1,800+ DevOps engineers getting weekly alerts for remote and US, EU roles that don't show up on the big boards. Junior to senior. Kubernetes, AWS, Terraform - filtered for your stack. Ensure digital sovereignty for your infrastructure. Get EU static IPs withfull data residency for compliance and peace of mind. Computer and Network Security 11 employees Manchester, UK, GB Est. 2015 CultureAI's innovative Human Risk Management Platform empowers you to identify workforce security risks, coach employees in the moment, and automate fixes. Strengthen resilience against phishing, Hey there, join us in revolutionising the way we understand and mitigate human risk! We're looking for our first DevOps Engineer to join our ever-growing Engineering team. We want this person to be responsible for maintaining the product production and technical infrastructure in an optimised state. The ideal candidate will be well versed in a DevOps role within a growth company with strong AWS, networking, Docker and Linux admin skills. Who are we? At CultureAI, we aim to help organisations prevent cyber breaches. We're looking for passionate, driven people to join our team and help us transform how businesses manage human cyber risk. We're disrupting the security awareness industry with our data driven approach to measuring employee security behaviours and driving personalised security coaching and interventions. Our mission is to make the world more secure. We see direct parallels between how we elevate security through our company and empower it within. We believe in creating a safe space for our employees to express, innovate and educate. Diversity and inclusion are at the core of what we do, helping us drive our security ambitions and making the world a more secure place for all. Day in the life Production-level support of our SaaS product hosted on AWS. Support the development of the product by building the best technical environment possible for the team to work within. Leading on projects to implement new systems and processes to both support the engineering team and maintain technical infrastructure behind all the systems. Building and maintaining the local software development environment used by Engineers alongside our AWS infrastructure to satisfy the development team. What you bring to the team The ideal candidate will have previously created and maintained software development environments utilising AWS technologies. This will require exceptional skills across multiple technical platforms and programming languages. Strong AWS experience across EC2, VPCs, IAM, S3, SQS, RDS (MySQL), ElastiCache (Redis). Advanced Docker skills for both developer and production environments. Implementing and managing production Kubernetes clusters with EKS including scaling and monitoring, Calico for policy management and Helm for packing. Good technical knowledge of networking including DNS, routing, subnets, NAT, network ACLs, firewalls, security groups. Experience working with software engineering environments across both Windows and Mac. Linux system admin and exceptional scripting skills (Bash/Python). CI/CD implementation/management & Git for version control. Beneficial Knowledge SRE tooling such as Datadog and Sentry MS EntraID, Google Workplace, OKTA management DataBricks/Kafka JetBrains TeamCity, YouTrack & Space Infrastructure as Code using Terraform/ AWS cloud formation What do we offer? Private healthcare scheme Share options 30 days holiday (including 3 CultureAI closure days to prioritise rest for our team at the end of the year) + bank holidays Remote work or pet friendly offices, the choice is yours Training opportunities Regular socials (non compulsory and not just going to the pub) Plus many more Next steps Stage 1 - Exploring your goals with our internal Recruiter Stage 2 - Deep dive with our VP of Engineering Stage 3 - Take home Technical Assessment Stage 4 - Panel interview with our VP of Engineering, CTO and Senior Developer Built and hosted in the EU we keep your data safe

SRE Technical Lead

Adecco Reading, Berkshire

SRE Technical Lead Reading/Hybrid (UK-based - mix of home, office, and client site) Must be eligible for SC Clearance We are seeking an experienced SRE Technical Lead to act as the technical authority for Site Reliability Engineering across complex, large-scale platforms. This is a senior, client-facing leadership role where you will be responsible for driving reliability, availability, and operational excellence across multi-team and multi-vendor environments. You will combine hands-on engineering expertise with strategic leadership, ensuring SRE practices are Embedded across the full service life cycle-from design through to production operations. As the SRE Technical Lead, you will: Define and implement SRE strategy, standards, and best practices, including SLAs, SLOs, and error budgets Embed reliability principles into platform and service design from the outset Lead key SRE practices such as reliability reviews, operational readiness, and toil reduction Drive automation across monitoring, incident response, and remediation Act as the technical escalation point for major incidents and high-risk releases Lead blameless post-incident reviews and ensure continuous improvement Establish observability and capacity management practices using modern tooling Identify and eliminate systemic reliability risks and operational inefficiencies Collaborate with engineering, platform, security, and operations teams across multiple vendors Provide coaching and mentorship to engineers, raising SRE capability across the organisation Essential experience: Deep expertise in Kubernetes and/or OpenShift Experience working in multi-cloud or hybrid cloud environments Strong understanding of SRE principles (SLOs, SLAs, error budgets, reliability engineering) Hands-on experience with observability tooling (eg, Prometheus, Grafana, OpenTelemetry, Loki, Tempo) Strong knowledge of Infrastructure as Code and GitOps (eg, Helm, Kustomize, ArgoCD, Tekton) Experience with CI/CD pipelines and automation Proven ability to operate as a technical leader in complex, multi-team environments

05/06/2026

Full time

SRE Technical Lead Reading/Hybrid (UK-based - mix of home, office, and client site) Must be eligible for SC Clearance We are seeking an experienced SRE Technical Lead to act as the technical authority for Site Reliability Engineering across complex, large-scale platforms. This is a senior, client-facing leadership role where you will be responsible for driving reliability, availability, and operational excellence across multi-team and multi-vendor environments. You will combine hands-on engineering expertise with strategic leadership, ensuring SRE practices are Embedded across the full service life cycle-from design through to production operations. As the SRE Technical Lead, you will: Define and implement SRE strategy, standards, and best practices, including SLAs, SLOs, and error budgets Embed reliability principles into platform and service design from the outset Lead key SRE practices such as reliability reviews, operational readiness, and toil reduction Drive automation across monitoring, incident response, and remediation Act as the technical escalation point for major incidents and high-risk releases Lead blameless post-incident reviews and ensure continuous improvement Establish observability and capacity management practices using modern tooling Identify and eliminate systemic reliability risks and operational inefficiencies Collaborate with engineering, platform, security, and operations teams across multiple vendors Provide coaching and mentorship to engineers, raising SRE capability across the organisation Essential experience: Deep expertise in Kubernetes and/or OpenShift Experience working in multi-cloud or hybrid cloud environments Strong understanding of SRE principles (SLOs, SLAs, error budgets, reliability engineering) Hands-on experience with observability tooling (eg, Prometheus, Grafana, OpenTelemetry, Loki, Tempo) Strong knowledge of Infrastructure as Code and GitOps (eg, Helm, Kustomize, ArgoCD, Tekton) Experience with CI/CD pipelines and automation Proven ability to operate as a technical leader in complex, multi-team environments

Cloud Architect

Hollybank Trustees Ltd Ilkley, Yorkshire

Location: Hybrid / Ilkley LS29 8FL, UK job type: Permanent / Full-time Sector and subsector: Technology Systems & Infrastructure Salary: Competitive salary SmartSearch's distinctive Anti-Money Laundering verification software protects our clients by offering the most advanced and comprehensive features available from an AML provider. SmartSearch has grown rapidly by fostering an incredibly collaborative and supportive culture. As we continue our ambitious growth plans, we will strive to remain a truly exciting, rewarding, and unique place to work. HOW WILL YOU MAKE A DIFFERENCE? We are looking for a Cloud Architect who will be responsible for defining, designing, and contributing to the delivery and continuous improvement of secure, scalable, resilient, and cost-effective cloud solutions that support our applications, services, and wider technology strategy. Reporting to the Head of Cloud Engineering, you will work closely with software engineering, SRE, operations, security, and business stakeholders to shape cloud architecture standards, create reference architectures, review technical designs, and ensure cloud environments follow best practices for security, reliability, performance, compliance, and cost optimisation. This is an architecture led role with a strong hands on element. You will be expected to contribute directly to cloud platform improvements, infrastructure as code, CI/CD pipelines, automation, Kubernetes environments, and cloud native services, helping teams turn architectural designs into practical, maintainable, and operationally effective solutions while driving ongoing improvements to cloud platforms, tooling, and engineering standards. VARIED DAY TO DAY RESPONSIBILITIES Defining and maintaining cloud architecture principles, standards, patterns, and best practices Designing secure, scalable, resilient, and cost-effective cloud solutions to support business critical services Contributing directly to the implementation and improvement of cloud infrastructure, platforms, automation, and tooling Creating and maintaining reference architectures for cloud platforms, application hosting, networking, security, and observability Providing technical leadership and practical engineering guidance to software engineering, SRE, operations, and security teams Reviewing cloud solution designs to ensure alignment with security, compliance, reliability, performance, and cost requirements Designing and evolving Azure landing zones, network topologies, identity models, and cloud governance controls Implementing and improving infrastructure as code to support repeatable, auditable, and well governed deployments Supporting the design, implementation, and improvement of CI/CD and GitOps based delivery approaches Working closely with development teams to ensure applications are designed, deployed, and operated effectively in the cloud Advising on and contributing to the use of Kubernetes, cloud native services, managed platforms, and modern application architectures Defining and supporting strategies for monitoring, logging, alerting, backup, recovery, and disaster recovery Supporting capacity planning, scaling strategies, resilience improvements, and cost optimisation initiatives Troubleshooting and helping resolve complex cloud infrastructure and platform issues where architectural input or hands on support is required Producing clear documentation, including architecture diagrams, design decisions, standards, runbooks, and operational guidance Assessing new cloud technologies and recommending practical improvements to platforms, tooling, and engineering practices Contributing to continuous improvement of cloud architecture, platform strategy, technical governance, and engineering standards WHAT ARE WE LOOKING FOR IN A CANDIDATE? Strong experience designing, building, and operating cloud solutions in production environments Strong knowledge of cloud platforms, particularly Microsoft Azure Experience defining cloud architecture standards, patterns, principles, and best practices Ability to contribute hands on to cloud engineering activities, including infrastructure, automation, deployment, and platform improvements Strong understanding of cloud infrastructure, platform services, networking, security, resilience, and scalability Experience designing and contributing to secure cloud landing zones, network architectures, identity models, and governance frameworks Strong working knowledge of infrastructure as code tools such as Terraform, Bicep, or Ansible Experience with modern application architectures, including microservices, containerised workloads, APIs, and event driven systems Experience designing, supporting, and improving container platforms such as Kubernetes, AKS, and container registries Understanding of CI/CD pipelines, GitOps workflows, deployment strategies, and release practices Strong understanding of cloud security principles, identity and access management, compliance, and risk management Ability to design and contribute to backup, recovery, high availability, and disaster recovery strategies Experience supporting architecture and delivery decisions across multiple environments, including development, test, staging, and production Ability to work with technical and non technical stakeholders to translate business requirements into cloud solutions Strong written and verbal communication skills, with the ability to produce clear architectural and operational documentation Desire to continuously learn and stay current with evolving cloud technologies, architecture patterns, and industry best practices ADVANTAGES 5+ years' experience in cloud, platform, DevOps, infrastructure, or software engineering roles 2+ years' experience in a Cloud Architect, Solution Architect, Platform Architect, Senior Cloud Engineer, or similar role Experience with Azure native services such as AKS, App Services, Azure SQL, Azure Storage, Azure Monitor, Key Vault, Azure Policy, and Azure Networking Experience designing and contributing to cloud landing zones and multi subscription or multi environment Azure estates Familiarity with observability and monitoring tools such as Grafana, Prometheus, Azure Monitor, or cloud native equivalents Working knowledge of CI/CD, GitOps workflows, deployment strategies, and release governance Strong automation and scripting skills using Bash, PowerShell, Python, or Go Experience managing cloud costs and implementing cost optimisation strategies Understanding of reliability engineering, availability, scalability, and performance considerations in cloud platforms Experience contributing to technology roadmaps, architecture reviews, or technical governance forums Familiarity with architecture documentation methods such as diagrams, decision records, standards, and design reviews WHAT IS LIFE LIKE AT SMARTSEARCH? We are a multi award winning Tech company with an aspirational mentality Some of our most recent recognitions include: named in the renownedRegTech100 list for 2024, listed in theTop 100 Fasted Growing Tech CompaniesbyNorthern Tech Awards2024as well as being namedTechnology Provider of the YearbyCorporate Finance Awards 2024 We have beenGreat Place To Work Certified since 2022 There are excellent progression opportunities due to our growth and you will have personal development goals, regular feedback and support We are a diverse and inclusive team committed to promoting Diversity & Inclusion and Social Responsibility. Through our DE&I group, charitable initiatives and support for local schools, we actively foster a positive Impact on our community COMPANY BENEFITS 25 days holiday rising to 30 with each year of service Private Medical Insurance covering dental and optical Company pension scheme Life Assurance - 4x your annual salary 1 day paid volunteering per year Enhanced maternity / paternity offerings Employee Assistance Programme Cycle to work scheme Access to a gym

05/06/2026

Full time

Location: Hybrid / Ilkley LS29 8FL, UK job type: Permanent / Full-time Sector and subsector: Technology Systems & Infrastructure Salary: Competitive salary SmartSearch's distinctive Anti-Money Laundering verification software protects our clients by offering the most advanced and comprehensive features available from an AML provider. SmartSearch has grown rapidly by fostering an incredibly collaborative and supportive culture. As we continue our ambitious growth plans, we will strive to remain a truly exciting, rewarding, and unique place to work. HOW WILL YOU MAKE A DIFFERENCE? We are looking for a Cloud Architect who will be responsible for defining, designing, and contributing to the delivery and continuous improvement of secure, scalable, resilient, and cost-effective cloud solutions that support our applications, services, and wider technology strategy. Reporting to the Head of Cloud Engineering, you will work closely with software engineering, SRE, operations, security, and business stakeholders to shape cloud architecture standards, create reference architectures, review technical designs, and ensure cloud environments follow best practices for security, reliability, performance, compliance, and cost optimisation. This is an architecture led role with a strong hands on element. You will be expected to contribute directly to cloud platform improvements, infrastructure as code, CI/CD pipelines, automation, Kubernetes environments, and cloud native services, helping teams turn architectural designs into practical, maintainable, and operationally effective solutions while driving ongoing improvements to cloud platforms, tooling, and engineering standards. VARIED DAY TO DAY RESPONSIBILITIES Defining and maintaining cloud architecture principles, standards, patterns, and best practices Designing secure, scalable, resilient, and cost-effective cloud solutions to support business critical services Contributing directly to the implementation and improvement of cloud infrastructure, platforms, automation, and tooling Creating and maintaining reference architectures for cloud platforms, application hosting, networking, security, and observability Providing technical leadership and practical engineering guidance to software engineering, SRE, operations, and security teams Reviewing cloud solution designs to ensure alignment with security, compliance, reliability, performance, and cost requirements Designing and evolving Azure landing zones, network topologies, identity models, and cloud governance controls Implementing and improving infrastructure as code to support repeatable, auditable, and well governed deployments Supporting the design, implementation, and improvement of CI/CD and GitOps based delivery approaches Working closely with development teams to ensure applications are designed, deployed, and operated effectively in the cloud Advising on and contributing to the use of Kubernetes, cloud native services, managed platforms, and modern application architectures Defining and supporting strategies for monitoring, logging, alerting, backup, recovery, and disaster recovery Supporting capacity planning, scaling strategies, resilience improvements, and cost optimisation initiatives Troubleshooting and helping resolve complex cloud infrastructure and platform issues where architectural input or hands on support is required Producing clear documentation, including architecture diagrams, design decisions, standards, runbooks, and operational guidance Assessing new cloud technologies and recommending practical improvements to platforms, tooling, and engineering practices Contributing to continuous improvement of cloud architecture, platform strategy, technical governance, and engineering standards WHAT ARE WE LOOKING FOR IN A CANDIDATE? Strong experience designing, building, and operating cloud solutions in production environments Strong knowledge of cloud platforms, particularly Microsoft Azure Experience defining cloud architecture standards, patterns, principles, and best practices Ability to contribute hands on to cloud engineering activities, including infrastructure, automation, deployment, and platform improvements Strong understanding of cloud infrastructure, platform services, networking, security, resilience, and scalability Experience designing and contributing to secure cloud landing zones, network architectures, identity models, and governance frameworks Strong working knowledge of infrastructure as code tools such as Terraform, Bicep, or Ansible Experience with modern application architectures, including microservices, containerised workloads, APIs, and event driven systems Experience designing, supporting, and improving container platforms such as Kubernetes, AKS, and container registries Understanding of CI/CD pipelines, GitOps workflows, deployment strategies, and release practices Strong understanding of cloud security principles, identity and access management, compliance, and risk management Ability to design and contribute to backup, recovery, high availability, and disaster recovery strategies Experience supporting architecture and delivery decisions across multiple environments, including development, test, staging, and production Ability to work with technical and non technical stakeholders to translate business requirements into cloud solutions Strong written and verbal communication skills, with the ability to produce clear architectural and operational documentation Desire to continuously learn and stay current with evolving cloud technologies, architecture patterns, and industry best practices ADVANTAGES 5+ years' experience in cloud, platform, DevOps, infrastructure, or software engineering roles 2+ years' experience in a Cloud Architect, Solution Architect, Platform Architect, Senior Cloud Engineer, or similar role Experience with Azure native services such as AKS, App Services, Azure SQL, Azure Storage, Azure Monitor, Key Vault, Azure Policy, and Azure Networking Experience designing and contributing to cloud landing zones and multi subscription or multi environment Azure estates Familiarity with observability and monitoring tools such as Grafana, Prometheus, Azure Monitor, or cloud native equivalents Working knowledge of CI/CD, GitOps workflows, deployment strategies, and release governance Strong automation and scripting skills using Bash, PowerShell, Python, or Go Experience managing cloud costs and implementing cost optimisation strategies Understanding of reliability engineering, availability, scalability, and performance considerations in cloud platforms Experience contributing to technology roadmaps, architecture reviews, or technical governance forums Familiarity with architecture documentation methods such as diagrams, decision records, standards, and design reviews WHAT IS LIFE LIKE AT SMARTSEARCH? We are a multi award winning Tech company with an aspirational mentality Some of our most recent recognitions include: named in the renownedRegTech100 list for 2024, listed in theTop 100 Fasted Growing Tech CompaniesbyNorthern Tech Awards2024as well as being namedTechnology Provider of the YearbyCorporate Finance Awards 2024 We have beenGreat Place To Work Certified since 2022 There are excellent progression opportunities due to our growth and you will have personal development goals, regular feedback and support We are a diverse and inclusive team committed to promoting Diversity & Inclusion and Social Responsibility. Through our DE&I group, charitable initiatives and support for local schools, we actively foster a positive Impact on our community COMPANY BENEFITS 25 days holiday rising to 30 with each year of service Private Medical Insurance covering dental and optical Company pension scheme Life Assurance - 4x your annual salary 1 day paid volunteering per year Enhanced maternity / paternity offerings Employee Assistance Programme Cycle to work scheme Access to a gym

Platform Lead - UK

PLP Group

Platform Lead - UKJob detailsNEXT GATE TECH LIMITEDFull-time About Next Gate Tech At Next Gate Tech, we create technologies that reshape the landscape of the fund industry operations.We empower our clients by capturing the full potential of harmonized data to drive intelligent and fully automated operations. Our transformative solutions optimize processes, enhance efficiency, reduce risks, and drive cost savings for our clients.Driven by our commitment to innovation, our intelligence layer extracts invaluable insights, employs advanced pattern analysis spotting anomalies, and uncovers hidden links within the data.Our modular, one-stop-shop, SaaS platform seamlessly ingests diverse datasets, creating a harmonized and enriched source of portfolios, transactions, and accounting data. This robust foundation fuels the platform to generate powerful signals through intelligent analytics, empowering a multitude of use cases.Next Gate Tech is not just a part of the industry's evolution - it is a driving force behind it.Learn more about us: Our story, values, mission and team: Our unified platform and technology: Our solutions and use cases: About the Role As a key senior technical leader, the Platform Lead will own and scale the technology and infrastructure that powers our financial technology platform. This role is critical to building the technical foundation that enables new products, supports rapid growth, and upholds the trust expected in financial services. They will design a resilient architecture that accelerates product development, and deliver exceptional reliability as we grow.Partnering closely with product and engineering teams, they will combine hands-on building with strategic technical leadership to ensure our platform remains secure, scalable, and fully compliant with financial-industry standards. Responsibilities Develop and communicate a clear, multi-year technical vision and strategic roadmap for the core platform (including infrastructure, data services, internal developer tools, and security) Lead the technical evaluation and decision-making process for platform technologies, balancing in-house development with best-in-class third-party solutions Drive a Site Reliability Engineering (SRE) culture, ensuring high availability, low latency, and robust disaster recovery capabilities Manage and optimize our cloud infrastructure, focusing on Infrastructure-as-Code (e.g., Terraform), containerization (e.g., Kubernetes), and cost optimization Champion an outstanding Internal Developer Platform (IDP) and developer experience, providing tools, automation, and documentation that accelerate feature delivery for the product team Qualifications 7+ years of experience building software, DevOps, or platform/infrastructure systems 3+ years in a senior leadership or Staff/Principal Engineer role, driving key technical projects Direct experience in implementing cost management and optimization strategies within a cloud environment, resulting in demonstrable savings Proven experience working in a highly regulated industry, ideally fintech, banking, or payments, with a deep understanding of security and compliance requirements Expert-level knowledge of modern cloud architecture (e.g. Microservices, Event-Driven Architecture, Serverless), CI/CD pipelines, and cloud provider platforms. Strong hands-on experience with container orchestration (e.g., Kubernetes), Infrastructure-as-Code (e.g., Terraform), Python, and Observability tools (e.g., Prometheus, Grafana, Datadog, Sentry)Benefits 26 vacation days + 2 duvet days, so you can truly recharge and enjoy life Comprehensive health and dental care coverage Central location for both offices with a fully stocked kitchen, including healthy snacks, and fresh fruit Freedom to create your own entrepreneurial experience by being part of a team in search of excellence Professional Development programs to expand your skills, and maximize your potential on the frontier of financial innovation Next Gate Tech is an equal opportunity employer. We believe our team's unique life experiences, backgrounds, cultures, beliefs and abilities add richness to our culture and depth to our ideas. Our ongoing commitment to diversity and inclusion creates an environment that supports, empowers and delivers a sense of belonging for all members of the team. Should you require any accommodation, please inform us and we will work with you to meet your accessibility needs.

05/06/2026

Full time

Platform Lead - UKJob detailsNEXT GATE TECH LIMITEDFull-time About Next Gate Tech At Next Gate Tech, we create technologies that reshape the landscape of the fund industry operations.We empower our clients by capturing the full potential of harmonized data to drive intelligent and fully automated operations. Our transformative solutions optimize processes, enhance efficiency, reduce risks, and drive cost savings for our clients.Driven by our commitment to innovation, our intelligence layer extracts invaluable insights, employs advanced pattern analysis spotting anomalies, and uncovers hidden links within the data.Our modular, one-stop-shop, SaaS platform seamlessly ingests diverse datasets, creating a harmonized and enriched source of portfolios, transactions, and accounting data. This robust foundation fuels the platform to generate powerful signals through intelligent analytics, empowering a multitude of use cases.Next Gate Tech is not just a part of the industry's evolution - it is a driving force behind it.Learn more about us: Our story, values, mission and team: Our unified platform and technology: Our solutions and use cases: About the Role As a key senior technical leader, the Platform Lead will own and scale the technology and infrastructure that powers our financial technology platform. This role is critical to building the technical foundation that enables new products, supports rapid growth, and upholds the trust expected in financial services. They will design a resilient architecture that accelerates product development, and deliver exceptional reliability as we grow.Partnering closely with product and engineering teams, they will combine hands-on building with strategic technical leadership to ensure our platform remains secure, scalable, and fully compliant with financial-industry standards. Responsibilities Develop and communicate a clear, multi-year technical vision and strategic roadmap for the core platform (including infrastructure, data services, internal developer tools, and security) Lead the technical evaluation and decision-making process for platform technologies, balancing in-house development with best-in-class third-party solutions Drive a Site Reliability Engineering (SRE) culture, ensuring high availability, low latency, and robust disaster recovery capabilities Manage and optimize our cloud infrastructure, focusing on Infrastructure-as-Code (e.g., Terraform), containerization (e.g., Kubernetes), and cost optimization Champion an outstanding Internal Developer Platform (IDP) and developer experience, providing tools, automation, and documentation that accelerate feature delivery for the product team Qualifications 7+ years of experience building software, DevOps, or platform/infrastructure systems 3+ years in a senior leadership or Staff/Principal Engineer role, driving key technical projects Direct experience in implementing cost management and optimization strategies within a cloud environment, resulting in demonstrable savings Proven experience working in a highly regulated industry, ideally fintech, banking, or payments, with a deep understanding of security and compliance requirements Expert-level knowledge of modern cloud architecture (e.g. Microservices, Event-Driven Architecture, Serverless), CI/CD pipelines, and cloud provider platforms. Strong hands-on experience with container orchestration (e.g., Kubernetes), Infrastructure-as-Code (e.g., Terraform), Python, and Observability tools (e.g., Prometheus, Grafana, Datadog, Sentry)Benefits 26 vacation days + 2 duvet days, so you can truly recharge and enjoy life Comprehensive health and dental care coverage Central location for both offices with a fully stocked kitchen, including healthy snacks, and fresh fruit Freedom to create your own entrepreneurial experience by being part of a team in search of excellence Professional Development programs to expand your skills, and maximize your potential on the frontier of financial innovation Next Gate Tech is an equal opportunity employer. We believe our team's unique life experiences, backgrounds, cultures, beliefs and abilities add richness to our culture and depth to our ideas. Our ongoing commitment to diversity and inclusion creates an environment that supports, empowers and delivers a sense of belonging for all members of the team. Should you require any accommodation, please inform us and we will work with you to meet your accessibility needs.

Platform Lead: Fintech Architecture & SRE

PLP Group

Platform Lead - UKJob detailsNEXT GATE TECH LIMITEDFull-time About Next Gate Tech At Next Gate Tech, we create technologies that reshape the landscape of the fund industry operations.We empower our clients by capturing the full potential of harmonized data to drive intelligent and fully automated operations. Our transformative solutions optimize processes, enhance efficiency, reduce risks, and drive cost savings for our clients.Driven by our commitment to innovation, our intelligence layer extracts invaluable insights, employs advanced pattern analysis spotting anomalies, and uncovers hidden links within the data.Our modular, one-stop-shop, SaaS platform seamlessly ingests diverse datasets, creating a harmonized and enriched source of portfolios, transactions, and accounting data. This robust foundation fuels the platform to generate powerful signals through intelligent analytics, empowering a multitude of use cases.Next Gate Tech is not just a part of the industry's evolution - it is a driving force behind it.Learn more about us: Our story, values, mission and team: Our unified platform and technology: Our solutions and use cases: About the Role As a key senior technical leader, the Platform Lead will own and scale the technology and infrastructure that powers our financial technology platform. This role is critical to building the technical foundation that enables new products, supports rapid growth, and upholds the trust expected in financial services. They will design a resilient architecture that accelerates product development, and deliver exceptional reliability as we grow.Partnering closely with product and engineering teams, they will combine hands-on building with strategic technical leadership to ensure our platform remains secure, scalable, and fully compliant with financial-industry standards. Responsibilities Develop and communicate a clear, multi-year technical vision and strategic roadmap for the core platform (including infrastructure, data services, internal developer tools, and security) Lead the technical evaluation and decision-making process for platform technologies, balancing in-house development with best-in-class third-party solutions Drive a Site Reliability Engineering (SRE) culture, ensuring high availability, low latency, and robust disaster recovery capabilities Manage and optimize our cloud infrastructure, focusing on Infrastructure-as-Code (e.g., Terraform), containerization (e.g., Kubernetes), and cost optimization Champion an outstanding Internal Developer Platform (IDP) and developer experience, providing tools, automation, and documentation that accelerate feature delivery for the product team Qualifications 7+ years of experience building software, DevOps, or platform/infrastructure systems 3+ years in a senior leadership or Staff/Principal Engineer role, driving key technical projects Direct experience in implementing cost management and optimization strategies within a cloud environment, resulting in demonstrable savings Proven experience working in a highly regulated industry, ideally fintech, banking, or payments, with a deep understanding of security and compliance requirements Expert-level knowledge of modern cloud architecture (e.g. Microservices, Event-Driven Architecture, Serverless), CI/CD pipelines, and cloud provider platforms. Strong hands-on experience with container orchestration (e.g., Kubernetes), Infrastructure-as-Code (e.g., Terraform), Python, and Observability tools (e.g., Prometheus, Grafana, Datadog, Sentry)Benefits 26 vacation days + 2 duvet days, so you can truly recharge and enjoy life Comprehensive health and dental care coverage Central location for both offices with a fully stocked kitchen, including healthy snacks, and fresh fruit Freedom to create your own entrepreneurial experience by being part of a team in search of excellence Professional Development programs to expand your skills, and maximize your potential on the frontier of financial innovation Next Gate Tech is an equal opportunity employer. We believe our team's unique life experiences, backgrounds, cultures, beliefs and abilities add richness to our culture and depth to our ideas. Our ongoing commitment to diversity and inclusion creates an environment that supports, empowers and delivers a sense of belonging for all members of the team. Should you require any accommodation, please inform us and we will work with you to meet your accessibility needs.

04/06/2026

Full time

Platform Lead - UKJob detailsNEXT GATE TECH LIMITEDFull-time About Next Gate Tech At Next Gate Tech, we create technologies that reshape the landscape of the fund industry operations.We empower our clients by capturing the full potential of harmonized data to drive intelligent and fully automated operations. Our transformative solutions optimize processes, enhance efficiency, reduce risks, and drive cost savings for our clients.Driven by our commitment to innovation, our intelligence layer extracts invaluable insights, employs advanced pattern analysis spotting anomalies, and uncovers hidden links within the data.Our modular, one-stop-shop, SaaS platform seamlessly ingests diverse datasets, creating a harmonized and enriched source of portfolios, transactions, and accounting data. This robust foundation fuels the platform to generate powerful signals through intelligent analytics, empowering a multitude of use cases.Next Gate Tech is not just a part of the industry's evolution - it is a driving force behind it.Learn more about us: Our story, values, mission and team: Our unified platform and technology: Our solutions and use cases: About the Role As a key senior technical leader, the Platform Lead will own and scale the technology and infrastructure that powers our financial technology platform. This role is critical to building the technical foundation that enables new products, supports rapid growth, and upholds the trust expected in financial services. They will design a resilient architecture that accelerates product development, and deliver exceptional reliability as we grow.Partnering closely with product and engineering teams, they will combine hands-on building with strategic technical leadership to ensure our platform remains secure, scalable, and fully compliant with financial-industry standards. Responsibilities Develop and communicate a clear, multi-year technical vision and strategic roadmap for the core platform (including infrastructure, data services, internal developer tools, and security) Lead the technical evaluation and decision-making process for platform technologies, balancing in-house development with best-in-class third-party solutions Drive a Site Reliability Engineering (SRE) culture, ensuring high availability, low latency, and robust disaster recovery capabilities Manage and optimize our cloud infrastructure, focusing on Infrastructure-as-Code (e.g., Terraform), containerization (e.g., Kubernetes), and cost optimization Champion an outstanding Internal Developer Platform (IDP) and developer experience, providing tools, automation, and documentation that accelerate feature delivery for the product team Qualifications 7+ years of experience building software, DevOps, or platform/infrastructure systems 3+ years in a senior leadership or Staff/Principal Engineer role, driving key technical projects Direct experience in implementing cost management and optimization strategies within a cloud environment, resulting in demonstrable savings Proven experience working in a highly regulated industry, ideally fintech, banking, or payments, with a deep understanding of security and compliance requirements Expert-level knowledge of modern cloud architecture (e.g. Microservices, Event-Driven Architecture, Serverless), CI/CD pipelines, and cloud provider platforms. Strong hands-on experience with container orchestration (e.g., Kubernetes), Infrastructure-as-Code (e.g., Terraform), Python, and Observability tools (e.g., Prometheus, Grafana, Datadog, Sentry)Benefits 26 vacation days + 2 duvet days, so you can truly recharge and enjoy life Comprehensive health and dental care coverage Central location for both offices with a fully stocked kitchen, including healthy snacks, and fresh fruit Freedom to create your own entrepreneurial experience by being part of a team in search of excellence Professional Development programs to expand your skills, and maximize your potential on the frontier of financial innovation Next Gate Tech is an equal opportunity employer. We believe our team's unique life experiences, backgrounds, cultures, beliefs and abilities add richness to our culture and depth to our ideas. Our ongoing commitment to diversity and inclusion creates an environment that supports, empowers and delivers a sense of belonging for all members of the team. Should you require any accommodation, please inform us and we will work with you to meet your accessibility needs.

Senior Platform Engineer

PayPoint Welwyn Garden City, Hertfordshire

As a Senior Platform Engineer, you will lead the design, evolution, and reliability of the core platform that underpins our engineering ecosystem. You will set technical direction, define standards, and drive best practices that enable product teams to deliver securely, efficiently, and at scale. Your role goes beyond implementation, you will shape the platform strategy, influence architectural decisions, and act as a technical leader across teams. This is a hybrid role with occasional onsite collaboration - typically once every 2-3 months, but you are able to attend our offices as much as you like, at either our Welwyn Garden City or Livepore offices. From time to time, you may also travel between sites. Key responsibilities Owning and evolving the core platform infrastructure (e.g. AKS clusters, networking, API gateways), ensuring scalability, resilience, and operational excellence. Defining and enforcing platform standards, guardrails, and best practices across areas such as security, cost optimisation, reliability, and performance. Driving platform strategy and roadmap in collaboration with engineering leadership, aligning platform capabilities with business objectives. Partnering with product and engineering teams to understand pain points, proactively identify opportunities, and prioritise high impact platform improvements. Leading architecture and design discussions, providing expert guidance on cloud native patterns, distributed systems, and platform capabilities. Designing, implementing, and continuously improving CI/CD and GitOps workflows to enable safe, fast, and repeatable delivery. Championing DevSecOps principles, embedding security and compliance into the software delivery lifecycle. Establishing and improving observability, monitoring, and incident response practices, including vulnerability management and remediation. Mentoring engineers and contributing to a strong engineering culture through knowledge sharing, documentation, and technical leadership. Evaluating and introducing new tools, technologies, and approaches to keep the platform modern, efficient, and competitive. What we would like from you Strong experience in platform engineering, SRE, or DevOps within a distributed cloud environment. Deep expertise in Kubernetes and containerised workloads, ideally in managed environments such as AKS. Proven experience designing and operating scalable, highly available cloud infrastructure. Strong background in CI/CD, automation, and GitOps practices, including building and evolving pipelines at scale. Solid understanding of cloud networking, identity, and security principles, including container and Kubernetes security. Experience with infrastructure as code and configuration management. Strong Linux systems administration knowledge. Demonstrated ability to make pragmatic, risk based decisions that balance business priorities with technical excellence. Experience shaping standards, influencing teams, and driving adoption of platform practices. Excellent communication skills, with the ability to engage both technical and non technical stakeholders. It would be great if you have the following Experience with Helm, Kustomize, and Kubernetes ecosystem tooling. Familiarity with Azure and Azure DevOps. Experience with observability platforms and Kubernetes policy enforcement tools. Proficiency in scripting or programming (e.g. Bash, Python, PowerShell, C#). Experience designing multi region or highly available systems. Relevant certifications in Kubernetes, Azure, or cloud-native technologies. Our benefits if you decide to join us Holiday purchase scheme, with 25 days holiday plus bank holidays as standard. On site gym at our office (Free), and nationwide corporate rate gym membership. Online benefits portal where you can access a range of deals and discounts. Contributory company pension scheme. Progression and development opportunities. Private medical insurance. Electric car scheme. Life assurance of 3 annual gross salary, with the option to purchase additional cover. Discounted rate benefits including critical illness cover, bicycles via the Cycle2Work scheme, dental insurance, and TasteCard dining discount card. Love2shop everyday benefits card. As a disability confident committed company, we have a passion for championing equality. We welcome all colleagues into a work environment where success is attainable for everyone, regardless of disability, age, race, religion, gender identity, or sexual orientation. We are committed to ensuring that everyone has equal access to growth and opportunities in our workplace.

04/06/2026

Full time

As a Senior Platform Engineer, you will lead the design, evolution, and reliability of the core platform that underpins our engineering ecosystem. You will set technical direction, define standards, and drive best practices that enable product teams to deliver securely, efficiently, and at scale. Your role goes beyond implementation, you will shape the platform strategy, influence architectural decisions, and act as a technical leader across teams. This is a hybrid role with occasional onsite collaboration - typically once every 2-3 months, but you are able to attend our offices as much as you like, at either our Welwyn Garden City or Livepore offices. From time to time, you may also travel between sites. Key responsibilities Owning and evolving the core platform infrastructure (e.g. AKS clusters, networking, API gateways), ensuring scalability, resilience, and operational excellence. Defining and enforcing platform standards, guardrails, and best practices across areas such as security, cost optimisation, reliability, and performance. Driving platform strategy and roadmap in collaboration with engineering leadership, aligning platform capabilities with business objectives. Partnering with product and engineering teams to understand pain points, proactively identify opportunities, and prioritise high impact platform improvements. Leading architecture and design discussions, providing expert guidance on cloud native patterns, distributed systems, and platform capabilities. Designing, implementing, and continuously improving CI/CD and GitOps workflows to enable safe, fast, and repeatable delivery. Championing DevSecOps principles, embedding security and compliance into the software delivery lifecycle. Establishing and improving observability, monitoring, and incident response practices, including vulnerability management and remediation. Mentoring engineers and contributing to a strong engineering culture through knowledge sharing, documentation, and technical leadership. Evaluating and introducing new tools, technologies, and approaches to keep the platform modern, efficient, and competitive. What we would like from you Strong experience in platform engineering, SRE, or DevOps within a distributed cloud environment. Deep expertise in Kubernetes and containerised workloads, ideally in managed environments such as AKS. Proven experience designing and operating scalable, highly available cloud infrastructure. Strong background in CI/CD, automation, and GitOps practices, including building and evolving pipelines at scale. Solid understanding of cloud networking, identity, and security principles, including container and Kubernetes security. Experience with infrastructure as code and configuration management. Strong Linux systems administration knowledge. Demonstrated ability to make pragmatic, risk based decisions that balance business priorities with technical excellence. Experience shaping standards, influencing teams, and driving adoption of platform practices. Excellent communication skills, with the ability to engage both technical and non technical stakeholders. It would be great if you have the following Experience with Helm, Kustomize, and Kubernetes ecosystem tooling. Familiarity with Azure and Azure DevOps. Experience with observability platforms and Kubernetes policy enforcement tools. Proficiency in scripting or programming (e.g. Bash, Python, PowerShell, C#). Experience designing multi region or highly available systems. Relevant certifications in Kubernetes, Azure, or cloud-native technologies. Our benefits if you decide to join us Holiday purchase scheme, with 25 days holiday plus bank holidays as standard. On site gym at our office (Free), and nationwide corporate rate gym membership. Online benefits portal where you can access a range of deals and discounts. Contributory company pension scheme. Progression and development opportunities. Private medical insurance. Electric car scheme. Life assurance of 3 annual gross salary, with the option to purchase additional cover. Discounted rate benefits including critical illness cover, bicycles via the Cycle2Work scheme, dental insurance, and TasteCard dining discount card. Love2shop everyday benefits card. As a disability confident committed company, we have a passion for championing equality. We welcome all colleagues into a work environment where success is attainable for everyone, regardless of disability, age, race, religion, gender identity, or sexual orientation. We are committed to ensuring that everyone has equal access to growth and opportunities in our workplace.

Senior Cloud Platform Engineer

Moneycorp

As a Senior Cloud Platform Engineer, you'll take ownership of day to day operations and deliver impactful projects across Azure and IaaS (Windows and Linux). You'll enhance landing zones, build reusable modules, and drive automation to strengthen our cloud platform. Working closely with DevOps and SRE teams, you'll implement secure, reliable, and cost efficient patterns, while mentoring engineers and promoting best practices. Your focus will be high quality execution, ensuring stability, performance, and compliance, while contributing to continuous improvement and collaborating across teams to deliver a scalable, resilient platform that powers business growth and innovation. Key Responsibilities Operational Ownership (BAU) Operate and improve Azure platform services and IaaS workloads across Windows and Linux for stability, performance, and compliance Implement hardening baselines and patch orchestration, and maintain desired state with DSC or Ansible Enforce secure RBAC, Azure Policy, and identity patterns with AAD and PIM across subscriptions and management groups Own observability runbooks and baselines, including alerting, metrics, logs, dashboards, backups, and DR drills to reduce MTTR Administer Windows Server (AD, GPO, IIS) and provide Linux support including systemd, patching, and log management Project Delivery and Engineering Contribute to landing zones and reusable platform modules using Bicep and Terraform Implement secure connectivity per the platform blueprint: hub and spoke or vWAN, Private Endpoints, DNS, and hybrid links via ExpressRoute or VPN Support VMware to Azure migrations from readiness through cutover, rollback, and DR patterns Deliver CI/CD pipeline templates in Azure DevOps or GitHub Actions with policy gates, secrets scanning, and SBOM generation Enable the Internal Developer Platform to support IaC/CaC based self service environment provisioning Security, Reliability & Cost Controls Embed secure by default patterns, integrate Defender and Conditional Access, and shift left security for images and IaC in pipelines Apply SRE practices such as SLOs and error budgets, and codify operability standards for new capabilities Support FinOps guardrails with tagging, budgets, and alerts; analyse usage and implement cost optimisations without impacting SLAs Collaboration, Mentoring and Governance Mentor and coach platform engineers through pairing, PR reviews, runbook creation, and knowledge sharing Partner with DevOps and SRE to standardise container and registry patterns for AKS or ARO, deployments, and environment parity across stages Contribute to technical governance forums, propose incremental improvements, and document decisions and reusable patterns Collaborate with Principals and architecture boards on architectural approvals where required Skills, Qualifications and Experience Required Azure platform operations across enterprise IaaS and PaaS, including landing zones, subscriptions, RBAC, policy, and governance Strong Windows Server administration (AD, GPO, IIS) with practical Linux experience (RHEL/Ubuntu) for broader support Infrastructure as Code with Terraform and/or Bicep, using reusable modules and Git based workflows Configuration as Code with Ansible and/or DSC to maintain hardened, compliant desired state Automation and scripting with PowerShell and Bash, with Python desirable for tooling CI/CD using Azure DevOps or GitHub Actions, including quality gates, secrets/security scanning, and SBOM generation Azure networking fundamentals: VNets, vWAN, ExpressRoute, VPN, Private Endpoints, and DNS, plus hybrid connectivity patterns Containers and Kubernetes exposure (AKS or ARO), image registry practices, and environment provisioning/on demand environments Observability and reliability: monitoring, logging, alerting baselines, SRE concepts (SLOs, error budgets), backup/DR, and patch orchestration Security and compliance: Zero Trust, identity and access management (AAD, PIM), and integration with Defender and vulnerability scanning Cost optimisation using FinOps practices, tagging strategies, budgeting, and guardrails Desirable (not essential) Experience supporting VMware to Azure migration Any experience working with Temenos or similar core banking platforms would be advantageous Education Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience). Desirable (not essential): Relevant Azure certifications - Microsoft Azure Administrator AZ 104, Azure Solutions Architect/Identity/Security (AZ 305/AZ 500), DevOps Engineer Expert (AZ 400), FinOps Certified Practitioner, ITIL 4 Foundation Fostering a culture of belonging and inclusivity We're committed to creating a workplace where every individual feels valued, respected, and included. As an Equal Opportunity Employer, we actively cultivate an inclusive culture where diversity thrives, and we empower our colleagues to drive meaningful change within our organisation through initiatives such as our DE&I focus groups and value champion network. By measuring our efforts through regular assessments and listening to employee feedback, we strive to ensure our initiatives are impactful and responsive to the evolving needs of our workforce. Together, we want to build a workplace where everyone can bring their authentic selves to work, as we believe this is the foundation of innovation, creativity, and collective success.

04/06/2026

Full time

As a Senior Cloud Platform Engineer, you'll take ownership of day to day operations and deliver impactful projects across Azure and IaaS (Windows and Linux). You'll enhance landing zones, build reusable modules, and drive automation to strengthen our cloud platform. Working closely with DevOps and SRE teams, you'll implement secure, reliable, and cost efficient patterns, while mentoring engineers and promoting best practices. Your focus will be high quality execution, ensuring stability, performance, and compliance, while contributing to continuous improvement and collaborating across teams to deliver a scalable, resilient platform that powers business growth and innovation. Key Responsibilities Operational Ownership (BAU) Operate and improve Azure platform services and IaaS workloads across Windows and Linux for stability, performance, and compliance Implement hardening baselines and patch orchestration, and maintain desired state with DSC or Ansible Enforce secure RBAC, Azure Policy, and identity patterns with AAD and PIM across subscriptions and management groups Own observability runbooks and baselines, including alerting, metrics, logs, dashboards, backups, and DR drills to reduce MTTR Administer Windows Server (AD, GPO, IIS) and provide Linux support including systemd, patching, and log management Project Delivery and Engineering Contribute to landing zones and reusable platform modules using Bicep and Terraform Implement secure connectivity per the platform blueprint: hub and spoke or vWAN, Private Endpoints, DNS, and hybrid links via ExpressRoute or VPN Support VMware to Azure migrations from readiness through cutover, rollback, and DR patterns Deliver CI/CD pipeline templates in Azure DevOps or GitHub Actions with policy gates, secrets scanning, and SBOM generation Enable the Internal Developer Platform to support IaC/CaC based self service environment provisioning Security, Reliability & Cost Controls Embed secure by default patterns, integrate Defender and Conditional Access, and shift left security for images and IaC in pipelines Apply SRE practices such as SLOs and error budgets, and codify operability standards for new capabilities Support FinOps guardrails with tagging, budgets, and alerts; analyse usage and implement cost optimisations without impacting SLAs Collaboration, Mentoring and Governance Mentor and coach platform engineers through pairing, PR reviews, runbook creation, and knowledge sharing Partner with DevOps and SRE to standardise container and registry patterns for AKS or ARO, deployments, and environment parity across stages Contribute to technical governance forums, propose incremental improvements, and document decisions and reusable patterns Collaborate with Principals and architecture boards on architectural approvals where required Skills, Qualifications and Experience Required Azure platform operations across enterprise IaaS and PaaS, including landing zones, subscriptions, RBAC, policy, and governance Strong Windows Server administration (AD, GPO, IIS) with practical Linux experience (RHEL/Ubuntu) for broader support Infrastructure as Code with Terraform and/or Bicep, using reusable modules and Git based workflows Configuration as Code with Ansible and/or DSC to maintain hardened, compliant desired state Automation and scripting with PowerShell and Bash, with Python desirable for tooling CI/CD using Azure DevOps or GitHub Actions, including quality gates, secrets/security scanning, and SBOM generation Azure networking fundamentals: VNets, vWAN, ExpressRoute, VPN, Private Endpoints, and DNS, plus hybrid connectivity patterns Containers and Kubernetes exposure (AKS or ARO), image registry practices, and environment provisioning/on demand environments Observability and reliability: monitoring, logging, alerting baselines, SRE concepts (SLOs, error budgets), backup/DR, and patch orchestration Security and compliance: Zero Trust, identity and access management (AAD, PIM), and integration with Defender and vulnerability scanning Cost optimisation using FinOps practices, tagging strategies, budgeting, and guardrails Desirable (not essential) Experience supporting VMware to Azure migration Any experience working with Temenos or similar core banking platforms would be advantageous Education Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience). Desirable (not essential): Relevant Azure certifications - Microsoft Azure Administrator AZ 104, Azure Solutions Architect/Identity/Security (AZ 305/AZ 500), DevOps Engineer Expert (AZ 400), FinOps Certified Practitioner, ITIL 4 Foundation Fostering a culture of belonging and inclusivity We're committed to creating a workplace where every individual feels valued, respected, and included. As an Equal Opportunity Employer, we actively cultivate an inclusive culture where diversity thrives, and we empower our colleagues to drive meaningful change within our organisation through initiatives such as our DE&I focus groups and value champion network. By measuring our efforts through regular assessments and listening to employee feedback, we strive to ensure our initiatives are impactful and responsive to the evolving needs of our workforce. Together, we want to build a workplace where everyone can bring their authentic selves to work, as we believe this is the foundation of innovation, creativity, and collective success.

Senior DevOps Engineer

TMX Group

Senior DevOps EngineerApplylocations: London - 2 Gresham Streettime type: Full timeposted on: Posted 16 Days Agojob requisition id: R-5987The DevOps Engineer would sit within a development team. This person would be part of Operations but work on the requirements for the development team and focus exclusively on helping with product development. The DevOps Engineer would lower the number of handovers between the Development and Operations teams, therefore speeding up cycle times. The DevOps Engineer would work with other DevOps and Platform members to ensure that changes made within their remit don't affect the underlying platform or other planned work and are compliant with all applicable policies. Responsibilities The role is part of the Execution team at Trayport, which is currently undergoing a technology modernisation transformation. The team is transitioning from an on-premises platform to AWS, aiming to make their applications cloud-native. This process takes into account the latency-sensitive nature of the applications. There will be other Stakeholders in the role daily work with Development team members, and InfoSec, to ensure that all changes made to applications and their CI/CD pipelines are designed to security best practices. The Person Must Have Cloud knowledge, AWS preferred or other would be considered Experience with Octopus Deploy, managing deployment pipelines to automate the release process for applications and service Configuring and maintaining TeamCity build servers, creating and optimising CI/CD pipelines Azure DevOps knowledge, Git, creating new and maintaining existing pipelines Knowledge of Kubernetes and Helm, ideally using EKS Good Terraform knowledge Desired Understand Code signing processes and tooling Coding knowledge would be an advantage in PowerShell, C# and C++ Has worked with VMware in the past Knowledge of Observability tooling, Prometheus, Grafana etc Database technologies with high availability The Person Super interested in all things technical You're energized to learn and apply new technologies and skills. An inquisitive nature, excellent analytical, communication and problem-solving skills Working with a DevOps and SRE mindset Excellent scripting skills with knowledge of scripting best practices Understanding of CI/CD systems Collaborative, team player with a positive attitude Able to constructively discuss and persuade the improvements in technology and processes, suggesting both strategic and tactical improvementsTrayport is committed to creating and sustaining a collegial work environment in which all individuals are treated with dignity and respect and one which reflects the diversity of the community in which we operate. We provide accommodations for applicants and employees who require it. About Us Our Culture: At Trayport, our people power our success. We are a place where talented people never stop learning, innovating and working together to make an impact! We offer you more than a job - we offer you the opportunity to work with, and learn from the most respected industry and thought leaders in the business. We're always pushing the boundaries, rapidly expanding our global presence across London, Vienna, Singapore, Bremen and North America. At Trayport, we understand that our people are crucial to our future. We strive to provide a challenging and inspirational atmosphere; employing intelligent, enthusiastic, adaptable individuals and giving them the freedom, training, and guidance to allow them to consistently achieve their potential. If you share our vision and are motivated to challenge the status quo - we want to hear from you!

04/06/2026

Full time

Senior DevOps EngineerApplylocations: London - 2 Gresham Streettime type: Full timeposted on: Posted 16 Days Agojob requisition id: R-5987The DevOps Engineer would sit within a development team. This person would be part of Operations but work on the requirements for the development team and focus exclusively on helping with product development. The DevOps Engineer would lower the number of handovers between the Development and Operations teams, therefore speeding up cycle times. The DevOps Engineer would work with other DevOps and Platform members to ensure that changes made within their remit don't affect the underlying platform or other planned work and are compliant with all applicable policies. Responsibilities The role is part of the Execution team at Trayport, which is currently undergoing a technology modernisation transformation. The team is transitioning from an on-premises platform to AWS, aiming to make their applications cloud-native. This process takes into account the latency-sensitive nature of the applications. There will be other Stakeholders in the role daily work with Development team members, and InfoSec, to ensure that all changes made to applications and their CI/CD pipelines are designed to security best practices. The Person Must Have Cloud knowledge, AWS preferred or other would be considered Experience with Octopus Deploy, managing deployment pipelines to automate the release process for applications and service Configuring and maintaining TeamCity build servers, creating and optimising CI/CD pipelines Azure DevOps knowledge, Git, creating new and maintaining existing pipelines Knowledge of Kubernetes and Helm, ideally using EKS Good Terraform knowledge Desired Understand Code signing processes and tooling Coding knowledge would be an advantage in PowerShell, C# and C++ Has worked with VMware in the past Knowledge of Observability tooling, Prometheus, Grafana etc Database technologies with high availability The Person Super interested in all things technical You're energized to learn and apply new technologies and skills. An inquisitive nature, excellent analytical, communication and problem-solving skills Working with a DevOps and SRE mindset Excellent scripting skills with knowledge of scripting best practices Understanding of CI/CD systems Collaborative, team player with a positive attitude Able to constructively discuss and persuade the improvements in technology and processes, suggesting both strategic and tactical improvementsTrayport is committed to creating and sustaining a collegial work environment in which all individuals are treated with dignity and respect and one which reflects the diversity of the community in which we operate. We provide accommodations for applicants and employees who require it. About Us Our Culture: At Trayport, our people power our success. We are a place where talented people never stop learning, innovating and working together to make an impact! We offer you more than a job - we offer you the opportunity to work with, and learn from the most respected industry and thought leaders in the business. We're always pushing the boundaries, rapidly expanding our global presence across London, Vienna, Singapore, Bremen and North America. At Trayport, we understand that our people are crucial to our future. We strive to provide a challenging and inspirational atmosphere; employing intelligent, enthusiastic, adaptable individuals and giving them the freedom, training, and guidance to allow them to consistently achieve their potential. If you share our vision and are motivated to challenge the status quo - we want to hear from you!

Senior DevSecOps Engineer

慨正橡扯

We're building a secure, cloud-native platform that underpins how software is delivered across the organisation. Following a major digital transformation, our platform enables teams to ship high-quality software quickly, safely, and consistently-by default. As we continue to scale, security, reliability, and developer experience are treated as first-class concerns, designed in from the start. This role sits at the heart of that mission, shaping how security is applied at scale and how engineering teams confidently move from idea to production. About the role As aDevSecOps Engineer, you'll be a hands on contributor to the design, build, and operation of our internal platform. This is adelivery-focused role, working closely with SRE, Cloud, and Application Security teams to embed security controls, guardrails, and best practices directly into tooling, pipelines, and infrastructure. You'll help define how security is applied at scale in a pragmatic, developer-friendly way, influencing engineering culture through code, automation, and clear technical standards-raising the baseline for security and operational excellence across the organisation. The Tech Stack You'll work with a modern, cloud-native platform, including: Cloud & Networking: AWS (multi-account, IAM, VPC, managed services), hybrid/on prem connectivity Containers & Orchestration: Docker, Kubernetes (EKS, ECS) Infrastructure as Code: OpenTofu, Terragrunt, CloudFormation CI/CD: GitLab CI, reusable components, self-hosted runners Security & Identity: Microsoft Entra, AWS IAM, OIDC, secrets management, policy-as-code Observability: Centralised logging, metrics, tracing (e.g. Datadog, OpenTelemetry) Platform Automation: Declarative configuration and infrastructure management Internal Tooling: Developer-facing tools and services built with Python, Go, and modern frontend frameworks Version Control: Git, merge requests, and code review workflows We value strong fundamentals over specific tools-if you understand the principles, you'll thrive here. What You'll Do Design, build, and operate secure cloud and platform capabilities Embed security controls across the software delivery lifecycle by default Build and maintain fast, reliable, secure CI/CD pipelines and reusable components Automate security, compliance, and operational checks Partner with engineering teams to remove friction and improve workflows Contribute to platform architecture, standards, and technical direction Promote ownership, continuous improvement, and pragmatic DevSecOps practice Key Requirements Hands on experience as a DevSecOps Engineer, Platform Engineer, Cloud Security Engineer, or similar role Strong understanding of DevSecOps principles, including CI/CD, infrastructure as code, and security automation Solid experience working in AWS environments Practical knowledge of containerised workloads and Kubernetes Clear communication skills and the ability to work effectively across teams A focus on raising engineering standards through practical, scalable solutions Why Holland & Barrett? You will be joining at a point where the platform is still being actively shaped, with real scope to influence how security and delivery work across the organisation. This role offers autonomy, technical ownership, and the opportunity to build foundational capabilities that directly impact hundreds of engineers. We offer a competitive salary, comprehensive benefits, and flexible working arrangements. If you enjoy building secure platforms that developers actually love, we'd love to hear from you. What we offer Wellbeing & Lifestyle Benefits Health Cash Plan Life Assurance Incentive Scheme - Based on company & personal performance Virtual GP Private Medical care FREE at-home blood test kit Holiday Purchase option Pension Contribution scheme Access to 'Wellhub' with gyms, studios and wellbeing apps Discounts & Savings 25% Colleague Discount with FREE Standard Delivery Exclusive Discounts from a wide range of partners £/€50 Annual Product Allowance to spend in store Learning & Development Access to a variety of learning opportunities, including Level 2-5 Apprenticeships, Workshops and our Digital Learning Library AND MORE! Holland and Barrett is an equal opportunity employer. We welcome diverse perspectives and are committed to creating an inclusive environment for all colleagues. We understand that when our colleagues are listened to, respected and valued for who they are, we build anorganisationwith belonging at its heart - making health and wellness a way of life for everyone.

04/06/2026

Full time

We're building a secure, cloud-native platform that underpins how software is delivered across the organisation. Following a major digital transformation, our platform enables teams to ship high-quality software quickly, safely, and consistently-by default. As we continue to scale, security, reliability, and developer experience are treated as first-class concerns, designed in from the start. This role sits at the heart of that mission, shaping how security is applied at scale and how engineering teams confidently move from idea to production. About the role As aDevSecOps Engineer, you'll be a hands on contributor to the design, build, and operation of our internal platform. This is adelivery-focused role, working closely with SRE, Cloud, and Application Security teams to embed security controls, guardrails, and best practices directly into tooling, pipelines, and infrastructure. You'll help define how security is applied at scale in a pragmatic, developer-friendly way, influencing engineering culture through code, automation, and clear technical standards-raising the baseline for security and operational excellence across the organisation. The Tech Stack You'll work with a modern, cloud-native platform, including: Cloud & Networking: AWS (multi-account, IAM, VPC, managed services), hybrid/on prem connectivity Containers & Orchestration: Docker, Kubernetes (EKS, ECS) Infrastructure as Code: OpenTofu, Terragrunt, CloudFormation CI/CD: GitLab CI, reusable components, self-hosted runners Security & Identity: Microsoft Entra, AWS IAM, OIDC, secrets management, policy-as-code Observability: Centralised logging, metrics, tracing (e.g. Datadog, OpenTelemetry) Platform Automation: Declarative configuration and infrastructure management Internal Tooling: Developer-facing tools and services built with Python, Go, and modern frontend frameworks Version Control: Git, merge requests, and code review workflows We value strong fundamentals over specific tools-if you understand the principles, you'll thrive here. What You'll Do Design, build, and operate secure cloud and platform capabilities Embed security controls across the software delivery lifecycle by default Build and maintain fast, reliable, secure CI/CD pipelines and reusable components Automate security, compliance, and operational checks Partner with engineering teams to remove friction and improve workflows Contribute to platform architecture, standards, and technical direction Promote ownership, continuous improvement, and pragmatic DevSecOps practice Key Requirements Hands on experience as a DevSecOps Engineer, Platform Engineer, Cloud Security Engineer, or similar role Strong understanding of DevSecOps principles, including CI/CD, infrastructure as code, and security automation Solid experience working in AWS environments Practical knowledge of containerised workloads and Kubernetes Clear communication skills and the ability to work effectively across teams A focus on raising engineering standards through practical, scalable solutions Why Holland & Barrett? You will be joining at a point where the platform is still being actively shaped, with real scope to influence how security and delivery work across the organisation. This role offers autonomy, technical ownership, and the opportunity to build foundational capabilities that directly impact hundreds of engineers. We offer a competitive salary, comprehensive benefits, and flexible working arrangements. If you enjoy building secure platforms that developers actually love, we'd love to hear from you. What we offer Wellbeing & Lifestyle Benefits Health Cash Plan Life Assurance Incentive Scheme - Based on company & personal performance Virtual GP Private Medical care FREE at-home blood test kit Holiday Purchase option Pension Contribution scheme Access to 'Wellhub' with gyms, studios and wellbeing apps Discounts & Savings 25% Colleague Discount with FREE Standard Delivery Exclusive Discounts from a wide range of partners £/€50 Annual Product Allowance to spend in store Learning & Development Access to a variety of learning opportunities, including Level 2-5 Apprenticeships, Workshops and our Digital Learning Library AND MORE! Holland and Barrett is an equal opportunity employer. We welcome diverse perspectives and are committed to creating an inclusive environment for all colleagues. We understand that when our colleagues are listened to, respected and valued for who they are, we build anorganisationwith belonging at its heart - making health and wellness a way of life for everyone.

58 jobs found

Modal Window