Browse IT Jobs | IT Job Board

Global DevOps Team Lead (United Kingdom)

CluedIn ApS

Join Our Journey in Transforming Data into Insights

At CluedIn, we're reshaping the future of data management with an Azure-native, graph-based Modern Master Data Management (MDM) platform that simplifies and accelerates how businesses manage their data, making it faster, smarter, and more accessible. Trusted by global industry leaders, our technology enables organisations to unlock the full value of their data and drive meaningful transformation.

As Microsoft's preferred MDM partner and the first solution integrated with Azure OpenAI, we're setting new standards for innovation in this space. Backed by five-star reviews on Gartner Peer Insights and seamless integration with Microsoft's data services, CluedIn delivers powerful, scalable solutions built for the enterprise.

Our team's expertise, passion, and bold ideas drive our innovation. Together, we're reimagining what's possible in data management.

About the Role - Global DevOps Team Lead

We are seeking an experienced, hands-on Global DevOps Team Lead to lead the reliability, scalability, and operational excellence of our cloud platform. This is a leadership role for a technical expert who thrives in modern cloud-native environments and enjoys balancing strategic leadership with hands-on technical ownership. You'll lead our DevOps team while remaining close to the technology, driving infrastructure decisions, improving platform resilience, and shaping how we scale globally.

Reporting directly to the CTO, you will own the operational performance of our Azure-based SaaS platform, champion automation and continuous improvement, and ensure our customers receive a world-class service experience.

What You'll Be Responsible For

Platform Ownership
- Own the performance, availability, security, and scalability of our global Azure infrastructure
- Lead the operational strategy for our Kubernetes-based platform
- Drive platform reliability and ensure we consistently meet or exceed SLA commitments
- Act as the senior escalation point for complex technical and operational issues

Technical Leadership
- Provide hands-on technical leadership across Azure, Kubernetes, networking, observability, and platform operations
- Lead major incident management and post-incident reviews
- Identify and resolve systemic reliability and performance challenges
- Partner with Engineering to improve deployment processes, resilience, and platform architecture
- Drive Infrastructure as Code, automation, and operational efficiency initiatives
- Reduce manual operational overhead through tooling and process improvements
- Establish best practices for monitoring, alerting, capacity planning, and operational readiness
- Build and maintain robust operational documentation and runbooks
- Lead, mentor, and develop a high-performing DevOps and Operations team
- Create a culture of ownership, accountability, continuous learning, and operational excellence
- Support recruitment, onboarding, and development of future team members
- Ensure the team remains aligned with evolving platform and business requirements

Stakeholder Management
- Work closely with Product, Engineering, Customer Success, and Leadership teams
- Provide operational insights, metrics, and recommendations to senior leadership
- Contribute to technology strategy and long-term infrastructure planning
- Manage operational budgets and optimise cloud resource utilisation

What Success Looks Like
- Exceptional platform reliability and service availability
- Consistent achievement of SLA and operational performance targets
- Fast and effective incident response and resolution
- Highly automated, scalable operational processes
- Strong observability and proactive monitoring across the platform
- A capable, engaged, and continuously developing Operations team
- Infrastructure that scales seamlessly alongside business growth

What We're Looking For
- Significant experience operating and managing production cloud environments, ideally within Microsoft Azure
- Deep hands-on experience with Kubernetes in large-scale production environments
- Proven experience leading DevOps, Platform Engineering, Site Reliability Engineering (SRE), or Infrastructure teams
- Strong understanding of cloud architecture, networking, security, performance optimisation, and high availability design
- Experience implementing Infrastructure as Code and automation practices
- Demonstrated success managing complex incidents and driving operational improvements
- Strong commercial awareness and experience balancing reliability, performance, and cost optimisation

Technical Expertise

You will be comfortable acting as the technical authority across:
- Containerisation technologies
- Infrastructure as Code (Terraform, Bicep or similar)
- Monitoring and observability platforms
- Cloud networking and security
- Incident management and operational excellence practices

Personal Attributes
- Strong technical credibility with the ability to lead from the front
- Naturally proactive with a strong sense of ownership
- Comfortable making decisions and taking accountability
- Excellent communicator capable of engaging both technical and non-technical stakeholders
- Passionate about building resilient systems and high-performing teams

Why This Role Matters

This is a key leadership position within our technology organisation. We are looking for someone who will not simply manage DevOps, but who will actively shape the future of our platform, raise technical standards, mentor the team, and play a critical role in our continued growth.

Why Join CluedIn
- Lead the Change - Be at the forefront of the data revolution
- Microsoft-Backed Innovation - Work with cutting-edge Azure technologies
- Grow Continuously - Dedicated learning and development opportunities
- Diverse & Collaborative Team - Innovation thrives in our culture
- Flexible by Design - Remote-first with flexible working
- Eligibility for stock options
- £500 home office budget
- Private healthcare + life insurance + EAP
- 25 days holiday + public holidays
- 5 dedicated training days per year
- Work from anywhere (up to 120 days/year)
- Global team meetups and offsites

CluedIn is constantly working to maintain and improve our inclusive, friendly workplace. We ensure that both applicants and our people receive unbiased treatment without discrimination on the grounds of gender, age, disability, religion, belief, sexual orientation, marital status, race or any other protected characteristic.

We are happy to discuss flexible and agile approaches to working for all our roles. We can't promise we will be able to offer you everything you want or need, but we do promise to discuss it with you openly and honestly.

If you have any reasonable adjustment needs arising from a disability or medical condition to fully participate in the recruitment process, please discuss this with our hiring team.

21/06/2026

Full time

Senior Platform Engineer

StackOne

About StackOne

StackOne is the AI Integration Gateway for SaaS products and AI Agents. Backed by GV and Workday Ventures ($24M raised), we help builders of SaaS platforms and AI Agents orchestrate hundreds of scalable, accurate, and enterprise-grade integrations. Our platform combines 25,000 pre-mapped actions on 200 connectors, an AI-powered integration development toolkit, plus security by design: a real-time architecture, managed authentication and permissions, and end-to-end observability. Join us on our fast trajectory to build the future of agentic integrations.

About the role

We're looking for a Senior Platform Engineer to own how StackOne is built, shipped, and run, as we scale across our own cloud and into our customers' clouds. You'll own the infrastructure behind the platform, our deployment pipeline and developer tooling, and how we package StackOne to run inside customers' own AWS, GCP, or Azure accounts.

It's a hands-on role with broad scope. You write code and tooling, you own the IaC other engineers depend on, and you set the standard for how every new repository gets deployed and secured. You'll report directly to the CTO and work closely with our Security Engineer and tech leads.

Responsibilities
- Own our infrastructure at scale: the AWS estate today (ECS Fargate, Aurora, ElastiCache, MSK, OpenSearch, Lambda, KMS) and the AWS CDK to Terraform migration. Keep it reliable, observable, and cost-aware.
- Build out the deployment pipeline as we consolidate toward a monorepo: the CI/CD that ships every service, plus an automated end-to-end testing harness with incremental (affected-only) testing that stays fast as we grow.
- Ship into customers' own clouds (self-hosted / BYOC): the Terraform modules, container images, runbooks, and documentation for internal teams and customers.
- Own the release and upgrade path for self-hosted customers: versioned, signed releases, a supported-version policy, and a call-home for usage and version reporting.
- Set the standard for new repositories: partner with the Security Engineer and tech leads so new projects (product, internal tools, and vibe-coded prototypes) ship secure and deployable from day one. Templates and golden paths, not gatekeeping.
- Raise reliability: SLOs, observability, and incident response. Make the system easy to operate when something breaks at 3am.
- Treat infrastructure as a product: paved roads and self-serve tooling so product engineers ship without waiting on you.
- Use AI in the workflow: lean on LLMs and agents for IaC generation, test scaffolding, and runbook drafting, with guardrails you trust.

What we're looking for
- 4+ years in platform, infrastructure, SRE, or DevOps, with hands-on AWS at scale: ECS/Fargate, RDS/Aurora, Lambda, networking, IAM, KMS.
- Deep IaC ability: Terraform and/or AWS CDK, comfortable owning modules other teams build on.
- Strong CI/CD and monorepo build experience: caching and incremental/affected-only test execution. You've made a slow pipeline fast.
- Strong coding in TypeScript, Python, or Go. You build tooling, not just YAML and configs.
- Containers in production (Docker); Kubernetes a plus.
- Security-minded: you bake secure defaults into pipelines and repo templates, and partner with security rather than routing around it.
- A clear writer: docs and runbooks that internal engineers and external customers can follow.
- End-to-end ownership: you scope, ship, and measure, and automate instead of running manual checklists.

Nice to have
- Shipping software into customers' own cloud or on-prem (BYOC / self-hosted), or a platform like Nuon, Replicated, or Omnistrate.
- Multi-cloud (GCP or Azure) IaC, and exposure to Temporal, Kafka/MSK, ClickHouse, or OpenSearch.

Our stack
- Cloud & infra: AWS (ECS Fargate, Aurora Postgres, ElastiCache, MSK, OpenSearch, Lambda, S3, KMS, CloudFront, WAF), Cloudflare (Workers, WAF)
- IaC: AWS CDK today, migrating to Terraform
- Data & messaging: Postgres, Redis, Kafka, OpenSearch, ClickHouse
- CI/CD: GitHub Actions
- Observability & analytics: Datadog, Sentry, Metabase
- Languages: TypeScript (Node.js), Python

Benefits
- Meaningful share options (EMI)
- 25 days holiday + 1 additional day per year of tenure
- Private health insurance, including dental & optical
- £15/day London office lunch budget, up to £120/month
- £1,000 home office setup + £500/year top-up
- Annual team offsite to sunny spots
- Join one of Europe's fastest-growing startups
- Work with a veteran team of ex-Google, Microsoft, Oracle, Coinbase, JP Morgan and more
- Health, fitness and gift card discounts; Cycle2Work and Electric Cars scheme
- London (hybrid, 2 days/week) preferred; open to remote within the UK

We believe diversity drives innovation. We encourage individuals from all backgrounds to apply. As an equal opportunity employer, we celebrate diversity and are committed to creating an inclusive environment for all employees.

02/06/2026

Full time

