A mission-driven tech start-up is seeking a Technical Writer to support clear and structured documentation across its platform and infrastructure. The role involves creating user-friendly content for internal teams and customers, documenting systems and workflows, and maintaining documentation repositories. Ideal candidates will have experience in technical writing within software or infrastructure environments and a detail-oriented approach to writing. Join this innovative company to play a critical role in shaping its operations at scale.
23/05/2026
Full time
A mission-driven tech start-up is seeking a Technical Writer to support clear and structured documentation across its platform and infrastructure. The role involves creating user-friendly content for internal teams and customers, documenting systems and workflows, and maintaining documentation repositories. Ideal candidates will have experience in technical writing within software or infrastructure environments and a detail-oriented approach to writing. Join this innovative company to play a critical role in shaping its operations at scale.
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public sector organisations. We are seeking a Technical Writer to support the creation and maintenance of clear, structured documentation across our platform and infrastructure. You will work closely with engineering, platform, and product teams to translate complex technical concepts into accessible content for internal teams and customers. This role is hands on and delivery focused, ideal for someone who can bring structure to fast moving environments and improve how knowledge is captured and shared. Key Responsibilities: Create and maintain technical documentation including platform guides, onboarding materials, runbooks, and operational procedures. Work with engineering and platform teams to document systems, workflows, and APIs. Translate complex infrastructure and platform concepts into clear, user friendly content. Support customer facing documentation such as user guides and knowledge base articles. Maintain and improve documentation repositories (e.g., Confluence, Git based docs, Notion). Apply consistent standards, templates, and formatting across documentation. Keep documentation up to date as systems evolve, ensuring accuracy and usability. In depth experience as a Technical Writer in a software, cloud, or infrastructure environment. Strong ability to understand and explain technical systems (e.g., cloud platforms, Kubernetes, networking fundamentals). Experience working with engineers and product teams to produce documentation. Clear and concise writing style with strong attention to detail. Familiarity with documentation tools such as Markdown, Git, Confluence, or similar. Comfortable operating in a fast paced, evolving environment. One or more would be an advantage: Exposure to AI/ML platforms or GPU based infrastructure. Familiarity with Kubernetes or container based platforms. Experience documenting APIs or developer facing products. Understanding of data centre environments (compute, storage, networking). Experience in a startup or scaling organisation. Why Join Era4: You'll be joining a mission driven start up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next generation company operates at scale.
23/05/2026
Full time
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public sector organisations. We are seeking a Technical Writer to support the creation and maintenance of clear, structured documentation across our platform and infrastructure. You will work closely with engineering, platform, and product teams to translate complex technical concepts into accessible content for internal teams and customers. This role is hands on and delivery focused, ideal for someone who can bring structure to fast moving environments and improve how knowledge is captured and shared. Key Responsibilities: Create and maintain technical documentation including platform guides, onboarding materials, runbooks, and operational procedures. Work with engineering and platform teams to document systems, workflows, and APIs. Translate complex infrastructure and platform concepts into clear, user friendly content. Support customer facing documentation such as user guides and knowledge base articles. Maintain and improve documentation repositories (e.g., Confluence, Git based docs, Notion). Apply consistent standards, templates, and formatting across documentation. Keep documentation up to date as systems evolve, ensuring accuracy and usability. In depth experience as a Technical Writer in a software, cloud, or infrastructure environment. Strong ability to understand and explain technical systems (e.g., cloud platforms, Kubernetes, networking fundamentals). Experience working with engineers and product teams to produce documentation. Clear and concise writing style with strong attention to detail. Familiarity with documentation tools such as Markdown, Git, Confluence, or similar. Comfortable operating in a fast paced, evolving environment. One or more would be an advantage: Exposure to AI/ML platforms or GPU based infrastructure. Familiarity with Kubernetes or container based platforms. Experience documenting APIs or developer facing products. Understanding of data centre environments (compute, storage, networking). Experience in a startup or scaling organisation. Why Join Era4: You'll be joining a mission driven start up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next generation company operates at scale.
Era4 is seeking Automation & AIOps Engineers to build a modern AI Platform Operations function from scratch. This role involves developing autonomous workflows, integrating with observability tools, and designing operational automation. Strong Python skills and experience with monitoring platforms are required, along with occasional visits to the London office. Era4 promotes diversity and aims to create an inclusive work environment, offering high visibility and the opportunity to shape operational standards at a mission-driven start-up.
23/05/2026
Full time
Era4 is seeking Automation & AIOps Engineers to build a modern AI Platform Operations function from scratch. This role involves developing autonomous workflows, integrating with observability tools, and designing operational automation. Strong Python skills and experience with monitoring platforms are required, along with occasional visits to the London office. Era4 promotes diversity and aims to create an inclusive work environment, offering high visibility and the opportunity to shape operational standards at a mission-driven start-up.
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public sector organisations This is a greenfield role, building a modern Agentic approach to Client and Infrastructure Operations . Role Summary: We are seeking Automation & AIOps Engineers who sit at the intersection of Site Reliability Engineering and modern AI driven operations. Embedded within Era4's engineering led Operations Centre, this role exists to build a modern AI Platform Operations function from scratch, designing tooling, and agentic workflows. No legacy to deal with. Runbook Automation & Agent Development: Build agentic, executable workflows capable of triaging, diagnosing, and where appropriate autonomously remediating known failure patterns. Build and maintain LLM backed agents targeting the observability stack, ITSM platform, and infrastructure APIs (e.g. DCIM, IPAM, hypervisor layers). Develop auditable Client focused automations, for Client interactions and workflows, with appropriate controls. Develop safe, auditable automation with appropriate controls for higher risk platform actions. Operational Tooling & Self Service Enablement: Build internal tooling that empowers engineers and service desk analysts: CLI utilities, ChatOps integrations (Slack/Teams bots), status dashboards, and self service automation hooks. Reduce dependency on DevSecOps and engineering teams for routine operational tasks through automation. Maintain and contribute a library of automation assets, agent prompts, and runbook as code artefacts, version controlled and peer reviewed. Develop the automation layer around monitoring and event management: alert suppression logic, enrichment pipelines, correlation rules, and alert to ticket integrations. Continuously tune signal to noise ratios across monitoring tooling (Prometheus, Mimir, Grafana, or equivalent) to improve situational awareness. Design and implement event correlation and deduplication logic to reduce alert storms and improve incident context. Identify common Operational patterns and tasks as candidates for automation; maintain and prioritise a toil reduction backlog. Participate in post incident reviews and translate findings into updated automation, runbooks, or agent logic. Contribute to the evolution of Era4's operational standards, tooling architecture, and agent framework. Technical - Core Element: Strong Python development skills, including scripting for automation, API integration, and data processing. Hands on experience with observability and monitoring platforms: Prometheus, Grafana, Mimir, or equivalent. Experience integrating with ITSM platforms (ServiceNow, Halo, Jira Service Management, or similar) via API. Solid understanding of event driven architectures, message queues, and webhook based automation patterns. Strong understanding of managing GPU infrastructure in production, key signals and metrics and the automation of workflows. Familiarity with Infrastructure as Code principles and cloud native environments (Kubernetes, Terraform, or similar). Technical - Agent & AI: Demonstrable experience building LLM powered agents or automation using frameworks such as LangChain, LlamaIndex, the Anthropic SDK, OpenAI function calling, or comparable tooling. Understanding of agentic design patterns: tool use, structured output, human in the loop controls, and chain of thought reasoning for operational tasks. Comfort operating in an API first environment, integrating agents with infrastructure APIs, DCIM, IPAM, and hypervisor control planes. Operational: Prior experience in an SRE, Senior Operations, or Platform Engineering environment, with exposure to on call operations and incident management processes. Experience in converting narrative runbooks into executable automation or codified decision trees. Understanding of ITIL aligned incident and change management principles and ITSM tooling. One or more would be an advantage: Exposure to data centre or colocation operations, particularly high density compute or GPU infrastructure environments. Experience with ChatOps tooling: building Slack or Microsoft Teams bots for operational workflows. Familiarity with DCIM platforms and telemetry pipelines (power, thermal, network). Knowledge of OpenTelemetry, distributed tracing, or log aggregation platforms (Loki, ELK, Splunk). Contributions to open source observability or automation tooling. Experience in a start up or scale up environment where tooling is built from scratch. Why Join Era4: You'll be joining a mission driven start up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next generation company operates at scale. Diversity & Inclusion Era4 is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Executive & Operations United Kingdom - Hybrid (Occasional visit to London office)
23/05/2026
Full time
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public sector organisations This is a greenfield role, building a modern Agentic approach to Client and Infrastructure Operations . Role Summary: We are seeking Automation & AIOps Engineers who sit at the intersection of Site Reliability Engineering and modern AI driven operations. Embedded within Era4's engineering led Operations Centre, this role exists to build a modern AI Platform Operations function from scratch, designing tooling, and agentic workflows. No legacy to deal with. Runbook Automation & Agent Development: Build agentic, executable workflows capable of triaging, diagnosing, and where appropriate autonomously remediating known failure patterns. Build and maintain LLM backed agents targeting the observability stack, ITSM platform, and infrastructure APIs (e.g. DCIM, IPAM, hypervisor layers). Develop auditable Client focused automations, for Client interactions and workflows, with appropriate controls. Develop safe, auditable automation with appropriate controls for higher risk platform actions. Operational Tooling & Self Service Enablement: Build internal tooling that empowers engineers and service desk analysts: CLI utilities, ChatOps integrations (Slack/Teams bots), status dashboards, and self service automation hooks. Reduce dependency on DevSecOps and engineering teams for routine operational tasks through automation. Maintain and contribute a library of automation assets, agent prompts, and runbook as code artefacts, version controlled and peer reviewed. Develop the automation layer around monitoring and event management: alert suppression logic, enrichment pipelines, correlation rules, and alert to ticket integrations. Continuously tune signal to noise ratios across monitoring tooling (Prometheus, Mimir, Grafana, or equivalent) to improve situational awareness. Design and implement event correlation and deduplication logic to reduce alert storms and improve incident context. Identify common Operational patterns and tasks as candidates for automation; maintain and prioritise a toil reduction backlog. Participate in post incident reviews and translate findings into updated automation, runbooks, or agent logic. Contribute to the evolution of Era4's operational standards, tooling architecture, and agent framework. Technical - Core Element: Strong Python development skills, including scripting for automation, API integration, and data processing. Hands on experience with observability and monitoring platforms: Prometheus, Grafana, Mimir, or equivalent. Experience integrating with ITSM platforms (ServiceNow, Halo, Jira Service Management, or similar) via API. Solid understanding of event driven architectures, message queues, and webhook based automation patterns. Strong understanding of managing GPU infrastructure in production, key signals and metrics and the automation of workflows. Familiarity with Infrastructure as Code principles and cloud native environments (Kubernetes, Terraform, or similar). Technical - Agent & AI: Demonstrable experience building LLM powered agents or automation using frameworks such as LangChain, LlamaIndex, the Anthropic SDK, OpenAI function calling, or comparable tooling. Understanding of agentic design patterns: tool use, structured output, human in the loop controls, and chain of thought reasoning for operational tasks. Comfort operating in an API first environment, integrating agents with infrastructure APIs, DCIM, IPAM, and hypervisor control planes. Operational: Prior experience in an SRE, Senior Operations, or Platform Engineering environment, with exposure to on call operations and incident management processes. Experience in converting narrative runbooks into executable automation or codified decision trees. Understanding of ITIL aligned incident and change management principles and ITSM tooling. One or more would be an advantage: Exposure to data centre or colocation operations, particularly high density compute or GPU infrastructure environments. Experience with ChatOps tooling: building Slack or Microsoft Teams bots for operational workflows. Familiarity with DCIM platforms and telemetry pipelines (power, thermal, network). Knowledge of OpenTelemetry, distributed tracing, or log aggregation platforms (Loki, ELK, Splunk). Contributions to open source observability or automation tooling. Experience in a start up or scale up environment where tooling is built from scratch. Why Join Era4: You'll be joining a mission driven start up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next generation company operates at scale. Diversity & Inclusion Era4 is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Executive & Operations United Kingdom - Hybrid (Occasional visit to London office)
A mission-driven start-up in the UK is seeking a Systems Architect to define the architecture of their Kubernetes-based AI infrastructure. The role involves producing high-level designs, establishing reference architectures, and ensuring system integrations. Candidates should have proven experience in designing scalable, cloud-native solutions and a strong grasp of Infrastructure-as-code. This position offers hybrid work, occasional travel to London, and an opportunity to shape a critical national infrastructure initiative.
22/05/2026
Full time
A mission-driven start-up in the UK is seeking a Systems Architect to define the architecture of their Kubernetes-based AI infrastructure. The role involves producing high-level designs, establishing reference architectures, and ensuring system integrations. Candidates should have proven experience in designing scalable, cloud-native solutions and a strong grasp of Infrastructure-as-code. This position offers hybrid work, occasional travel to London, and an opportunity to shape a critical national infrastructure initiative.
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data-centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public-sector organisations Role Summary We are seeking a Systems Architect to define, design, and govern the architecture of our Kubernetes-based AI infrastructure. This role is responsible for translating business, customer, and regulatory requirements into scalable platform designs and technical specifications that guide platform engineering and operations teams. Work closely with Product Managers, Platform Engineers, ML engineers, and leadership to ensure the platform is secure, performant, resilient, and future-proof, enabling AI workloads at national-scale. This is an opportunity to define the end-to-end architecture of our AI platform, defining how Kubernetes, GPU infrastructure, networking, storage, and automation components integrate into a cohesive system. This is a design authority role, responsible for: Technical standards Long-term scalability and evolution Provide clear specifications and boundaries that enable Platform Engineers to build and operate the platform effectively. Key Responsibilities Produce high-level and low-level designs (HLDs/LLDs) for platform components. Establish reference architectures for multi-tenant AI workloads. Specify system and integrations across Kubernetes, IAM, PaaS, Observability and automation, ITSM, Security, Billing - the complete cloud service provider stack. Define solutions for multi-tenant SaaS offerings for GPU cloud related offerings. Scalability, Reliability & Performance Provide inputs for SLOs/SLAs across: Infrastructure, SaaS services, Storage services. Architect: Capacity planning (compute + storage), Performance optimisation across workloads. Proven results designing scaled end to end systems (not just components, or end services).An expert of infrastructure, cloud, or platform related Architecture/Engineering. Exceptional experience designing Kubernetes and cloud native solutions at scale. Commercial aptitude to enable performance, cost, time trade-offs and evolutions. Proven experience defining target architectures and technology roadmaps. Strong understanding of Infrastructure-as-code, automation, GitOps. Experience of Open Source scale-out storage/object stores (Ceph). One or more would be an advantage Contribution to open-source projects in the IaaS/PaaS space. Experience of building and consuming significant cloud services. Commercial success of turning technology into a commercially successful product offering. AI/ML System design and consumption practises. Why Join Era4 You'll be joining a mission-driven start-up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next-generation company operates at scale. Diversity & Inclusion Era4 is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Technology United Kingdom, Hybrid - (Occasional travel to London office)
22/05/2026
Full time
Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data-centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public-sector organisations Role Summary We are seeking a Systems Architect to define, design, and govern the architecture of our Kubernetes-based AI infrastructure. This role is responsible for translating business, customer, and regulatory requirements into scalable platform designs and technical specifications that guide platform engineering and operations teams. Work closely with Product Managers, Platform Engineers, ML engineers, and leadership to ensure the platform is secure, performant, resilient, and future-proof, enabling AI workloads at national-scale. This is an opportunity to define the end-to-end architecture of our AI platform, defining how Kubernetes, GPU infrastructure, networking, storage, and automation components integrate into a cohesive system. This is a design authority role, responsible for: Technical standards Long-term scalability and evolution Provide clear specifications and boundaries that enable Platform Engineers to build and operate the platform effectively. Key Responsibilities Produce high-level and low-level designs (HLDs/LLDs) for platform components. Establish reference architectures for multi-tenant AI workloads. Specify system and integrations across Kubernetes, IAM, PaaS, Observability and automation, ITSM, Security, Billing - the complete cloud service provider stack. Define solutions for multi-tenant SaaS offerings for GPU cloud related offerings. Scalability, Reliability & Performance Provide inputs for SLOs/SLAs across: Infrastructure, SaaS services, Storage services. Architect: Capacity planning (compute + storage), Performance optimisation across workloads. Proven results designing scaled end to end systems (not just components, or end services).An expert of infrastructure, cloud, or platform related Architecture/Engineering. Exceptional experience designing Kubernetes and cloud native solutions at scale. Commercial aptitude to enable performance, cost, time trade-offs and evolutions. Proven experience defining target architectures and technology roadmaps. Strong understanding of Infrastructure-as-code, automation, GitOps. Experience of Open Source scale-out storage/object stores (Ceph). One or more would be an advantage Contribution to open-source projects in the IaaS/PaaS space. Experience of building and consuming significant cloud services. Commercial success of turning technology into a commercially successful product offering. AI/ML System design and consumption practises. Why Join Era4 You'll be joining a mission-driven start-up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy, and the chance to shape how a next-generation company operates at scale. Diversity & Inclusion Era4 is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Technology United Kingdom, Hybrid - (Occasional travel to London office)