Linux Systems Operations Engineer - AWS Hybrid - Sandbach (3 days onsite) Competitive + bonus & benefits Applause IT are working with a large, product-led technology business to hire a Linux Systems Operations Engineer into their growing Technology & Data function. This is a hands-on infrastructure role focused on Linux, AWS and automation, supporting both internal platforms and customer-facing systems at scale. You'll sit within a collaborative infrastructure team, working closely with DevOps and Software Engineering to keep platforms secure, scalable and reliable. The role You'll take ownership across the Linux estate while supporting and improving cloud infrastructure in AWS. Day-to-day, you'll be: Supporting and maintaining a Linux server environment (Ubuntu, Apache/Nginx, MySQL/Postgres) Supporting and developing AWS infrastructure (EC2, ECS, Lambda, VPC, Route53, S3, RDS, CloudWatch) Automating builds, scaling, patching and monitoring Working alongside DevOps and Engineering teams to support delivery Improving resilience, security and operational efficiency Documenting systems and following structured change management Managing SLAs and balancing competing priorities This is a role with real scope to shape how infrastructure is run, not just keep the lights on. What we're looking for We're open to strong mid-level through to senior engineers. You don't need everything below, but experience in most areas is important: Strong Linux systems experience Solid exposure to AWS cloud environments Experience with automation / configuration tools (Terraform, Ansible, Puppet or similar) Scripting skills in Python and/or Bash Working knowledge of Windows Server / Active Directory Good understanding of security fundamentals Comfortable working in a change-controlled, production environment Benefits Annual bonus Pension & life assurance Electric vehicle & fuel schemes Private wellbeing & healthcare support Cycle to work scheme Interested? 
If you're a Linux / AWS-focused infrastructure engineer looking for a role with ownership, stability and room to grow, we'd love to talk. Apply now or contact Applause IT for a confidential discussion.
15/04/2026
Full time
Senior Network Engineer Hybrid - Palo Alto - F5 Load Balancers This is a senior-level engineering role focused on enterprise network design, implementation, security and modernisation. The successful candidate will take ownership of complex infrastructure initiatives, working closely with architecture teams and senior stakeholders to deliver projects from concept through to full implementation. The position requires strong technical leadership, hands-on engineering expertise, and the ability to manage multiple workstreams in a large, regulated enterprise environment. With multiple positions available, there is a focus on Palo Alto or F5 Load Balancers experience. Key Responsibilities Implement new network and network security technologies as defined by enterprise architecture. Build, configure and test network infrastructure solutions across on-prem and cloud environments. Contribute to the research and recommendation of innovative technologies to improve performance, resilience and scalability. Engineer solutions using enterprise blueprints and standards. Design and implement resilient architectures with disaster recovery and business continuity in mind. Work with technologies including switches, routers, firewalls, wireless platforms, SDN fabrics, load balancers, NAC and cloud networking components. Provide Tier 3 engineering support for complex incidents and escalations. Participate in a 24x7 on-call rotation as required. Produce and maintain detailed network documentation using Microsoft Visio. Maintain and continuously improve network security posture in line with regulatory frameworks including PCI-DSS, PII, CIS and NIST. Required Experience & Skills 5-7+ years of experience designing, implementing and supporting medium to large enterprise networks (10,000+ users). Palo Alto Firewall platforms (PAN-OS, Threat Prevention, User-ID, GlobalProtect, HA, Prisma Access). F5 BIG-IP platforms (LTM, GTM, APM, ASM/Cloud WAF). 
Strong hands-on experience with Cisco enterprise technologies. CCNP Enterprise (R&S) level knowledge required. 1-2 years' experience designing and supporting data centre spine-leaf fabrics (Cisco/Arista). Experience with Cisco DNA Center. Experience with SD-WAN technologies (Cisco, Palo Alto ION). Enterprise-scale Cisco Wireless experience (WLC, FlexConnect, CAPWAP). Desirable Experience Remote access VPN technologies. Certificate life cycle management (Venafi, PKI). NAC solutions (Cisco ISE, Forescout). Infoblox DNS/IPAM. Cloud networking design and security principles. Automation and scripting (Python, Ansible). Network monitoring tools such as SevOne, SolarWinds, Datadog or Splunk. Knowledge of network security architecture, IDS/IPS, VPNs and SSL technologies. Senior Network Engineer Due to the volume of applications received for positions, it will not be possible to respond to all applications and only applicants who are considered suitable for interview will be contacted. Proactive Appointments Limited operates as an employment agency and employment business and is an equal opportunities organisation. We take our obligations to protect your personal data very seriously. Any information provided to us will be processed as detailed in our Privacy Notice, a copy of which can be found on our website.
14/04/2026
Full time
Site Reliability Engineer (SRE) - Active SC required! Up to £55,000 + benefits Hybrid (UK-based) We're looking for a Site Reliability Engineer to join a growing technology team delivering highly scalable, resilient systems across a range of enterprise environments. This is a fantastic opportunity for someone with a solid foundation in DevOps/SRE practices who wants to deepen their expertise in automation, reliability, and cloud-native technologies. What you'll be doing: Supporting the reliability, availability, and performance of production systems Monitoring applications and infrastructure, responding to incidents and driving resolution Automating manual processes to improve efficiency and reduce risk Collaborating with engineering teams to improve system design and resilience Contributing to CI/CD pipelines and infrastructure-as-code practices What we're looking for: Experience in an SRE, DevOps, or similar engineering role Knowledge of cloud platforms (AWS, Azure, or GCP) Familiarity with monitoring/logging tools (e.g. Prometheus, Grafana, ELK) Scripting or programming skills (e.g. Python, Bash, Go) Understanding of containers and orchestration (Docker/Kubernetes is a plus) Why apply? Work with modern, cloud-native technologies Supportive environment with strong learning and development opportunities Clear progression path into senior SRE roles
01/04/2026
Full time
The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation's growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments. This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale. In this position you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, the SRE will identify priorities to remove friction from engineering teams, streamline processes, and enhance operational excellence. This role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions. Key Responsibilities Reliability, Performance & Security: Design and implement strategies to improve system reliability, availability, and security. Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment. Conduct regular security reviews and collaborate with security teams to address vulnerabilities. CI/CD Management: Own and optimise Continuous Integration and Continuous Deployment pipelines. 
Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows. Ensure secure, efficient, and automated deployment processes across environments. Monitoring & Observability: Implement and maintain monitoring solutions for infrastructure and applications. Develop dashboards and alerting systems to ensure proactive incident and security event management. Evaluate and integrate new observability tools as needed. Automation & Tooling: Automate repetitive tasks to improve efficiency and reduce human error. Build and maintain internal tools that support engineering productivity and security compliance. Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates. Cloud Infrastructure Management: Manage and optimise services across AWS and Azure environments. Ensure scalability, resilience, and security of service-based architectures. Implement cost management strategies to optimise cloud spend without compromising performance or security. Incident Response & Root Cause Analysis: Lead incident response efforts, including security incidents, and conduct post-mortem reviews. Drive continuous improvement through lessons learned and preventive measures. Skills & Experience Proven experience in AWS and Azure cloud environments. Strong background in CI/CD tools (e.g., Azure DevOps, Pipelines, GitHub Actions, Jenkins). Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog). Proficiency in scripting and automation (Python, Bash, PowerShell). Familiarity with containerisation and orchestration (Docker, Kubernetes). Solid understanding of networking, security, and cost optimisation in cloud environments. Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks. A problem-solver with a proactive mindset. Comfortable working in fast-paced, evolving environments. Strong communicator who can bridge gaps between operations, development, and security teams. 
Passionate about automation, scalability, cost efficiency, and security.
01/04/2026
Full time
SF Technology are supporting a high-growth, digitally led UK business looking to appoint an AI Evangelist to lead the next phase of its technology and AI capability. This is a leadership role where you will define and deliver the organisation's AI and engineering strategy, while remaining close enough to the technology to guide architecture, development standards and infrastructure decisions. You will lead a capable in-house engineering team responsible for the platforms, data pipelines and infrastructure that power a high-volume digital environment. Alongside this, you will drive the adoption of AI, machine learning and advanced data capabilities across core commercial and operational functions. The role sits within the leadership team and will play a key part in shaping the company's wider digital, product and innovation strategy. Key responsibilities - Define and deliver a scalable AI and technology roadmap aligned to business growth - Lead, mentor and grow a high-performing engineering and data team - Oversee platform architecture, infrastructure, security and operational resilience - Drive the development and deployment of AI and machine learning models into live business workflows - Ensure engineering best practice across software development, testing and deployment - Translate commercial objectives into robust, scalable technical solutions - Work closely with senior leadership to influence digital strategy, investment and innovation Experience required - Proven experience operating in an Engineering, AI related role - you will have managed and led small teams - Strong software engineering background, ideally with hands-on experience in Python-based development environments - Experience building and scaling data platforms, machine learning pipelines or AI-driven applications - Demonstrable experience deploying AI models into production environments rather than purely experimental use cases - Experience leading engineering teams responsible for high 
availability platforms, APIs and complex backend systems - Strong understanding of cloud or hybrid infrastructure, containerisation and scalable architecture patterns - Experience embedding data, analytics and AI into real commercial decision making (pricing, automation, optimisation, fraud detection, personalisation etc.) - Ability to operate at both strategic and technical depth, influencing senior stakeholders while maintaining engineering credibility - Experience leading engineering teams, with the ability to challenge developers effectively This is an opportunity to shape and scale an AI-driven engineering capability within a highly successful digital business, with real influence over technology direction and innovation. Birmingham (office based, 4 days a week onsite)
31/03/2026
Full time
Type: Full-time, Permanent The Opportunity We're recruiting on behalf of a leading organisation undergoing a major digital transformation. This is a hands-on, senior engineering role for someone who thrives on solving complex data challenges, building scalable platforms, and integrating operational systems across a diverse business landscape. You'll work closely with stakeholders in Logistics, Operations, Finance, and Compliance to modernise data infrastructure, automate workflows, and embed AI into BI and operational processes. If you're ready to take ownership of high-impact projects and shape the future of data in logistics, this is the role for you. What You'll Be Doing Data Platform & BI Engineering Architect and implement cloud-native data platforms (AWS S3, Glue, Athena, Redshift, QuickSight). Build reliable, governed data pipelines with CI/CD and infrastructure as code. Design dimensional models and deliver robust SQL/Python transformations. Systems Integration & Application Support Provide expert-level support for transport, warehouse, and fleet systems (TMS/WMS/FMS). Develop and maintain integrations using REST/SOAP APIs, EDI (XML/JSON), and flat-file interfaces. Implement observability, error-handling, and retry logic for mission-critical interfaces. Automation & Process Improvement Replace manual, spreadsheet-driven processes with governed datasets and internal tools. Build lightweight portals, scripts, and APIs to streamline business workflows. AI & Advanced Analytics Integrate AI services into BI dashboards and operational workflows (eg, anomaly detection, natural language Q&A). Implement semantic search and intelligent alerting using AWS Bedrock or Azure equivalents. Security, Governance & Resilience Enforce least-privilege access, RBAC, and secrets management. Apply data governance across AWS/Microsoft estates and contribute to DR strategies. What You'll Bring Essential Experience 5+ years in SQL (T-SQL), Python, and BI/data platform engineering. 
Strong hands-on experience with AWS analytics stack and Power BI. Proven track record in designing and deploying production-grade ETL/ELT pipelines. Experience supporting and integrating operational systems (TMS/WMS/FMS). Solid understanding of data modelling, performance tuning, and infrastructure as code. Desirable Skills & Certifications AWS or Microsoft certifications (eg, Data Analytics Speciality, DP-203, PL-300). Experience with Azure Data Factory, Kafka/Kinesis, or message brokers. Familiarity with LLMs (eg, Claude, Azure OpenAI) and vector databases. Why You Should Apply Be part of a company driving innovation and sustainability in logistics. Lead and deliver high-impact digital transformation initiatives. Work in a collaborative, forward-thinking environment. Competitive salary and benefits, with professional development opportunities. If you would like more information or some career advice, please do not hesitate to reach out directly. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found on our website.
06/10/2025
Full time
Type: Full-time, Permanent The OpportunityWe're recruiting on behalf of a leading organisation undergoing a major digital transformation. This is a hands-on, senior engineering role for someone who thrives on solving complex data challenges, building scalable platforms, and integrating operational systems across a diverse business landscape. You'll work closely with stakeholders in Logistics, Operations, Finance, and Compliance to modernise data infrastructure, automate workflows, and embed AI into BI and operational processes. If you're ready to take ownership of high-impact projects and shape the future of data in logistics, this is the role for you. What You'll Be DoingData Platform & BI Engineering Architect and implement cloud-native data platforms (AWS S3, Glue, Athena, Redshift, QuickSight). Build reliable, governed data pipelines with CI/CD and infrastructure as code. Design dimensional models and deliver robust SQL/Python transformations. Systems Integration & Application Support Provide expert-level support for transport, warehouse, and fleet systems (TMS/WMS/FMS). Develop and maintain integrations using REST/SOAP APIs, EDI (XML/JSON), and flat-file interfaces. Implement observability, error-handling, and retry logic for mission-critical interfaces. Automation & Process Improvement Replace manual, spreadsheet-driven processes with governed datasets and internal tools. Build lightweight portals, scripts, and APIs to streamline business workflows. AI & Advanced Analytics Integrate AI services into BI dashboards and operational workflows (eg, anomaly detection, natural language Q&A). Implement semantic search and intelligent alerting using AWS Bedrock or Azure equivalents. Security, Governance & Resilience Enforce least-privilege access, RBAC, and secrets management. Apply data governance across AWS/Microsoft estates and contribute to DR strategies. What You'll BringEssential Experience 5+ years in SQL (T-SQL), Python, and BI/data platform engineering. 
Strong hands-on experience with the AWS analytics stack and Power BI. Proven track record in designing and deploying production-grade ETL/ELT pipelines. Experience supporting and integrating operational systems (TMS/WMS/FMS). Solid understanding of data modelling, performance tuning, and infrastructure as code. Desirable Skills & Certifications AWS or Microsoft certifications (eg, Data Analytics Specialty, DP-203, PL-300). Experience with Azure Data Factory, Kafka/Kinesis, or message brokers. Familiarity with LLMs (eg, Claude, Azure OpenAI) and vector databases. Why You Should Apply Be part of a company driving innovation and sustainability in logistics. Lead and deliver high-impact digital transformation initiatives. Work in a collaborative, forward-thinking environment. Competitive salary and benefits, with professional development opportunities. If you would like more information or some career advice, please do not hesitate to reach out directly. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found on our website.
IT Security & Resilience Specialist - Permanent, London (Hybrid) Our client, a global law firm, is seeking an experienced IT Security & Resilience Specialist to join their IT Infrastructure Engineering team. This is a hands-on, process-driven role focused on ensuring disaster recovery, failover, and operational resilience capabilities are robust, tested, and continuously improved. Key Responsibilities: Plan and execute DR and resilience tests, capture evidence, and report findings. Maintain and improve DR runbooks, procedures, and documentation. Collaborate with Cyber Security, BCP, and Infrastructure teams to mitigate risks and vulnerabilities. Automate testing and evidence collection using scripting or orchestration tools. Produce clear technical updates and dashboards for stakeholders. Candidate Profile: Hands-on experience in disaster recovery, failover testing, and operational resilience. Solid understanding of ISO27001, ISO22301, NIST frameworks, and control evidence. Experience with hyperconverged and hybrid cloud infrastructure (Nutanix, VMware, Commvault, Azure). Skilled in scripting (PowerShell or Python) and infrastructure tooling. Knowledge of vulnerability management, monitoring, and automation platforms. Relevant certifications such as SC-200, CEH, CBCP/CBCI desirable. Why Join: Join a global, high-performing technology team. Take ownership of resilience and security initiatives impacting key IT systems. Hybrid working (3 days per week on site). Ready to take the next step? Apply now and be part of a team driving operational excellence.
06/10/2025
Full time
People Source Consulting Ltd
Manchester, Lancashire
DevOps Engineer - Defence & National Security Location: Manchester city centre (Hybrid) + North West client sites Salary: £60,000 - £90,000 per annum Security: SC required to start, must be willing to obtain DV Are you a DevOps engineer looking for work that makes a real difference? We're expanding a specialist team in Manchester and are looking for DevOps professionals from all technical backgrounds who want to apply their skills to impactful projects in Defence and National Security. You'll play a key role in building and running secure, scalable platforms that support mission-critical services. We welcome engineers with different tech stack experience - what matters most is your passion for automation, reliability, and problem solving in a collaborative environment. What you'll do Design and implement CI/CD pipelines and automated deployments. Build and manage cloud-native and containerised environments. Apply Infrastructure-as-Code, monitoring and Site Reliability Engineering principles to ensure resilience and performance. Collaborate with developers, testers, and client stakeholders to deliver end-to-end solutions. Share knowledge, contribute to a learning culture, and help shape the direction of a growing practice. What we're looking for Hands-on experience in DevOps engineering, regardless of stack (eg AWS, Azure, GCP, Kubernetes, Docker, Jenkins, GitLab CI/CD). Strong understanding of automation and modern software delivery practices. Experience working in Agile teams. Curiosity, adaptability and eligibility for UK National Security vetting at DV level. Why this role? Work only on high-impact, mission-critical Defence projects. Join at the ground floor of a growing team, with real scope for influence and progression. Hybrid flexibility in a modern city-centre location. 
If you want to grow your DevOps career while contributing to work of real national importance, we'd love to hear from you. People Source Consulting Ltd is acting as an Employment Agency in relation to this vacancy. People Source specialise in technology recruitment across niche markets including Information Technology, Digital TV, Digital Marketing, Project and Programme Management, SAP, Digital and Consumer Electronics, Air Traffic Management, Management Consultancy, Business Intelligence, Manufacturing, Telecoms, Public Sector, Healthcare, Finance and Oil & Gas.
02/10/2025
Full time
Head Resourcing is delighted to be partnering with one of the UK's leading retail banks to recruit an experienced Senior DevOps Engineer for their Leeds-based team. The Opportunity This is a fantastic chance to join a forward-thinking digital engineering team where you'll focus on improving and maintaining the tools and processes that support continuous integration, automated testing, and software delivery pipelines. The role is highly collaborative and hands-on, with a strong emphasis on automation, resilience, and enabling faster, more reliable software releases. As well as technical delivery, you'll play an important part in championing DevOps culture, helping teams adopt agile practices, embedding continuous improvement, and sharing expertise across the organisation. Key Responsibilities Design, develop, and enhance CI/CD pipelines and release frameworks. Provide operational support and optimisation for applications running in cloud environments (public and private). Build and manage solutions with containerisation and orchestration (Docker, Kubernetes, service mesh). Drive automation and scalability through infrastructure as code (Terraform, CloudFormation). Contribute across the full software lifecycle, from planning through to production. Mentor colleagues and act as a point of leadership in modern engineering practices. Work closely with cross-functional teams to solve complex technical challenges. What We're Looking For Strong background in cloud platforms such as GCP, Azure, AWS, or OCP. Hands-on experience with DevOps toolchains (Jenkins, Nexus, SonarQube, Git, Maven). Solid programming ability, ideally in Java or JavaScript, though Python, Golang, or Rust are also valuable. Experience with containers and orchestration frameworks. Demonstrated ability to mentor or lead within technical teams. A collaborative approach, with an interest in driving cultural and process improvements. 
Salary: Depending on experience. Pension. Discretionary bonus. This role is hybrid, with 2 days per week onsite in Leeds. If this sounds like you, we would love to hear from you.
01/10/2025
Full time
LA International Computer Consultants Ltd
Sheffield, Yorkshire
GCP Cloud Engineer 2 Month contract initially Based: Hybrid/Sheffield or Birmingham or Edinburgh (Max 3 days p/w onsite) Rate: £Market rates p/d (via Umbrella company) We have a great opportunity with a world leading organisation where you will be provided with all of the support and development to succeed. A progressive organisation where you can really make a difference. We have an excellent opportunity for a seasoned GCP Cloud Engineer to join the team and aid in the development of services on Public Cloud Platforms. Utilise your Cloud Engineering expertise and DevOps skills across GCP to deploy and configure robust Back End services, automate infrastructure, and employ CSP native services. This role offers the chance to work on impactful systems within a secure, high-availability setting at a leading global financial institution. Key Responsibilities: * Deploying, configuring and securing Back End REST API services using CSP native services. * Deploying, configuring and securing containerised application runtimes using Infrastructure as Code. * Building and maintaining CI/CD pipelines in collaboration with DevOps and Security teams, focusing on traceability and regulatory controls. * Managing, monitoring, and optimising cloud infrastructure across GCP, ensuring performance, resilience, cost-efficiency, and data security. * Collaborating closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. * Supporting live systems, conducting root cause analysis, fixing bugs and implementing solutions for incidents and performance bottlenecks. Key Skills & Experience: * A background in Cloud Engineering with infrastructure experience. * Over 5 years of development experience, focusing on large-scale, distributed systems. * Hands-on experience with GCP, including CSP native services, networking, IAM, databases (PostgreSQL) and cost optimization. 
Experience with other cloud providers such as AWS is advantageous. * Proven experience with DevOps practices, including Infrastructure as Code (eg, Terraform), CI/CD tools (eg, Jenkins, GitLab CI), and containerization. * A strong understanding of security principles in cloud and enterprise systems. * Familiarity with audit and compliance considerations in regulated industries, particularly finance or banking. * Excellent written and verbal communication skills, with the ability to convey complex information effectively to diverse audiences. * A successful track record of delivering complex projects and/or programmes, using appropriate techniques and tools to ensure and measure success. Essential Skills * Demonstrable experience of: o Public Cloud. o Infrastructure build and configurations for services including Compute, Storage, Networking. o Linux. o Relational and NoSQL databases. o Integration services such as messaging and streams. o Building RESTful API Services. o Containerisation, Kubernetes, serverless functions. o Microservices and distributed tracing. o Enterprise logging, monitoring, and alerting frameworks (eg, ELK, Splunk, Prometheus, Grafana). o Automation scripting (using tools such as Terraform, Ansible etc.). * Experience with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. * Experience working within an Agile environment. * A good understanding of cryptography (authentication, data encryption). * The ability to quickly acquire new skills and tools. * Good non-functional testing experience. This is an excellent opportunity on a great project of work. If you are looking for your next exciting opportunity, apply now for your CV to reach me directly; we will respond as soon as possible. 
LA International is a HMG approved ICT Recruitment and Project Solutions Consultancy, operating globally from the largest single site in the UK as an IT Consultancy or as an Employment Business & Agency depending upon the precise nature of the work, for security cleared jobs or non-clearance vacancies. LA International welcome applications from all sections of the community and from people with diverse experience and backgrounds. Award Winning LA International, winner of the Recruiter Awards for Excellence, Best IT Recruitment Company, Best Public Sector Recruitment Company and overall Gold Award winner, has now secured the most prestigious business award that any business can receive, The Queen's Award for Enterprise: International Trade, for the second consecutive period.
01/10/2025
Contractor
SRE & DevOps Engineer - Inside IR35 - 6 months - Leeds - Hybrid Working Are you a Site Reliability Engineer with extensive experience within DevOps? Would you like to be part of one of the UK's largest transformation projects? Can you turn your hand to ensuring resilience, observability, and release automation? Have you worked on building tooling and complex automation across multiple teams? Illuminet are currently working with a retail client whose role will entail operational work such as handling escalations, being on-call to respond to production issues, and fixing problems. Secondly, your focus will be on automation. The expectation is that this will be done hands-on, writing code. As an SRE engineer you will be focused on building tooling and automation across the various parts of the e-commerce platform to ensure it maintains its service level objectives. The Role Debug production issues across services and technology stack. Consult on new cloud patterns; improving system resilience, performance and stability. Support Prod deployments, pipeline engineering and maintenance & build failures (squads responsible for releasing their own code/packages) Platform Ops - L2 quick fixes (restart env/jobs, check monitoring dashboard, reset access etc) Monitoring & Observability - Configure & extend monitoring, egress/ingress data lake (consolidation of Azure Integration Services, 3rd party events), Business Monitoring Automation & Self-healing (incl. event based triggers) Ensuring consistency of technology usage across a programme, by continuously reviewing existing toolsets and code and suggesting re-use of components. Ensuring system SLOs/SLIs and performance are monitored and alerted on. 
About You Software engineering background Hands-on experience designing, building, delivering and operating production-grade software at scale Experience with troubleshooting distributed systems Strong opinions informed by experience of continuous delivery, distributed architectures, testing, everything-as-code, containerisation, orchestration, cloud services and incident response Comfortable having in-depth discussions, troubleshooting and debugging systems and reading/writing code Experience working within an Agile environment Experience with enterprise APM monitoring tools Working knowledge of system architectures and networking Salesforce experience Azure cloud experience Experience with CI/CD tooling eg code quality, security, accessibility, testing framework integration Worked as a DevOps/SRE engineer The role is flexible with working locations, but the successful applicant must be prepared to go to the client's head office 2-3 times a month.
22/09/2022
Contractor
An exciting opportunity for a DevOps Engineer has become available to be a part of a Global Publishing Group's technical team, who deliver to over two hundred countries. As a DevOps/Site Reliability Engineer, you will help build and support scalable, secure, resilient infrastructure in AWS and other cloud providers. You will be an invaluable resource for their tech teams, writers and editors and the readership of their digital publications. About You: - Built or supported resilient cloud applications - Helped manage CI/CD pipelines to automate application testing and deployment - Familiar with some or all of the following: automation, configuration management, resilience, elasticity/horizontal scaling, observability, cost optimization, and security concepts and principles - An approach that constantly reviews industry best practices and security practices and applies them accordingly What you will do: - Serve as a member of the Mail Technology DevOps team while managing overall cloud and system health, resilience, performance, security, capacity, and cost - Manage and improve our AWS environments and infrastructure - Troubleshoot application and system issues, while communicating status to the rest of the team - Help build on our automation pipeline to increase our reliability and development velocity. 
- We have a pipeline and want to extend it to further automate testing and deployment - Improve observability coverage and tooling to increase the efficiency of the DevOps team - Focus will be initially around mitigating and fixing security issues Required Qualifications: Key Skills: - 3+ years of technical experience with Platform, Site Reliability Engineering, or DevOps - Solid experience managing, monitoring, and supporting AWS environments - Experience with software release management (JIRA, GitHub & source code branching strategies) and agile/scrum methodologies - Capacity planning and auto scaling infrastructure - 3+ years of experience with Amazon Web Services - Docker containers and orchestrators (ECS, Kubernetes) - Build and configuration management tools (Ansible, Chef, Puppet) - Experience programming in a language like Python, JavaScript or Bash - Deploying and supporting Linux and Windows Servers - Designing and operating platforms to support off-the-shelf software - Experience troubleshooting performance issues - Flexibility This needs combining with a positive attitude and an ability to work within a large, globally dispersed project team in a multi-cultural environment. You also need to be a self-starter, a logical thinker and a quick learner, with strong initiative and excellent communication, interpersonal and presentation skills, able to write clearly and concisely. We believe in equality of opportunity for all job applicants regardless of gender, marital status, race, colour, nationality, ethnic origin, creed or religion, disability, sexual orientation or age. Specialising within Energy Trading, Oil & Gas, Financial Markets and TV & Entertainment, Eaglecliff Recruitment is ISO accredited, a Member of REC and listed within the top 4% for Financial stability by Dun & Bradstreet. Please telephone for an immediate response or email your CV for a reply within one hour. 
Eaglecliff Ltd is acting in the capacity of an employment agency for permanent recruitment and an employment business for contractor resourcing
05/11/2021
Full time
Senior Application Support / SRE
Hybrid Working: Mix of Home Working / London EMEA HQ
Permanent, Full Time

As a trusted and preferred recruitment partner to this leading global provider of cloud-based solutions to the global financial sector, we have been asked to assist in the hire of a Senior Application Support Engineer to take responsibility for the availability and reliability of services used by over 23,000 customers across 90 countries (including 22 of the world's top 25 banks). In this role you will ensure all services exceed availability targets, have in-depth monitoring and are proactively managed.

Already benefitting from a dominance in the North American finance industry, our client is expanding its London operations to better serve the UK and EU markets. This is an exciting time to join, and you will have the opportunity to split your time between remote working and their state-of-the-art EMEA HQ in London.

Your Job
*Service Reliability: Proactively identify risks to service and remediate them. Reduce risk from deployments through improved use of resilience and appropriate testing of releases pre and post deployment. Provide support and troubleshooting when service incidents occur. Improve time to recover from service-impacting incidents. Identify trends and root causes to reduce the volume of incidents.
*Automation: Identify and deliver on opportunities to use automation to increase efficiency, reduce toil and drive service availability. Use automation and orchestration techniques to provide repeatable solutions and reduce the risk of mis-operations.
*Observability: Monitor and ensure smooth operation of all production services. Identify gaps in coverage and improve observability of production services. Ensure appropriate events are generated for service failure or degradation scenarios. Respond to events and alerts in a timely manner, managing them through to resolution.
*Knowledge management: Continuously improve the knowledge of the Application Support team so that they become subject matter experts on the Product and the technology that runs it. Collaborate with other teams to understand how underpinning services support the Products. Identify opportunities to share knowledge and decrease the time it takes to resolve customer-related incidents.

Tech Stacks:
Platform and Database Tech: Linux, Cassandra, Kafka, ArangoDB
Containerisation/Virtualisation: Kubernetes/OpenShift, VMware
Instrumentation and Monitoring: Splunk, Zabbix, Prometheus, Grafana
Scripting: PowerShell, Python

Your Skills
*Experience as a Site Reliability Engineer, Application Support Engineer or similar running highly available critical services (ideally SaaS)
*Scripting abilities in PowerShell / Python
*Understanding of networking, firewalls, protocols, databases and more
*Java Debugging - ability to complete thread dumps and analysis
*Experience with monitoring solutions
*Splunk Experience - creating dashboards, events and analysis
*CI/CD Delivery Practices
*Troubleshooting connectivity issues: TCP/IP, DNS, Telnet, traceroute, tcpdump and analysis
*Awareness of Load Balancing Technologies such as HAProxy, Nginx, F5
*Experience of collaboration technologies - email, archiving, instant messaging
*Exposure to supporting Voice / SMS Tech is nice to have

Alongside a competitive salary, you will receive a benefits package which includes 25 Days Holiday (increases with service), Private Medical Cover, Bupa Dental Cover, Life Insurance, Income Protection, Secondment Opportunities to Global HQ in Vancouver, Pension Scheme (increases with service up to 7% employer contribution), Bonus Scheme (up to 8% dependent on revenues and team performance).
This role would be suitable for those who have held the following job roles: Site Reliability Engineer, Senior SRE, Site Availability Engineer, Application Support Engineer, Senior Site Reliability Engineer, Senior Application Support Engineer, Lead SRE, Lead Site Reliability Engineer, Lead Application Support.

Deerfoot IT Resources Ltd is one of the UK's leading IT Recruitment Agencies, trusted by many of the UK's leading employers. Established in 1997, we have over twenty years of experience as IT Recruitment Specialists. We will never send your CV anywhere without your authorisation and only after you have seen the complete details on this opportunity. Deerfoot is acting as an employment agency in relation to this vacancy. Each time Deerfoot sends a CV to a recruiting client we donate £1 to The Born Free Foundation.
04/11/2021
Full time
As one of the leading retail banks in the UK, Lloyds Banking Group is undertaking a major engineering transformation. Our goal is to put technology at the forefront of our culture and transform banking. Our team in General Insurance is passionate about creating new services for our customers that transform the customer and colleague experience. Our culture brings together smart, hardworking people from a diverse group of backgrounds who enjoy a collaborative and innovative environment that supports flexible and agile working.

As a Site Reliability Engineer at Lloyds (General Insurance) you'll be a part of a small but expanding SRE team responsible for driving high reliability into our systems by working closely with software development and IT-operations teams. As one of the early joiners in this area you'll help shape our SRE strategy and mentor engineers in product teams to produce optimal code with a service engineering focus.

Want to know more?

The SRE team will be focusing on Lloyds General Insurance production services and you'll have significant influence on the design and delivery of our products, features and new technical capabilities. We love solving problems using tooling and technology to optimise our platforms and our ways of working. You'll:
- Guarantee the reliability of Lloyds General Insurance production service, providing support to a global and diverse organization.
- Provide platforms and tooling that make shipping to production easy and reliable, with real attention to quality and control.
- Mitigate incidents as part of our blameless post-mortem culture and build solutions and automation to prevent them from happening again.
- Through our post-incident reviews, identify and mitigate weaknesses in incident management or software delivery, optimising our SDLC (Software Development Life Cycle) where required.
- Ensure all necessary monitoring, alerting and backup solutions are in place.
- Use SLOs to guide prioritization, putting reliability front and centre.
- Dive into large codebases, not being afraid of programming more than a few lines of code yourself.
- Spend a small amount of your time dealing with incidents and internal change requests. This is not a service-desk or incident-only position; the vast majority of your time will be spent creating and optimizing our tools and infrastructure.

We'll be specifically looking for these skills on your CV

This is a relatively senior level role, and we're looking for the following skills and experience:
- Experience in software engineering and automation in one or more modern programming languages.
- Strong resilience and determination to make a difference.
- A true passion for monitoring and observability, and an understanding of the importance of reliability in our large-scale production systems for our customers.
- Experience in public cloud (AWS, GCP, or Azure) and containerization.
- Prior experience with SRE or SRE concepts.

Our current tech stack is vast and wide ranging but includes the following core tech:
- Modern Web Tech such as JavaScript, NodeJS, React/Redux
- More Traditional/Compiled Tech such as C#, SQL, XML
- Code Management and deployment tech such as Git/TFS, UrbanCode Deploy, Jenkins
- Automated testing capabilities such as Selenium, Cucumber

We'd also like to see experience of:
- Working in Agile, Scrum and Waterfall environments, and being flexible to work with a mix of methodologies and inspire change quickly
- Passion to lead and work collaboratively across teams in order to align to Strategy and meet the overall goals of the Bank and Partners
- A hunger to learn and self-improve, with a passion not just for the result, but for how you get there.
- Excitement about enabling others to achieve greater results than one can achieve alone and being part of a wider delivery team responsible for end-to-end solutions.
Join us and be part of an inclusive, values-led culture that celebrates diversity, equal opportunity and provides opportunities for flexible working. Together we'll make it possible...

So, what can we offer you in return?

Whatever your aspiration, you can also expect excellent benefits, personal development and a career that's enriching and full of opportunity. You'll also receive a package that includes:
- Discretionary bonus
- Cash sum of 4% which you can exchange for a variety of benefits or simply take the cash
- Private Medical Insurance
- Pension, where we'll give up to a max of 13%
- Share plans
- 30 days holiday plus bank holidays

Are you interested in joining us? Apply today; we'd love to hear from you.
18/03/2021
Full time