Senior LLM Inference Systems Engineer

Doubleword
13/06/2026

Full time, Information Technology, Telecommunications

Job Description

A leading AI technology company in Greater London is seeking a Senior Research Engineer to develop LLM inference technology. The role involves optimizing infrastructure for batch inference workloads and improving inference engines in memory-constrained environments. Ideal candidates have a deep understanding of inference workloads and GPU architectures, along with familiarity with tools such as PyTorch and TensorRT. The role offers competitive compensation and focuses on solving challenging AI problems.
