Job Description
A leading AI technology company in Greater London is seeking a Senior Research Engineer to develop cutting-edge LLM inference technology. The successful candidate will optimize infrastructure for batch inference workloads and improve inference engines in memory-constrained environments. Ideal candidates will have a deep understanding of inference workloads and GPU architectures, along with familiarity with tools such as PyTorch and TensorRT. The role offers competitive compensation and the opportunity to solve challenging AI problems.