GRADUATE AI ENGINEER

  • Reply, Inc.
  • 04/02/2026
Full time Information Technology Telecommunications

Job Description

Role Overview

Sail Reply is an AI tech innovation consultancy that delivers experience-led, value-focused solutions for some of the world's most forward-thinking organisations. Our mission is democratising LLMs to any business process by turning proprietary knowledge into competitive advantage with bespoke LLMs built for Clients domain and deployed at scale. We buildbespoke LLM solutions tailored to the client's business processes, deliveringenterprise-grade performancecomparable to leading off the shelf model. Providing a solution designed forhigh relevance, low latency, and compliance. This role is available for an immediate start.

Responsibilities
  • Design, develop, and train large language models and AI systems.
  • Fine-tune pre-trained LLMs (e.g., GPT, LLaMA, Mistral, Falcon) for specific use cases.
  • Build and optimize prompting strategies, Retrieval-Augmented Generation (RAG), and agent-based systems.
  • Prepare, clean, and manage large-scale datasets for model training.
  • Implement model evaluation, benchmarking, and performance optimization.
  • Deploy models into production using scalable and secure architectures.
  • Collaborate with cross-functional teams to translate business needs into AI solutions.
  • Monitor model performance, manage model drift, iterate improvements, and stay current with the latest research and advancements in AI and LLMs.
Qualifications
  • Bachelor's or Master's degree (2:1 or higher) in Computer Science, AI, Machine Learning, or a related field (or equivalent experience) is essential.
  • Strong experience with Python and ML frameworks such as PyTorch or TensorFlow, and hands-on experience training, fine-tuning, or deploying LLMs.
  • Solid understanding of NLP, transformers, attention mechanisms, embeddings, and experience with data preprocessing, tokenization, and dataset pipelines.
  • Familiarity with REST APIs, microservices, model serving, and MLOps tools (e.g., MLflow, Kubeflow, Airflow, Weights & Biases).
  • Experience with cloud platforms (AWS, GCP, Azure), distributed training, model parallelism, inference optimization, and GPU/TPU infrastructure.
  • Knowledge of vector databases (e.g., FAISS, Pinecone), security, privacy, and responsible AI practices.
  • Strong problem-solving, analytical, and communication skills, with a positive, team-oriented attitude and a passion for continuous learning.
  • Additional advantages include experience with RLHF, open-source contributions, building AI copilots/chatbots, client and stakeholder management, and use of Atlassian tools like Jira and Confluence.
  • Willingness to travel within the UK and EU for client engagements as required.
Equality, Diversity, and Inclusion

Reply is an Equal Opportunities Employer and committed to embracing diversity in the workplace. We provide equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type regardless of age, sexual orientation, gender, identity, pregnancy, religion, nationality, ethnic origin, disability, medical history, skincolour, marital status or parental status or any other characteristic protected by the Law.

Reply is committed to making sure that our selection methods are fair to everyone. To help you during the recruitment process, please let us know of any Reasonable Adjustments you may need.