Platform Systems Engineer: Scalable AI Infra & Observability

  • OpenAI
  • 02/02/2026
Full time | Information Technology | Telecommunications

Job Description

A global AI research company in London is seeking a Software Engineer for Platform Systems to improve large-scale AI training infrastructure. Key responsibilities include designing failure detection systems, improving observability, and collaborating with other teams to keep the platform reliable. Ideal candidates have experience in distributed systems, performance optimization, and debugging complex issues. Join us in shaping the future of AI technology with state-of-the-art engineering solutions.