Bright Purple
Midlothian, UK
LLM Architect
Edinburgh (on-site)
£100k-120k + exceptional benefits

A rare chance to drive the future of AI infrastructure at one of the world's leading R&D tech organisations.

This is a senior opportunity with a global research leader, where you'll architect and optimise the platforms that deliver large-scale language models to production. You'll be working on some of the hardest challenges in distributed AI systems: building ultra-reliable, ultra-scalable environments for inference and deployment.

What you'll be doing

Designing cloud-native architectures to run large language models on serverless frameworks (e.g. Kubernetes, Knative, or custom-built FaaS).
Developing approaches to minimise cold-start latency through advanced container snapshotting, weight pre-loading, and graph partitioning.
Building distributed inference pipelines with tensor parallelism, model sharding, and efficient memory scheduling to serve LLMs at scale.
Experimenting...