KubeCon Europe London
A project lightning talk about sailing multi-host inference with LWS. LWS is a sub-project under Kubernetes community focused on multi-host inference for super big models, like llama3.1-405b and DeepSeek-R3.

Visit the youtube video here: