270M parameters running locally with jax-js + WebGPU.
KV cache is allocated dynamically for the current chat.
The first message downloads and caches a 536 MB fp16 checkpoint. Everything after that runs locally in your browser.