Inference

Cloud [Easy] [Unavailable]

Fastest and most powerful. Uses cloud-hosted models via API.
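As a rough illustration only, a cloud inference call is typically a single HTTP request to a hosted endpoint. The path /api/inference and the request/response shapes below are hypothetical, not this app's documented API:

```ts
// Minimal sketch of a cloud inference call. The endpoint path
// "/api/inference" and the payload shapes are assumptions for
// illustration; the app's actual cloud API is not documented here.
async function generateCloud(prompt: string): Promise<string> {
  const res = await fetch("/api/inference", {         // hypothetical endpoint
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ prompt }),                 // assumed request shape
  });
  if (!res.ok) throw new Error(`Cloud request failed: ${res.status}`);
  const data = await res.json();
  return data.text;                                   // assumed response field
}
```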

Ollama (Local) [Advanced]

Run models on your own machine via Ollama, which makes for fast iteration during development. Requires an Ollama server listening at localhost:11434.
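For context, a client can reach that local server through Ollama's /api/generate endpoint. A minimal sketch, assuming a model has already been pulled locally (the name "llama3" here is just an example):

```ts
// Minimal sketch: query a local Ollama server via its /api/generate
// endpoint. Requires Ollama running at localhost:11434 and a model
// pulled locally (e.g. `ollama pull llama3`).
async function generateLocal(prompt: string): Promise<string> {
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "llama3", // example model name; any pulled model works
      prompt,
      stream: false,   // ask for one complete JSON response, not a stream
    }),
  });
  if (!res.ok) throw new Error(`Ollama request failed: ${res.status}`);
  const data = await res.json();
  return data.response; // generated text is in the "response" field
}
```

Calling `generateLocal("hello")` then resolves to the model's completion as plain text.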

On-Device (Browser) [Experimental]

Maximum privacy: models run entirely in your browser using WebGPU. The first load downloads the model (~2GB); after that, it works offline.
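Because WebGPU is not available in every browser, an app like this would typically gate the option behind a feature check. A minimal sketch using the standard WebGPU entry points (navigator.gpu and requestAdapter()):

```ts
// Minimal sketch: detect whether WebGPU is usable before enabling
// on-device inference.
async function webgpuAvailable(): Promise<boolean> {
  // navigator.gpu is the WebGPU entry point; it is undefined in
  // browsers without WebGPU support.
  const gpu = (navigator as any).gpu;
  if (!gpu) return false;
  // requestAdapter() resolves to null when no suitable GPU is found.
  const adapter = await gpu.requestAdapter();
  return adapter !== null;
}
```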

Display

About

Multigrain: for your information

Created by Abe Rubenstein

Sparked by Tim Brown's Train On Your Groupchat