Browser Runtime
Playground
Run Gerbil directly in your browser. No installation required.
Requires WebGPU support (Chrome 113+, Edge 113+)
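A page can check for WebGPU before loading anything heavy. The sketch below is not Gerbil's own code; it shows one common way to feature-detect WebGPU through the standard `navigator.gpu.requestAdapter()` call. The `GpuLike` and `NavigatorLike` types and the `hasWebGPU` helper are hypothetical names introduced here so the example compiles without extra type packages.

```typescript
// Minimal structural types (hypothetical) so this compiles without @webgpu/types.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Resolves to true only when `gpu` is exposed AND an adapter can be obtained.
// A missing `gpu` property, a null adapter, or a thrown error all count as "no WebGPU".
async function hasWebGPU(nav: NavigatorLike): Promise<boolean> {
  if (!nav.gpu) return false;
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter !== null;
  } catch {
    return false;
  }
}
```

In a browser this would be called with the global `navigator` object (cast to `NavigatorLike` if the TypeScript DOM typings in use do not include `gpu`), and the page could show the "Requires WebGPU support" notice when the result is `false`.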
How it works
- 01 Models are downloaded to your browser's cache on first use (one-time download)
- 02 Inference runs locally using WebGPU for GPU acceleration
- 03 Your prompts never leave your device, so they stay completely private
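The "download once, then reuse" behaviour in the first step can be sketched as a cache-first lookup. This is an illustration under stated assumptions, not Gerbil's actual implementation: `ByteStore`, `MemoryStore`, `Downloader`, and `loadModelBytes` are hypothetical names, and the in-memory store stands in for whatever persistent browser storage the playground really uses (for example the Cache API or IndexedDB).

```typescript
// Hypothetical storage abstraction; a browser version could wrap persistent storage.
interface ByteStore {
  get(key: string): Promise<Uint8Array | undefined>;
  put(key: string, bytes: Uint8Array): Promise<void>;
}

// Hypothetical downloader: fetches the model file and returns its bytes.
type Downloader = (url: string) => Promise<Uint8Array>;

// In-memory stand-in for the browser cache, used here only for demonstration.
class MemoryStore implements ByteStore {
  private items = new Map<string, Uint8Array>();
  async get(key: string): Promise<Uint8Array | undefined> {
    return this.items.get(key);
  }
  async put(key: string, bytes: Uint8Array): Promise<void> {
    this.items.set(key, bytes);
  }
}

// Cache-first load: return stored bytes if present, otherwise download and store them.
async function loadModelBytes(
  url: string,
  store: ByteStore,
  download: Downloader
): Promise<{ bytes: Uint8Array; fromCache: boolean }> {
  const cached = await store.get(url);
  if (cached !== undefined) {
    return { bytes: cached, fromCache: true };
  }
  const bytes = await download(url);
  await store.put(url, bytes);
  return { bytes, fromCache: false };
}
```

Calling `loadModelBytes` twice with the same URL and store invokes the downloader only on the first call; the second call is served from the store, which mirrors the one-time download described above.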
Chat Models
qwen3.5-0.8b (404MB)
lfm2.5-350m (199MB)
- Text + vision, 262K context
- Thinking mode (CoT)
Vision
qwen3.5-0.8b (404MB)
- Image understanding
- OCR & analysis
Text-to-Speech
kani-tts-2 (on-device)
- Native codec-LM synthesis
- 22.05 kHz mono PCM
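The output format above fixes the arithmetic for anyone handling the raw audio: one second of mono audio at 22.05 kHz is 22,050 samples. The helpers below are a small sketch of that arithmetic; the function names are hypothetical, and the 2-bytes-per-sample default is an assumption, since the page does not state the bit depth.

```typescript
const SAMPLE_RATE_HZ = 22050; // 22.05 kHz, as listed above
const CHANNELS = 1; // mono

// Duration in seconds of a buffer holding `sampleCount` mono samples.
function pcmDurationSeconds(sampleCount: number, sampleRate: number = SAMPLE_RATE_HZ): number {
  return sampleCount / sampleRate;
}

// Size in bytes of `seconds` of audio.
// Assumption: 2 bytes per sample (16-bit PCM) unless the caller says otherwise.
function pcmByteLength(seconds: number, bytesPerSample: number = 2): number {
  return Math.round(seconds * SAMPLE_RATE_HZ) * CHANNELS * bytesPerSample;
}
```

Under the 16-bit assumption, one second of speech takes 44,100 bytes; with 32-bit float samples (`bytesPerSample = 4`) it would take twice that.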
Speech-to-Text
moonshine-base (encoder-decoder ASR)
- Native raw-waveform transcription
- English, on-device
Tips
- Use Cmd+Enter to generate/speak
- Enable thinking mode for math and reasoning
- First use downloads models (cached afterward)
- All processing happens on-device
// Want to use Gerbil in Node.js? Check out the docs