Browser Runtime

Playground

Run Gerbil directly in your browser. No installation required.

Requires WebGPU support (Chrome 113+, Edge 113+)
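A page can check for WebGPU before trying to load anything. The sketch below is only an illustration, not Gerbil's own check: `hasWebGPU` and `NavigatorLike` are hypothetical names, and the only thing it relies on is that browsers with WebGPU expose `navigator.gpu`.

```typescript
// Minimal structural type so the sketch compiles without WebGPU type definitions.
// (Hypothetical helper type, not part of Gerbil.)
interface NavigatorLike {
  gpu?: unknown;
}

// Returns true when the WebGPU entry point (navigator.gpu) is exposed.
function hasWebGPU(nav: NavigatorLike | undefined): boolean {
  return nav !== undefined && nav.gpu !== undefined && nav.gpu !== null;
}

// Reads the global navigator, if there is one, without assuming DOM typings.
function detectWebGPU(): boolean {
  const nav = (globalThis as unknown as { navigator?: NavigatorLike }).navigator;
  return hasWebGPU(nav);
}
```

The presence of `navigator.gpu` does not guarantee a usable GPU: `navigator.gpu.requestAdapter()` can still resolve to `null`, so a fuller check would await that call as well.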

How it works

01 Models are downloaded to your browser's cache on first use and reused from the cache afterwards
02 Inference runs locally, using WebGPU for GPU acceleration
03 Your prompts never leave your device, so they stay private
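The download-once behavior in step 01 follows a cache-first pattern, which can be sketched as below. This is an assumption-laden illustration rather than Gerbil's implementation: `ByteStore`, `Downloader`, `createMemoryStore`, and `loadModel` are hypothetical names, and the in-memory store stands in for whatever browser storage the playground actually uses.

```typescript
// Hypothetical storage abstraction; in a browser it could wrap the Cache API.
interface ByteStore {
  get(key: string): Promise<Uint8Array | undefined>;
  set(key: string, value: Uint8Array): Promise<void>;
}

// Hypothetical downloader; in a browser it would typically wrap fetch().
type Downloader = (url: string) => Promise<Uint8Array>;

// In-memory stand-in for persistent browser storage.
function createMemoryStore(): ByteStore {
  const entries = new Map<string, Uint8Array>();
  return {
    async get(key) {
      return entries.get(key);
    },
    async set(key, value) {
      entries.set(key, value);
    },
  };
}

// Cache-first load: return stored bytes if present, otherwise download and store them.
async function loadModel(
  url: string,
  store: ByteStore,
  download: Downloader,
): Promise<{ bytes: Uint8Array; fromCache: boolean }> {
  const cached = await store.get(url);
  if (cached !== undefined) {
    return { bytes: cached, fromCache: true };
  }
  const bytes = await download(url);
  await store.set(url, bytes);
  return { bytes, fromCache: false };
}
```

Calling `loadModel` twice with the same URL and store triggers one download; the second call is served from the store.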

Chat Models

qwen3.5-0.8b — 404MB
lfm2.5-350m — 199MB
Text + vision, 262K context
Thinking mode (CoT)

Vision

qwen3.5-0.8b — 404MB
Image understanding
OCR & analysis

Text-to-Speech

kani-tts-2 — on-device
Native codec-LM synthesis
22.05 kHz mono PCM
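To make the output format concrete: at 22.05 kHz mono, one second of audio is 22,050 samples. The sketch below computes a clip's duration and wraps samples in a 16-bit WAV container. It assumes the samples arrive as floats in the range [-1, 1], which this page does not state; `durationSeconds` and `encodeWav` are hypothetical helpers, not part of Gerbil.

```typescript
// Sample rate taken from the spec line above: 22.05 kHz, mono.
const SAMPLE_RATE = 22050;

// Duration of a mono clip in seconds.
function durationSeconds(sampleCount: number, sampleRate: number = SAMPLE_RATE): number {
  return sampleCount / sampleRate;
}

// Encodes mono float samples (assumed range [-1, 1]) as a 16-bit PCM WAV file.
function encodeWav(samples: Float32Array, sampleRate: number = SAMPLE_RATE): Uint8Array {
  const bytesPerSample = 2;
  const dataSize = samples.length * bytesPerSample;
  const buffer = new ArrayBuffer(44 + dataSize);
  const view = new DataView(buffer);
  const writeAscii = (offset: number, text: string): void => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i));
    }
  };

  writeAscii(0, "RIFF");
  view.setUint32(4, 36 + dataSize, true); // RIFF chunk size
  writeAscii(8, "WAVE");
  writeAscii(12, "fmt ");
  view.setUint32(16, 16, true); // fmt chunk size for PCM
  view.setUint16(20, 1, true); // audio format: 1 = PCM
  view.setUint16(22, 1, true); // channels: mono
  view.setUint32(24, sampleRate, true);
  view.setUint32(28, sampleRate * bytesPerSample, true); // byte rate
  view.setUint16(32, bytesPerSample, true); // block align
  view.setUint16(34, 16, true); // bits per sample
  writeAscii(36, "data");
  view.setUint32(40, dataSize, true);

  for (let i = 0; i < samples.length; i++) {
    // Clamp to [-1, 1] before scaling to the signed 16-bit range.
    const clamped = Math.max(-1, Math.min(1, samples[i] ?? 0));
    view.setInt16(44 + i * bytesPerSample, Math.round(clamped * 32767), true);
  }
  return new Uint8Array(buffer);
}
```

The resulting bytes can be wrapped in a `Blob` with type `audio/wav` for download or playback in the browser.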

Speech-to-Text

moonshine-base — encoder-decoder ASR
Native raw-waveform transcription
English, on-device

Tips

- Use Cmd+Enter to generate/speak
- Enable thinking mode for math and reasoning
- First use downloads models (cached after)
- All processing happens on-device

// Want to use Gerbil in Node.js? Check out the docs