Speech-to-Text Models

Speech-to-text models transcribe audio into text, enabling applications like meeting transcription, subtitle generation, voice commands, and podcast indexing. The Whisper family from OpenAI dominates this space, with models ranging from tiny (39M parameters) to large (1.55B parameters) that trade speed against accuracy. Distil-Whisper provides a faster alternative with minimal quality loss. Every model listed here can run fully locally, so audio never has to leave your device.

9 models available

0.1GB min VRAM needed

OpenAI

Whisper Large v3 Turbo

Optimized large Whisper model. Near-best accuracy with faster inference.

Speech · 0.81B · 2.01GB VRAM

7.9M downloads · 1 quant

OpenAI

Whisper Large v3

Largest Whisper model. Best accuracy across all languages and accents.

Speech · 1.55B · 3.38GB VRAM

5.3M downloads · 1 quant

OpenAI

Whisper Small

Compact Whisper model. Good accuracy for everyday transcription tasks.

Speech · 0.24B · 0.95GB VRAM

2.6M downloads · 1 quant

OpenAI

Whisper Base

Base Whisper model. Good balance of speed and accuracy. 142MB.

Speech · 0.074B · 0.3GB VRAM

2.4M downloads · 1 quant

HuggingFace

Distil-Whisper Large v3

Distilled Whisper. About 6x faster than large-v3, with roughly 1% accuracy loss.

Speech · 0.76B · 1.92GB VRAM

986.2K downloads · 1 quant

OpenAI

Whisper Tiny

Tiny multilingual speech recognition. Only 75MB. Supports 99 languages. Runs on any device.

Speech · 0.039B · 0.2GB VRAM

832.4K downloads · 1 quant

OpenAI

Whisper Medium

Mid-size Whisper model. Strong multilingual speech recognition.

Speech · 0.77B · 1.93GB VRAM

711.8K downloads · 1 quant

OpenAI

Whisper Tiny English (Quantized)

Smallest possible speech recognition model. Only 32MB. English only. Default speech model - guaranteed to run on any iPhone.

Speech · 0.039B · 0.1GB VRAM

91.7K downloads · 1 quant

OpenAI

Whisper Base English

English-only base model. Faster and more accurate for English.

Speech · 0.074B · 0.3GB VRAM

23.9K downloads · 1 quant
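
To make the size and VRAM tradeoff concrete, here is a minimal Python sketch that picks a model for a given VRAM budget. The parameter counts and VRAM figures are copied from the listing above. Everything else is an assumption for illustration: `SpeechModel`, `CATALOG`, and `pick_model` are hypothetical names, the `english_only` flag is set only where the listing says "English", and treating "more parameters" as "more accurate" is a rough heuristic rather than a measured ranking.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SpeechModel:
    name: str
    params_b: float             # parameter count in billions, from the listing
    vram_gb: float              # VRAM figure in GB, from the listing
    english_only: bool = False  # set only where the listing says so


# Figures copied from the listing; the structure itself is hypothetical.
CATALOG = [
    SpeechModel("Whisper Large v3 Turbo", 0.81, 2.01),
    SpeechModel("Whisper Large v3", 1.55, 3.38),
    SpeechModel("Whisper Small", 0.24, 0.95),
    SpeechModel("Whisper Base", 0.074, 0.3),
    SpeechModel("Distil-Whisper Large v3", 0.76, 1.92),
    SpeechModel("Whisper Tiny", 0.039, 0.2),
    SpeechModel("Whisper Medium", 0.77, 1.93),
    SpeechModel("Whisper Tiny English (Quantized)", 0.039, 0.1, english_only=True),
    SpeechModel("Whisper Base English", 0.074, 0.3, english_only=True),
]


def pick_model(vram_budget_gb: float, multilingual: bool = False) -> Optional[SpeechModel]:
    """Return the largest listed model that fits the VRAM budget, or None.

    Assumption: parameter count is used as a stand-in for accuracy.
    """
    candidates = [
        m for m in CATALOG
        if m.vram_gb <= vram_budget_gb and not (multilingual and m.english_only)
    ]
    if not candidates:
        return None
    # Prefer more parameters; on a tie, prefer the smaller VRAM footprint.
    return max(candidates, key=lambda m: (m.params_b, -m.vram_gb))


if __name__ == "__main__":
    for budget in (0.1, 1.0, 2.0, 4.0):
        chosen = pick_model(budget)
        print(budget, "GB ->", chosen.name if chosen else "nothing fits")
```

With these figures, a 2GB budget selects Whisper Medium (0.77B) over Distil-Whisper Large v3 (0.76B) purely on parameter count. If speed matters more than size, a different sort key would be the natural change.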


© runthismodel · 2026
