./rankings·top-of-list · 137 models across 8 categoriessorted by min VRAM asc · params desc as tie-break
top of list
best model in each category, ranked
rank #1 = smallest VRAM that still ships quality. ranking ties broken by parameter count. click a row to drill into the model card.
chat & general·74 modelsllm
general-purpose language models for conversation, writing, and reasoning
coding·17 modelscode
specialized models for code generation, completion, and debugging
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | Qwen 2.5 Coder 0.5B | Alibaba | 0.5B | 1.1GB | open → |
| △2 | DeepSeek Coder 1.3B | DeepSeek | 1.3B | 1.3GB | open → |
| ▴3 | Yi Coder 1.5B | 01.AI | 1.5B | 1.4GB | open → |
| 4 | Qwen 2.5 Coder 1.5B | Alibaba | 1.5B | 1.5GB | open → |
| 5 | CodeGemma 2B | 2B | 2.0GB | open → | |
| 6 | Stable Code 3B | Stability AI | 3B | 2.1GB | open → |
| 7 | StarCoder2 3B | BigCode | 3B | 2.3GB | open → |
| 8 | Qwen 2.5 Coder 3B | Alibaba | 3B | 2.5GB | open → |
| 9 | Code Llama 7B | Meta | 7B | 4.3GB | open → |
| 10 | DeepSeek Coder 6.7B | DeepSeek | 6.7B | 4.3GB | open → |
| 11 | StarCoder2 7B | BigCode | 7B | 4.7GB | open → |
| 12 | Qwen 2.5 Coder 7B | Alibaba | 7.6B | 4.9GB | open → |
| 13 | Yi Coder 9B | 01.AI | 9B | 5.5GB | open → |
| 14 | CodeGemma 7B | 8.5B | 5.5GB | open → | |
| 15 | Code Llama 13B Instruct | Meta | 13B | 7.8GB | open → |
| 16 | Qwen 2.5 Coder 14B | Alibaba | 14B | 8.9GB | open → |
| 17 | Codestral 22B (abliterated) | failspy | 22B | 12.9GB | open → |
image gen·9 modelsimage
text-to-image models for art, photos, and design
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | Stable Diffusion 2.1 Base (CoreML) | Stability AI / Apple | 0.86B | 1.6GB | open → |
| △2 | Stable Diffusion 1.5 (GGUF) | Runway / GPUStack | 0.86B | 2.1GB | open → |
| ▴3 | Stable Diffusion 1.5 (CoreML) | Runway | 0.86B | 2.5GB | open → |
| 4 | Stable Diffusion 2.1 (GGUF) | Stability AI | 0.86B | 2.7GB | open → |
| 5 | Stable Diffusion XL (CoreML) | Stability AI | 3.5B | 3.3GB | open → |
| 6 | SDXL Turbo (GGUF) | Stability AI | 3.5B | 5.0GB | open → |
| 7 | Stable Diffusion 3 Medium (GGUF) | Stability AI | 2.5B | 9.2GB | open → |
| 8 | FLUX.1 Schnell (GGUF) | Black Forest Labs | 12B | 14.0GB | open → |
| 9 | FLUX.1 Dev (GGUF) | Black Forest Labs | 12B | 14.0GB | open → |
speech-to-text·9 modelsstt
transcription and speech recognition models
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | Whisper Tiny English (Quantized) | OpenAI | 0.039B | 0.1GB | open → |
| △2 | Whisper Tiny | OpenAI | 0.039B | 0.2GB | open → |
| ▴3 | Whisper Base | OpenAI | 0.074B | 0.3GB | open → |
| 4 | Whisper Base English | OpenAI | 0.074B | 0.3GB | open → |
| 5 | Whisper Small | OpenAI | 0.24B | 0.9GB | open → |
| 6 | Distil-Whisper Large v3 | HuggingFace | 0.76B | 1.9GB | open → |
| 7 | Whisper Medium | OpenAI | 0.77B | 1.9GB | open → |
| 8 | Whisper Large v3 Turbo | OpenAI | 0.81B | 2.0GB | open → |
| 9 | Whisper Large v3 | OpenAI | 1.55B | 3.4GB | open → |
text-to-speech·14 modelstts
voice synthesis and text-to-speech models
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | Piper TTS - Amy (English) | Rhasspy | 0.02B | 0.1GB | open → |
| △2 | Piper TTS - Lessac (English) | Rhasspy | 0.02B | 0.1GB | open → |
| ▴3 | Piper TTS - Spanish (MLS) | Rhasspy | 0.02B | 0.1GB | open → |
| 4 | Piper TTS - German (Thorsten) | Rhasspy | 0.02B | 0.1GB | open → |
| 5 | Piper TTS - Chinese (Huayan) | Rhasspy | 0.02B | 0.1GB | open → |
| 6 | Piper TTS - Japanese (Kokoro) | Rhasspy | 0.02B | 0.1GB | open → |
| 7 | Piper TTS - Korean | Rhasspy | 0.02B | 0.1GB | open → |
| 8 | Piper TTS - Russian (Irina) | Rhasspy | 0.02B | 0.1GB | open → |
| 9 | Piper TTS - Portuguese (Faber) | Rhasspy | 0.02B | 0.1GB | open → |
| 10 | Piper TTS - Arabic (Kareem) | Rhasspy | 0.02B | 0.1GB | open → |
| 11 | Piper TTS - French (Siwis) | Rhasspy | 0.02B | 0.5GB | open → |
| 12 | Piper TTS - Italian (Riccardo) | Rhasspy | 0.02B | 0.5GB | open → |
| 13 | Piper TTS - LibriTTS-R (English) | Rhasspy | 0.02B | 0.6GB | open → |
| 14 | Kokoro 82M TTS | Kokoro | 0.082B | 0.6GB | open → |
audio gen·3 modelsaudio
AI music and audio creation
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | MusicGen Small | Meta | 0.3B | 0.8GB | open → |
| △2 | Stable Audio Open | Stability AI | 1B | 6.0GB | open → |
| ▴3 | ACE-Step 1.5XL | ACE Studio | 1.5B | 8.0GB | open → |
multimodal / vision·6 modelsvlm
models that understand both images and text
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | Qwen2-VL 2B | Alibaba | 2.2B | 1.4GB | open → |
| △2 | Moondream 2 | Moondream | 1.8B | 1.5GB | open → |
| ▴3 | MiniCPM-V 2.6 | OpenBMB | 2B | 2.1GB | open → |
| 4 | PaliGemma 3B | 3B | 2.5GB | open → | |
| 5 | Phi-3.5 Vision | Microsoft | 4.2B | 3.2GB | open → |
| 6 | LLaVA 1.6 7B | LLaVA | 7B | 5.0GB | open → |
embedding·5 modelsembed
text embedding models for search and retrieval
| rank | model | author | params | min vram | action |
|---|---|---|---|---|---|
| ▲1 | BGE Small EN v1.5 | BAAI | 0.033B | 0.1GB | open → |
| △2 | Snowflake Arctic Embed S | Snowflake | 0.033B | 0.1GB | open → |
| ▴3 | all-MiniLM-L6-v2 | Sentence Transformers | 0.023B | 0.1GB | open → |
| 4 | Nomic Embed Text v1.5 | Nomic AI | 0.137B | 0.3GB | open → |
| 5 | BGE Large EN v1.5 | BAAI | 0.335B | 0.8GB | open → |
cloud://gpu·escape hatch
can't run the model you want?
cloud GPUs give you instant access to any model, any size.