AI Model Rankings

Best models for every use case, ranked by quality and sorted by minimum VRAM. Find the best model your hardware can run.

💬

Chat & General

General-purpose language models for conversation, writing, and reasoning

#ModelAuthorParamsMin VRAMAction
1SmolLM2 135MHuggingFace0.135B0.3GBCheck
2SmolLM2 360MHuggingFace0.36B0.5GBCheck
3Danube 3 500MH2O.ai0.5B0.6GBCheck
4Qwen 2.5 0.5BAlibaba0.5B0.7GBCheck
5TinyLlama 1.1BTinyLlama1.1B1.2GBCheck
6Gemma 3 1BGoogle1B1.2GBCheck
7Falcon 3 1BTII1B1.2GBCheck
8Llama 3.2 1B InstructMeta1.24B1.3GBCheck
9Qwen 2.5 1.5BAlibaba1.5B1.5GBCheck
10DeepSeek R1 Distill 1.5BDeepSeek1.5B1.5GBCheck
11SmolLM2 1.7BHuggingFace1.7B1.6GBCheck
12Granite 3.3 2BIBM2B1.8GBCheck
13EXAONE 3.5 2.4BLG AI2.4B2GBCheck
14StableLM Zephyr 3BStability AI3B2.3GBCheck
15Rocket 3BPansophic3B2.3GBCheck
16Gemma 2 2BGoogle2.6B2.3GBCheck
17Qwen 2.5 3BAlibaba3B2.5GBCheck
18Falcon 3 3BTII3B2.5GBCheck
19Llama 3.2 3B InstructMeta3.2B2.6GBCheck
20Nemotron Mini 4BNVIDIA4B3GBCheck
21Danube 3 4BH2O.ai4B3GBCheck
22Phi-3.5 Mini 3.8BMicrosoft3.8B3GBCheck
23Phi-4 Mini 3.8BMicrosoft3.8B3GBCheck
24Gemma 3 4BGoogle4B3.2GBCheck
25Yi 1.5 6B Chat01.AI6B4.3GBCheck
26Mistral 7B Instruct v0.3Mistral AI7.3B5GBCheck
27Falcon 3 7BTII7B5GBCheck
28OLMo 2 7BAllen AI7B5GBCheck
29OpenChat 3.5 7BOpenChat7B5GBCheck
30InternLM 2.5 7BShanghai AI Lab7.7B5.3GBCheck
31Qwen 2.5 7B InstructAlibaba7.6B5.3GBCheck
32DeepSeek R1 Distill 8BDeepSeek8B5.5GBCheck
33Llama 3.1 8B InstructMeta8B5.5GBCheck
34Granite 3.3 8BIBM8B5.5GBCheck
35EXAONE 3.5 7.8BLG AI7.8B5.5GBCheck
36Gemma 2 9B InstructGoogle9.2B6.2GBCheck
37Yi 1.5 9B Chat01.AI9B6.2GBCheck
38Falcon 3 10BTII10B7GBCheck
39Solar 10.7BUpstage10.7B7.2GBCheck
40Gemma 3 12BGoogle12B8GBCheck
41Mistral Nemo 12BMistral AI12B8GBCheck
42Phi-4Microsoft14B9.5GBCheck
43Qwen 2.5 14BAlibaba14B10GBCheck
44Mistral Small 22BMistral AI22B14.5GBCheck
45Gemma 3 27BGoogle27B17.5GBCheck
46Qwen 2.5 32BAlibaba32B20GBCheck
47Llama 3.1 70B InstructMeta70B42GBCheck
💻

Coding

Specialized models for code generation, completion, and debugging

#ModelAuthorParamsMin VRAMAction
1Qwen 2.5 Coder 0.5BAlibaba0.5B0.9GBCheck
2DeepSeek Coder 1.3BDeepSeek1.3B1.3GBCheck
3Qwen 2.5 Coder 1.5BAlibaba1.5B1.5GBCheck
4Yi Coder 1.5B01.AI1.5B1.5GBCheck
5CodeGemma 2BGoogle2B1.9GBCheck
6Stable Code 3BStability AI3B2.3GBCheck
7StarCoder2 3BBigCode3B2.4GBCheck
8Qwen 2.5 Coder 3BAlibaba3B2.5GBCheck
9Code Llama 7BMeta7B4.7GBCheck
10DeepSeek Coder 6.7BDeepSeek6.7B4.7GBCheck
11StarCoder2 7BBigCode7B5GBCheck
12Qwen 2.5 Coder 7BAlibaba7.6B5.3GBCheck
13CodeGemma 7BGoogle8.5B5.6GBCheck
14Yi Coder 9B01.AI9B6.2GBCheck
15Code Llama 13B InstructMeta13B8.7GBCheck
16Qwen 2.5 Coder 14BAlibaba14B10GBCheck
🎨

Image Generation

Text-to-image models for art, photos, and design

#ModelAuthorParamsMin VRAMAction
1Stable Diffusion 2.1 Base (CoreML)Stability AI / Apple0.86B2GBCheck
2Stable Diffusion 1.5 (CoreML)Runway0.86B2.5GBCheck
3Stable Diffusion 1.5 (GGUF)Runway / GPUStack0.86B2.5GBCheck
4Stable Diffusion 2.1 (GGUF)Stability AI0.86B2.5GBCheck
5Stable Diffusion XL (CoreML)Stability AI3.5B5GBCheck
6SDXL Turbo (GGUF)Stability AI3.5B5GBCheck
7Stable Diffusion 3 Medium (GGUF)Stability AI2.5B5.5GBCheck
8FLUX.1 Schnell (GGUF)Black Forest Labs12B14GBCheck
9FLUX.1 Dev (GGUF)Black Forest Labs12B14GBCheck
🎤

Speech-to-Text

Transcription and speech recognition models

#ModelAuthorParamsMin VRAMAction
1Whisper Tiny English (Quantized)OpenAI0.039B0.1GBCheck
2Whisper TinyOpenAI0.039B0.2GBCheck
3Whisper BaseOpenAI0.074B0.3GBCheck
4Whisper Base EnglishOpenAI0.074B0.3GBCheck
5Whisper SmallOpenAI0.24B0.6GBCheck
6Distil-Whisper Large v3HuggingFace0.76B1GBCheck
7Whisper MediumOpenAI0.77B1.8GBCheck
8Whisper Large v3 TurboOpenAI0.81B1.9GBCheck
9Whisper Large v3OpenAI1.55B3.5GBCheck
🔊

Text-to-Speech

Voice synthesis and text-to-speech models

#ModelAuthorParamsMin VRAMAction
1Piper TTS - Amy (English)Rhasspy0.02B0.15GBCheck
2Piper TTS - Lessac (English)Rhasspy0.02B0.15GBCheck
3Piper TTS - LibriTTS-R (English)Rhasspy0.02B0.15GBCheck
4Piper TTS - Spanish (MLS)Rhasspy0.02B0.15GBCheck
5Piper TTS - French (Siwis)Rhasspy0.02B0.15GBCheck
6Piper TTS - German (Thorsten)Rhasspy0.02B0.15GBCheck
7Piper TTS - Chinese (Huayan)Rhasspy0.02B0.15GBCheck
8Piper TTS - Japanese (Kokoro)Rhasspy0.02B0.15GBCheck
9Piper TTS - KoreanRhasspy0.02B0.15GBCheck
10Piper TTS - Russian (Irina)Rhasspy0.02B0.15GBCheck
11Piper TTS - Portuguese (Faber)Rhasspy0.02B0.15GBCheck
12Piper TTS - Italian (Riccardo)Rhasspy0.02B0.15GBCheck
13Piper TTS - Arabic (Kareem)Rhasspy0.02B0.15GBCheck
14Kokoro 82M TTSKokoro0.082B0.2GBCheck
🎵

Audio Generation

AI music and audio creation

#ModelAuthorParamsMin VRAMAction
1MusicGen SmallMeta0.3B1.5GBCheck
👁️

Multimodal / Vision

Models that understand both images and text

#ModelAuthorParamsMin VRAMAction
1Moondream 2Moondream1.8B1.5GBCheck
2Qwen2-VL 2BAlibaba2.2B2GBCheck
3MiniCPM-V 2.6OpenBMB2B2.1GBCheck
4PaliGemma 3BGoogle3B2.5GBCheck
5Phi-3.5 VisionMicrosoft4.2B3.2GBCheck
6LLaVA 1.6 7BLLaVA7B5GBCheck
🔗

Embedding

Text embedding models for search and retrieval

#ModelAuthorParamsMin VRAMAction
1BGE Small EN v1.5BAAI0.033B0.1GBCheck
2Snowflake Arctic Embed SSnowflake0.033B0.1GBCheck
3all-MiniLM-L6-v2Sentence Transformers0.023B0.1GBCheck
4Nomic Embed Text v1.5Nomic AI0.137B0.3GBCheck
5BGE Large EN v1.5BAAI0.335B0.5GBCheck

Can't Run the Model You Want?

Cloud GPUs give you instant access to any model, any size.