Compatible Models
74
Largest Model
Yi 1.5 6B Chat (6B)
Categories
9
💬
Chat / LLM (25 models)
Yi 1.5 6B Chat
01.AI
3.9GB VRAMQ4_K_M
85%
Gemma 3 4B
2.8GB VRAMQ4_K_M
85%
Nemotron Mini 4B
NVIDIA
3.0GB VRAMQ4_K_M
85%
Danube 3 4B
H2O.ai
2.7GB VRAMQ4_K_M
85%
Phi-3.5 Mini 3.8B
Microsoft
3.1GB VRAMQ5_K_M
90%
Phi-4 Mini 3.8B
Microsoft
2.8GB VRAMQ4_K_M
85%
Llama 3.2 3B Instruct
Meta
3.7GB VRAMQ8_0
98%
Qwen 2.5 3B
Alibaba
3.9GB VRAMQ8_0
98%
Falcon 3 3B
TII
3.8GB VRAMQ8_0
98%
StableLM Zephyr 3B
Stability AI
3.3GB VRAMQ8_0
98%
Rocket 3B
Pansophic
3.3GB VRAMQ8_0
98%
Gemma 2 2B
3.1GB VRAMQ8_0
98%
EXAONE 3.5 2.4B
LG AI
3.1GB VRAMQ8_0
98%
Granite 3.3 2B
IBM
3.0GB VRAMQ8_0
98%
SmolLM2 1.7B
HuggingFace
2.2GB VRAMQ8_0
98%
Qwen 2.5 1.5B
Alibaba
2.3GB VRAMQ8_0
98%
DeepSeek R1 Distill 1.5B
DeepSeek
2.3GB VRAMQ8_0
98%
Llama 3.2 1B Instruct
Meta
2.8GB VRAMFP16
100%
TinyLlama 1.1B
TinyLlama
1.6GB VRAMQ8_0
98%
Gemma 3 1B
1.5GB VRAMQ8_0
98%
Falcon 3 1B
TII
2.2GB VRAMQ8_0
98%
Qwen 2.5 0.5B
Alibaba
1.1GB VRAMQ8_0
98%
Danube 3 500M
H2O.ai
1.0GB VRAMQ8_0
98%
SmolLM2 360M
HuggingFace
0.9GB VRAMQ8_0
98%
SmolLM2 135M
HuggingFace
0.8GB VRAMFP16
100%
💻
Coding (8 models)
Qwen 2.5 Coder 3B
Alibaba
3.9GB VRAMQ8_0
98%
StarCoder2 3B
BigCode
3.5GB VRAMQ8_0
98%
Stable Code 3B
Stability AI
3.3GB VRAMQ8_0
98%
CodeGemma 2B
3.0GB VRAMQ8_0
98%
Qwen 2.5 Coder 1.5B
Alibaba
2.3GB VRAMQ8_0
98%
Yi Coder 1.5B
01.AI
2.0GB VRAMQ8_0
98%
DeepSeek Coder 1.3B
DeepSeek
1.8GB VRAMQ8_0
98%
Qwen 2.5 Coder 0.5B
Alibaba
1.1GB VRAMQ8_0
98%
👁
Multimodal (5 models)
🎨
Image Generation (5 models)
Stable Diffusion XL (CoreML)
Stability AI
3.3GB VRAMCoreML
100%
Stable Diffusion 2.1 Base (CoreML)
Stability AI / Apple
1.6GB VRAMCoreML-Palettized
85%
Stable Diffusion 1.5 (CoreML)
Runway
2.5GB VRAMCoreML-Palettized
90%
Stable Diffusion 1.5 (GGUF)
Runway / GPUStack
2.3GB VRAMQ8_0
95%
Stable Diffusion 2.1 (GGUF)
Stability AI
2.7GB VRAMQ8_0
95%
🎤
Speech Recognition (9 models)
Whisper Large v3
OpenAI
3.4GB VRAMQ8_0
98%
Whisper Large v3 Turbo
OpenAI
2.0GB VRAMQ8_0
95%
Whisper Medium
OpenAI
1.9GB VRAMQ8_0
92%
Distil-Whisper Large v3
HuggingFace
1.9GB VRAMQ8_0
96%
Whisper Small
OpenAI
0.9GB VRAMQ8_0
85%
Whisper Base
OpenAI
0.3GB VRAMQ8_0
80%
Whisper Base English
OpenAI
0.3GB VRAMQ8_0
82%
Whisper Tiny English (Quantized)
OpenAI
0.1GB VRAMQ5_1
65%
Whisper Tiny
OpenAI
0.2GB VRAMQ8_0
70%
🔊
Text to Speech (14 models)
Kokoro 82M TTS
Kokoro
0.6GB VRAMONNX-Q8F16
95%
Piper TTS - Amy (English)
Rhasspy
0.1GB VRAMONNX
85%
Piper TTS - Lessac (English)
Rhasspy
0.1GB VRAMONNX
85%
Piper TTS - LibriTTS-R (English)
Rhasspy
0.6GB VRAMONNX
80%
Piper TTS - Spanish (MLS)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - French (Siwis)
Rhasspy
0.5GB VRAMONNX
80%
Piper TTS - German (Thorsten)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Chinese (Huayan)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Japanese (Kokoro)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Korean
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Russian (Irina)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Portuguese (Faber)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Italian (Riccardo)
Rhasspy
0.5GB VRAMONNX
80%
Piper TTS - Arabic (Kareem)
Rhasspy
0.1GB VRAMONNX
80%
🎵
Audio Generation (1 model)
🧩
Embedding (5 models)
🔄
Reranker (2 models)
Compatible GPUs (99 with 4GB+ VRAM)
NVIDIA GeForce GTX 1050 Ti
4GB$139nvidia
NVIDIA GeForce GTX 1650
4GB$149nvidia
NVIDIA GeForce GTX 1650 SUPER
4GB$159nvidia
NVIDIA RTX 3060 Laptop
6GBnvidia
NVIDIA GeForce GTX 1660
6GB$219nvidia
NVIDIA GeForce GTX 1660 SUPER
6GB$229nvidia
NVIDIA GeForce GTX 1060 6GB
6GB$249nvidia
NVIDIA GeForce GTX 1660 Ti
6GB$279nvidia
NVIDIA GeForce RTX 2060
6GB$349nvidia
NVIDIA RTX 4070 Laptop
8GBnvidia
NVIDIA RTX 4060 Laptop
8GBnvidia
NVIDIA RTX 3070 Laptop
8GBnvidia
Apple Silicon (Unknown)
8GBapple
Intel Arc A580
8GB$179intel
NVIDIA GeForce RTX 3050
8GB$249nvidia
AMD Radeon RX 7600
8GB$269amd
Intel Arc A750
8GB$289intel
NVIDIA GeForce RTX 5060
8GB$299nvidia
NVIDIA GeForce RTX 4060
8GB$299nvidia
AMD Radeon RX 6600
8GB$329amd
NVIDIA GeForce GTX 1070
8GB$379nvidia
AMD Radeon RX 6600 XT
8GB$379amd
NVIDIA GeForce RTX 4060 Ti
8GB$399nvidia
NVIDIA GeForce RTX 3060 Ti
8GB$399nvidia
NVIDIA GeForce RTX 2060 SUPER
8GB$399nvidia
NVIDIA GeForce GTX 1070 Ti
8GB$449nvidia
NVIDIA GeForce RTX 3070
8GB$499nvidia
NVIDIA GeForce RTX 2070 SUPER
8GB$499nvidia
NVIDIA GeForce RTX 2070
8GB$499nvidia
NVIDIA GeForce RTX 3070 Ti
8GB$599nvidia
NVIDIA GeForce GTX 1080
8GB$599nvidia
NVIDIA GeForce RTX 2080 SUPER
8GB$699nvidia
NVIDIA GeForce RTX 2080
8GB$699nvidia
Intel Arc B570
10GB$219intel
NVIDIA GeForce RTX 3080
10GB$699nvidia
NVIDIA GeForce GTX 1080 Ti
11GB$699nvidia
NVIDIA GeForce RTX 2080 Ti
11GB$1,199nvidia
NVIDIA RTX 4080 Laptop
12GBnvidia
Intel Arc B580
12GB$249intel
NVIDIA GeForce RTX 3060 12GB
12GB$329nvidia
AMD Radeon RX 7700 XT
12GB$449amd
AMD Radeon RX 6700 XT
12GB$479amd
NVIDIA GeForce RTX 5070
12GB$549nvidia
NVIDIA GeForce RTX 4070 SUPER
12GB$599nvidia
NVIDIA GeForce RTX 4070
12GB$599nvidia
NVIDIA GeForce RTX 4070 Ti
12GB$799nvidia
NVIDIA GeForce RTX 3080 Ti
12GB$1,199nvidia
NVIDIA RTX 4090 Laptop
16GBnvidia
NVIDIA RTX 3080 Laptop
16GBnvidia
Apple M1
16GBapple
AMD Radeon RX 7600 XT
16GB$329amd
Intel Arc A770
16GB$349intel
NVIDIA GeForce RTX 5060 Ti
16GB$449nvidia
AMD Radeon RX 9070
16GB$449amd
NVIDIA GeForce RTX 4060 Ti 16GB
16GB$499nvidia
AMD Radeon RX 7800 XT
16GB$499amd
AMD Radeon RX 9070 XT
16GB$549amd
AMD Radeon RX 7900 GRE
16GB$549amd
AMD Radeon RX 6800
16GB$579amd
AMD Radeon RX 6800 XT
16GB$649amd
NVIDIA GeForce RTX 5070 Ti
16GB$749nvidia
NVIDIA GeForce RTX 4070 Ti SUPER
16GB$799nvidia
NVIDIA GeForce RTX 5080
16GB$999nvidia
NVIDIA GeForce RTX 4080 SUPER
16GB$999nvidia
AMD Radeon RX 6900 XT
16GB$999amd
NVIDIA RTX A4000
16GB$1,000nvidia
AMD Radeon RX 6950 XT
16GB$1,099amd
NVIDIA GeForce RTX 4080
16GB$1,199nvidia
NVIDIA T4
16GB$2,200nvidia
AMD Radeon RX 7900 XT
20GB$899amd
Apple M3
24GBapple
Apple M2
24GBapple
AMD Radeon RX 7900 XTX
24GB$999amd
NVIDIA GeForce RTX 3090
24GB$1,499nvidia
NVIDIA GeForce RTX 4090
24GB$1,599nvidia
NVIDIA GeForce RTX 3090 Ti
24GB$1,999nvidia
NVIDIA RTX A5000
24GB$2,250nvidia
NVIDIA L4
24GB$2,500nvidia
Apple M4
32GBapple
Apple M2 Pro
32GBapple
Apple M1 Pro
32GBapple
NVIDIA GeForce RTX 5090
32GB$1,999nvidia
Apple M3 Pro
36GBapple
NVIDIA A100 40GB
40GB$10,000nvidia
Apple M4 Pro
48GBapple
NVIDIA RTX A6000
48GB$4,650nvidia
NVIDIA L40S
48GB$7,500nvidia
Apple M1 Max
64GBapple
NVIDIA A100 80GB
80GB$15,000nvidia
NVIDIA H100
80GB$30,000nvidia
Apple M2 Max
96GBapple
Apple M4 Max
128GBapple
Apple M3 Max
128GBapple
Apple M1 Ultra
128GBapple
AMD Instinct MI250X
128GB$10,000amd
Apple M4 Ultra
192GBapple
Apple M3 Ultra
192GBapple
Apple M2 Ultra
192GBapple
AMD Instinct MI300X
192GB$15,000amd
Frequently Asked Questions
What is the best AI model I can run with 4GB VRAM?
The largest model you can run with 4GB VRAM is Yi 1.5 6B Chat (6B parameters) using Q4_K_M quantization. There are 74 total compatible models.
Can I run Llama with 4GB VRAM?
Yes! You can run 3 Llama models with 4GB VRAM: Llama 3.2 3B Instruct (Q8_0), Llama 3.2 1B Instruct (FP16), TinyLlama 1.1B (Q8_0).
What GPU has 4GB VRAM?
GPUs with 4GB or more VRAM include: NVIDIA GeForce GTX 1050 Ti, NVIDIA GeForce GTX 1650, NVIDIA GeForce GTX 1650 SUPER, NVIDIA RTX 3060 Laptop, NVIDIA GeForce GTX 1660, and 94 more.