Compatible Models
94
Largest Model
Gemma 2 9B Instruct (9.2B)
Categories
9
💬
Chat / LLM (37 models)
Gemma 2 9B Instruct
5.9GB VRAMQ4_K_M
85%
Yi 1.5 9B Chat
01.AI
5.5GB VRAMQ4_K_M
85%
DeepSeek R1 Distill 8B
DeepSeek
5.8GB VRAMQ5_K_M
90%
Llama 3.1 8B Instruct
Meta
5.8GB VRAMQ5_K_M
90%
Granite 3.3 8B
IBM
5.1GB VRAMQ4_K_M
85%
EXAONE 3.5 7.8B
LG AI
4.9GB VRAMQ4_K_M
85%
InternLM 2.5 7B
Shanghai AI Lab
4.9GB VRAMQ4_K_M
85%
Qwen 2.5 7B Instruct
Alibaba
5.3GB VRAMQ4_K_M
85%
Mistral 7B Instruct v0.3
Mistral AI
5.3GB VRAMQ5_K_M
90%
Falcon 3 7B
TII
5.0GB VRAMQ4_K_M
85%
OLMo 2 7B
Allen AI
4.7GB VRAMQ4_K_M
85%
OpenChat 3.5 7B
OpenChat
4.6GB VRAMQ4_K_M
85%
Yi 1.5 6B Chat
01.AI
3.9GB VRAMQ4_K_M
85%
Gemma 3 4B
4.3GB VRAMQ8_0
98%
Nemotron Mini 4B
NVIDIA
4.7GB VRAMQ8_0
98%
Danube 3 4B
H2O.ai
4.4GB VRAMQ8_0
98%
Phi-3.5 Mini 3.8B
Microsoft
4.3GB VRAMQ8_0
98%
Phi-4 Mini 3.8B
Microsoft
4.3GB VRAMQ8_0
98%
Llama 3.2 3B Instruct
Meta
3.7GB VRAMQ8_0
98%
Qwen 2.5 3B
Alibaba
3.9GB VRAMQ8_0
98%
Falcon 3 3B
TII
3.8GB VRAMQ8_0
98%
StableLM Zephyr 3B
Stability AI
3.3GB VRAMQ8_0
98%
Rocket 3B
Pansophic
3.3GB VRAMQ8_0
98%
Gemma 2 2B
3.1GB VRAMQ8_0
98%
EXAONE 3.5 2.4B
LG AI
3.1GB VRAMQ8_0
98%
Granite 3.3 2B
IBM
3.0GB VRAMQ8_0
98%
SmolLM2 1.7B
HuggingFace
2.2GB VRAMQ8_0
98%
Qwen 2.5 1.5B
Alibaba
2.3GB VRAMQ8_0
98%
DeepSeek R1 Distill 1.5B
DeepSeek
2.3GB VRAMQ8_0
98%
Llama 3.2 1B Instruct
Meta
2.8GB VRAMFP16
100%
TinyLlama 1.1B
TinyLlama
1.6GB VRAMQ8_0
98%
Gemma 3 1B
1.5GB VRAMQ8_0
98%
Falcon 3 1B
TII
2.2GB VRAMQ8_0
98%
Qwen 2.5 0.5B
Alibaba
1.1GB VRAMQ8_0
98%
Danube 3 500M
H2O.ai
1.0GB VRAMQ8_0
98%
SmolLM2 360M
HuggingFace
0.9GB VRAMQ8_0
98%
SmolLM2 135M
HuggingFace
0.8GB VRAMFP16
100%
💻
Coding (14 models)
Yi Coder 9B
01.AI
5.5GB VRAMQ4_K_M
85%
CodeGemma 7B
5.5GB VRAMQ4_K_M
85%
Qwen 2.5 Coder 7B
Alibaba
4.9GB VRAMQ4_K_M
85%
StarCoder2 7B
BigCode
4.7GB VRAMQ4_K_M
85%
Code Llama 7B
Meta
4.3GB VRAMQ4_K_M
85%
DeepSeek Coder 6.7B
DeepSeek
4.3GB VRAMQ4_K_M
85%
Qwen 2.5 Coder 3B
Alibaba
3.9GB VRAMQ8_0
98%
StarCoder2 3B
BigCode
3.5GB VRAMQ8_0
98%
Stable Code 3B
Stability AI
3.3GB VRAMQ8_0
98%
CodeGemma 2B
3.0GB VRAMQ8_0
98%
Qwen 2.5 Coder 1.5B
Alibaba
2.3GB VRAMQ8_0
98%
Yi Coder 1.5B
01.AI
2.0GB VRAMQ8_0
98%
DeepSeek Coder 1.3B
DeepSeek
1.8GB VRAMQ8_0
98%
Qwen 2.5 Coder 0.5B
Alibaba
1.1GB VRAMQ8_0
98%
👁
Multimodal (6 models)
🎨
Image Generation (6 models)
Stable Diffusion XL (CoreML)
Stability AI
3.3GB VRAMCoreML
100%
SDXL Turbo (GGUF)
Stability AI
5.0GB VRAMQ5_0
85%
Stable Diffusion 2.1 Base (CoreML)
Stability AI / Apple
1.6GB VRAMCoreML-Palettized
85%
Stable Diffusion 1.5 (CoreML)
Runway
2.5GB VRAMCoreML-Palettized
90%
Stable Diffusion 1.5 (GGUF)
Runway / GPUStack
2.3GB VRAMQ8_0
95%
Stable Diffusion 2.1 (GGUF)
Stability AI
2.7GB VRAMQ8_0
95%
🎤
Speech Recognition (9 models)
Whisper Large v3
OpenAI
3.4GB VRAMQ8_0
98%
Whisper Large v3 Turbo
OpenAI
2.0GB VRAMQ8_0
95%
Whisper Medium
OpenAI
1.9GB VRAMQ8_0
92%
Distil-Whisper Large v3
HuggingFace
1.9GB VRAMQ8_0
96%
Whisper Small
OpenAI
0.9GB VRAMQ8_0
85%
Whisper Base
OpenAI
0.3GB VRAMQ8_0
80%
Whisper Base English
OpenAI
0.3GB VRAMQ8_0
82%
Whisper Tiny English (Quantized)
OpenAI
0.1GB VRAMQ5_1
65%
Whisper Tiny
OpenAI
0.2GB VRAMQ8_0
70%
🔊
Text to Speech (14 models)
Kokoro 82M TTS
Kokoro
0.6GB VRAMONNX-Q8F16
95%
Piper TTS - Amy (English)
Rhasspy
0.1GB VRAMONNX
85%
Piper TTS - Lessac (English)
Rhasspy
0.1GB VRAMONNX
85%
Piper TTS - LibriTTS-R (English)
Rhasspy
0.6GB VRAMONNX
80%
Piper TTS - Spanish (MLS)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - French (Siwis)
Rhasspy
0.5GB VRAMONNX
80%
Piper TTS - German (Thorsten)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Chinese (Huayan)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Japanese (Kokoro)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Korean
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Russian (Irina)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Portuguese (Faber)
Rhasspy
0.1GB VRAMONNX
80%
Piper TTS - Italian (Riccardo)
Rhasspy
0.5GB VRAMONNX
80%
Piper TTS - Arabic (Kareem)
Rhasspy
0.1GB VRAMONNX
80%
🎵
Audio Generation (1 model)
🧩
Embedding (5 models)
🔄
Reranker (2 models)
Compatible GPUs (96 with 6GB+ VRAM)
NVIDIA RTX 3060 Laptop
6GBnvidia
NVIDIA GeForce GTX 1660
6GB$219nvidia
NVIDIA GeForce GTX 1660 SUPER
6GB$229nvidia
NVIDIA GeForce GTX 1060 6GB
6GB$249nvidia
NVIDIA GeForce GTX 1660 Ti
6GB$279nvidia
NVIDIA GeForce RTX 2060
6GB$349nvidia
NVIDIA RTX 4070 Laptop
8GBnvidia
NVIDIA RTX 4060 Laptop
8GBnvidia
NVIDIA RTX 3070 Laptop
8GBnvidia
Apple Silicon (Unknown)
8GBapple
Intel Arc A580
8GB$179intel
NVIDIA GeForce RTX 3050
8GB$249nvidia
AMD Radeon RX 7600
8GB$269amd
Intel Arc A750
8GB$289intel
NVIDIA GeForce RTX 5060
8GB$299nvidia
NVIDIA GeForce RTX 4060
8GB$299nvidia
AMD Radeon RX 6600
8GB$329amd
NVIDIA GeForce GTX 1070
8GB$379nvidia
AMD Radeon RX 6600 XT
8GB$379amd
NVIDIA GeForce RTX 4060 Ti
8GB$399nvidia
NVIDIA GeForce RTX 3060 Ti
8GB$399nvidia
NVIDIA GeForce RTX 2060 SUPER
8GB$399nvidia
NVIDIA GeForce GTX 1070 Ti
8GB$449nvidia
NVIDIA GeForce RTX 3070
8GB$499nvidia
NVIDIA GeForce RTX 2070 SUPER
8GB$499nvidia
NVIDIA GeForce RTX 2070
8GB$499nvidia
NVIDIA GeForce RTX 3070 Ti
8GB$599nvidia
NVIDIA GeForce GTX 1080
8GB$599nvidia
NVIDIA GeForce RTX 2080 SUPER
8GB$699nvidia
NVIDIA GeForce RTX 2080
8GB$699nvidia
Intel Arc B570
10GB$219intel
NVIDIA GeForce RTX 3080
10GB$699nvidia
NVIDIA GeForce GTX 1080 Ti
11GB$699nvidia
NVIDIA GeForce RTX 2080 Ti
11GB$1,199nvidia
NVIDIA RTX 4080 Laptop
12GBnvidia
Intel Arc B580
12GB$249intel
NVIDIA GeForce RTX 3060 12GB
12GB$329nvidia
AMD Radeon RX 7700 XT
12GB$449amd
AMD Radeon RX 6700 XT
12GB$479amd
NVIDIA GeForce RTX 5070
12GB$549nvidia
NVIDIA GeForce RTX 4070 SUPER
12GB$599nvidia
NVIDIA GeForce RTX 4070
12GB$599nvidia
NVIDIA GeForce RTX 4070 Ti
12GB$799nvidia
NVIDIA GeForce RTX 3080 Ti
12GB$1,199nvidia
NVIDIA RTX 4090 Laptop
16GBnvidia
NVIDIA RTX 3080 Laptop
16GBnvidia
Apple M1
16GBapple
AMD Radeon RX 7600 XT
16GB$329amd
Intel Arc A770
16GB$349intel
NVIDIA GeForce RTX 5060 Ti
16GB$449nvidia
AMD Radeon RX 9070
16GB$449amd
NVIDIA GeForce RTX 4060 Ti 16GB
16GB$499nvidia
AMD Radeon RX 7800 XT
16GB$499amd
AMD Radeon RX 9070 XT
16GB$549amd
AMD Radeon RX 7900 GRE
16GB$549amd
AMD Radeon RX 6800
16GB$579amd
AMD Radeon RX 6800 XT
16GB$649amd
NVIDIA GeForce RTX 5070 Ti
16GB$749nvidia
NVIDIA GeForce RTX 4070 Ti SUPER
16GB$799nvidia
NVIDIA GeForce RTX 5080
16GB$999nvidia
NVIDIA GeForce RTX 4080 SUPER
16GB$999nvidia
AMD Radeon RX 6900 XT
16GB$999amd
NVIDIA RTX A4000
16GB$1,000nvidia
AMD Radeon RX 6950 XT
16GB$1,099amd
NVIDIA GeForce RTX 4080
16GB$1,199nvidia
NVIDIA T4
16GB$2,200nvidia
AMD Radeon RX 7900 XT
20GB$899amd
Apple M3
24GBapple
Apple M2
24GBapple
AMD Radeon RX 7900 XTX
24GB$999amd
NVIDIA GeForce RTX 3090
24GB$1,499nvidia
NVIDIA GeForce RTX 4090
24GB$1,599nvidia
NVIDIA GeForce RTX 3090 Ti
24GB$1,999nvidia
NVIDIA RTX A5000
24GB$2,250nvidia
NVIDIA L4
24GB$2,500nvidia
Apple M4
32GBapple
Apple M2 Pro
32GBapple
Apple M1 Pro
32GBapple
NVIDIA GeForce RTX 5090
32GB$1,999nvidia
Apple M3 Pro
36GBapple
NVIDIA A100 40GB
40GB$10,000nvidia
Apple M4 Pro
48GBapple
NVIDIA RTX A6000
48GB$4,650nvidia
NVIDIA L40S
48GB$7,500nvidia
Apple M1 Max
64GBapple
NVIDIA A100 80GB
80GB$15,000nvidia
NVIDIA H100
80GB$30,000nvidia
Apple M2 Max
96GBapple
Apple M4 Max
128GBapple
Apple M3 Max
128GBapple
Apple M1 Ultra
128GBapple
AMD Instinct MI250X
128GB$10,000amd
Apple M4 Ultra
192GBapple
Apple M3 Ultra
192GBapple
Apple M2 Ultra
192GBapple
AMD Instinct MI300X
192GB$15,000amd
Frequently Asked Questions
What is the best AI model I can run with 6GB VRAM?
The largest model you can run with 6GB VRAM is Gemma 2 9B Instruct (9.2B parameters) using Q4_K_M quantization. There are 94 total compatible models.
Can I run Llama with 6GB VRAM?
Yes! You can run 5 Llama models with 6GB VRAM: Llama 3.1 8B Instruct (Q5_K_M), Code Llama 7B (Q4_K_M), Llama 3.2 3B Instruct (Q8_0), Llama 3.2 1B Instruct (FP16), TinyLlama 1.1B (Q8_0).
What GPU has 6GB VRAM?
GPUs with 6GB or more VRAM include: NVIDIA RTX 3060 Laptop, NVIDIA GeForce GTX 1660, NVIDIA GeForce GTX 1660 SUPER, NVIDIA GeForce GTX 1060 6GB, NVIDIA GeForce GTX 1660 Ti, and 91 more.