Back to RunThisModel

AI Models You Can Run with 10GB VRAM

103 models compatible across 9 categories

Compatible Models
103
Largest Model
Phi-4 (14B)
Categories
9
💬

Chat / LLM (43 models)

Phi-4

Microsoft

14B
8.9GB VRAMQ4_K_M
85%

Qwen 2.5 14B

Alibaba

14B
8.9GB VRAMQ4_K_M
85%

Gemma 3 12B

Google

12B
7.3GB VRAMQ4_K_M
85%

Mistral Nemo 12B

Mistral AI

12B
7.5GB VRAMQ4_K_M
85%

Solar 10.7B

Upstage

10.7B
6.5GB VRAMQ4_K_M
85%

Falcon 3 10B

TII

10B
6.4GB VRAMQ4_K_M
85%

Gemma 2 9B Instruct

Google

9.2B
9.7GB VRAMQ8_0
98%

Yi 1.5 9B Chat

01.AI

9B
9.2GB VRAMQ8_0
98%

DeepSeek R1 Distill 8B

DeepSeek

8B
8.4GB VRAMQ8_0
98%

Llama 3.1 8B Instruct

Meta

8B
8.4GB VRAMQ8_0
98%

Granite 3.3 8B

IBM

8B
8.6GB VRAMQ8_0
98%

EXAONE 3.5 7.8B

LG AI

7.8B
8.2GB VRAMQ8_0
98%

InternLM 2.5 7B

Shanghai AI Lab

7.7B
8.2GB VRAMQ8_0
98%

Qwen 2.5 7B Instruct

Alibaba

7.6B
9.0GB VRAMQ8_0
98%

Mistral 7B Instruct v0.3

Mistral AI

7.3B
7.7GB VRAMQ8_0
98%

Falcon 3 7B

TII

7B
8.3GB VRAMQ8_0
98%

OLMo 2 7B

Allen AI

7B
7.7GB VRAMQ8_0
98%

OpenChat 3.5 7B

OpenChat

7B
7.7GB VRAMQ8_0
98%

Yi 1.5 6B Chat

01.AI

6B
6.5GB VRAMQ8_0
98%

Gemma 3 4B

Google

4B
4.3GB VRAMQ8_0
98%

Nemotron Mini 4B

NVIDIA

4B
4.7GB VRAMQ8_0
98%

Danube 3 4B

H2O.ai

4B
4.4GB VRAMQ8_0
98%

Phi-3.5 Mini 3.8B

Microsoft

3.8B
4.3GB VRAMQ8_0
98%

Phi-4 Mini 3.8B

Microsoft

3.8B
4.3GB VRAMQ8_0
98%

Llama 3.2 3B Instruct

Meta

3.2B
3.7GB VRAMQ8_0
98%

Qwen 2.5 3B

Alibaba

3B
3.9GB VRAMQ8_0
98%

Falcon 3 3B

TII

3B
3.8GB VRAMQ8_0
98%

StableLM Zephyr 3B

Stability AI

3B
3.3GB VRAMQ8_0
98%

Rocket 3B

Pansophic

3B
3.3GB VRAMQ8_0
98%

Gemma 2 2B

Google

2.6B
3.1GB VRAMQ8_0
98%

EXAONE 3.5 2.4B

LG AI

2.4B
3.1GB VRAMQ8_0
98%

Granite 3.3 2B

IBM

2B
3.0GB VRAMQ8_0
98%

SmolLM2 1.7B

HuggingFace

1.7B
2.2GB VRAMQ8_0
98%

Qwen 2.5 1.5B

Alibaba

1.5B
2.3GB VRAMQ8_0
98%

DeepSeek R1 Distill 1.5B

DeepSeek

1.5B
2.3GB VRAMQ8_0
98%

Llama 3.2 1B Instruct

Meta

1.24B
2.8GB VRAMFP16
100%

TinyLlama 1.1B

TinyLlama

1.1B
1.6GB VRAMQ8_0
98%

Gemma 3 1B

Google

1B
1.5GB VRAMQ8_0
98%

Falcon 3 1B

TII

1B
2.2GB VRAMQ8_0
98%

Qwen 2.5 0.5B

Alibaba

0.5B
1.1GB VRAMQ8_0
98%

Danube 3 500M

H2O.ai

0.5B
1.0GB VRAMQ8_0
98%

SmolLM2 360M

HuggingFace

0.36B
0.9GB VRAMQ8_0
98%

SmolLM2 135M

HuggingFace

0.135B
0.8GB VRAMFP16
100%
💻

Coding (16 models)

👁

Multimodal (6 models)

🎨

Image Generation (7 models)

🎤

Speech Recognition (9 models)

🔊

Text to Speech (14 models)

🎵

Audio Generation (1 model)

🧩

Embedding (5 models)

🔄

Reranker (2 models)

Compatible GPUs (66 with 10GB+ VRAM)

Intel Arc B570

10GB$219intel

NVIDIA GeForce RTX 3080

10GB$699nvidia

NVIDIA GeForce GTX 1080 Ti

11GB$699nvidia

NVIDIA GeForce RTX 2080 Ti

11GB$1,199nvidia

NVIDIA RTX 4080 Laptop

12GBnvidia

Intel Arc B580

12GB$249intel

NVIDIA GeForce RTX 3060 12GB

12GB$329nvidia

AMD Radeon RX 7700 XT

12GB$449amd

AMD Radeon RX 6700 XT

12GB$479amd

NVIDIA GeForce RTX 5070

12GB$549nvidia

NVIDIA GeForce RTX 4070 SUPER

12GB$599nvidia

NVIDIA GeForce RTX 4070

12GB$599nvidia

NVIDIA GeForce RTX 4070 Ti

12GB$799nvidia

NVIDIA GeForce RTX 3080 Ti

12GB$1,199nvidia

NVIDIA RTX 4090 Laptop

16GBnvidia

NVIDIA RTX 3080 Laptop

16GBnvidia

Apple M1

16GBapple

AMD Radeon RX 7600 XT

16GB$329amd

Intel Arc A770

16GB$349intel

NVIDIA GeForce RTX 5060 Ti

16GB$449nvidia

AMD Radeon RX 9070

16GB$449amd

NVIDIA GeForce RTX 4060 Ti 16GB

16GB$499nvidia

AMD Radeon RX 7800 XT

16GB$499amd

AMD Radeon RX 9070 XT

16GB$549amd

AMD Radeon RX 7900 GRE

16GB$549amd

AMD Radeon RX 6800

16GB$579amd

AMD Radeon RX 6800 XT

16GB$649amd

NVIDIA GeForce RTX 5070 Ti

16GB$749nvidia

NVIDIA GeForce RTX 4070 Ti SUPER

16GB$799nvidia

NVIDIA GeForce RTX 5080

16GB$999nvidia

NVIDIA GeForce RTX 4080 SUPER

16GB$999nvidia

AMD Radeon RX 6900 XT

16GB$999amd

NVIDIA RTX A4000

16GB$1,000nvidia

AMD Radeon RX 6950 XT

16GB$1,099amd

NVIDIA GeForce RTX 4080

16GB$1,199nvidia

NVIDIA T4

16GB$2,200nvidia

AMD Radeon RX 7900 XT

20GB$899amd

Apple M3

24GBapple

Apple M2

24GBapple

AMD Radeon RX 7900 XTX

24GB$999amd

NVIDIA GeForce RTX 3090

24GB$1,499nvidia

NVIDIA GeForce RTX 4090

24GB$1,599nvidia

NVIDIA GeForce RTX 3090 Ti

24GB$1,999nvidia

NVIDIA RTX A5000

24GB$2,250nvidia

NVIDIA L4

24GB$2,500nvidia

Apple M4

32GBapple

Apple M2 Pro

32GBapple

Apple M1 Pro

32GBapple

NVIDIA GeForce RTX 5090

32GB$1,999nvidia

Apple M3 Pro

36GBapple

NVIDIA A100 40GB

40GB$10,000nvidia

Apple M4 Pro

48GBapple

NVIDIA RTX A6000

48GB$4,650nvidia

NVIDIA L40S

48GB$7,500nvidia

Apple M1 Max

64GBapple

NVIDIA A100 80GB

80GB$15,000nvidia

NVIDIA H100

80GB$30,000nvidia

Apple M2 Max

96GBapple

Apple M4 Max

128GBapple

Apple M3 Max

128GBapple

Apple M1 Ultra

128GBapple

AMD Instinct MI250X

128GB$10,000amd

Apple M4 Ultra

192GBapple

Apple M3 Ultra

192GBapple

Apple M2 Ultra

192GBapple

AMD Instinct MI300X

192GB$15,000amd

Frequently Asked Questions

What is the best AI model I can run with 10GB VRAM?
The largest model you can run with 10GB VRAM is Phi-4 (14B parameters) using Q4_K_M quantization. There are 103 total compatible models.
Can I run Llama with 10GB VRAM?
Yes! You can run 6 Llama models with 10GB VRAM: Code Llama 13B Instruct (Q4_K_M), Llama 3.1 8B Instruct (Q8_0), Code Llama 7B (Q8_0), Llama 3.2 3B Instruct (Q8_0), Llama 3.2 1B Instruct (FP16), TinyLlama 1.1B (Q8_0).
What GPU has 10GB VRAM?
GPUs with 10GB or more VRAM include: Intel Arc B570, NVIDIA GeForce RTX 3080, NVIDIA GeForce GTX 1080 Ti, NVIDIA GeForce RTX 2080 Ti, NVIDIA RTX 4080 Laptop, and 61 more.

Other VRAM Tiers