Compatible Models: 109
Largest Model: Llama 3.1 70B Instruct (70B)
Categories: 9
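The VRAM figures in the listings below track parameter count times bits per weight fairly closely. As a rough sketch (the bits-per-weight values are our approximations for common GGUF quantization types, not figures from this page), the weight footprint can be estimated like this:

```python
# Approximate effective bits per weight for common GGUF quant types
# (assumption: real files mix tensor types and add metadata/overhead).
BITS_PER_WEIGHT = {
    "Q4_K_M": 4.8,
    "Q5_K_M": 5.7,
    "Q8_0": 8.5,
    "FP16": 16.0,
}

def estimate_vram_gb(params_billion: float, quant: str) -> float:
    """Estimate GB needed for the weights alone (KV cache and activations extra)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8

# 70B at Q5_K_M lands near the 50.0GB figure listed below for Llama 3.1 70B Instruct.
print(estimate_vram_gb(70, "Q5_K_M"))
```

This is only a lower bound: context length, batch size, and runtime overhead add several GB on top of the weight footprint.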
💬 Chat / LLM (47 models)
Llama 3.1 70B Instruct | Meta | 50.0GB VRAM | Q5_K_M | 90%
Qwen 2.5 32B | Alibaba | 19.0GB VRAM | Q4_K_M | 85%
Gemma 3 27B | Google | 15.9GB VRAM | Q4_K_M | 85%
Mistral Small 22B | Mistral AI | 12.9GB VRAM | Q4_K_M | 85%
Phi-4 | Microsoft | 15.0GB VRAM | Q8_0 | 98%
Qwen 2.5 14B | Alibaba | 15.1GB VRAM | Q8_0 | 98%
Gemma 3 12B | Google | 12.2GB VRAM | Q8_0 | 98%
Mistral Nemo 12B | Mistral AI | 12.6GB VRAM | Q8_0 | 98%
Solar 10.7B | Upstage | 11.1GB VRAM | Q8_0 | 98%
Falcon 3 10B | TII | 10.7GB VRAM | Q8_0 | 98%
Gemma 2 9B Instruct | Google | 9.7GB VRAM | Q8_0 | 98%
Yi 1.5 9B Chat | 01.AI | 9.2GB VRAM | Q8_0 | 98%
DeepSeek R1 Distill 8B | DeepSeek | 8.4GB VRAM | Q8_0 | 98%
Llama 3.1 8B Instruct | Meta | 17.0GB VRAM | FP16 | 100%
Granite 3.3 8B | IBM | 8.6GB VRAM | Q8_0 | 98%
EXAONE 3.5 7.8B | LG AI | 8.2GB VRAM | Q8_0 | 98%
InternLM 2.5 7B | Shanghai AI Lab | 8.2GB VRAM | Q8_0 | 98%
Qwen 2.5 7B Instruct | Alibaba | 9.0GB VRAM | Q8_0 | 98%
Mistral 7B Instruct v0.3 | Mistral AI | 15.5GB VRAM | FP16 | 100%
Falcon 3 7B | TII | 8.3GB VRAM | Q8_0 | 98%
OLMo 2 7B | Allen AI | 7.7GB VRAM | Q8_0 | 98%
OpenChat 3.5 7B | OpenChat | 7.7GB VRAM | Q8_0 | 98%
Yi 1.5 6B Chat | 01.AI | 6.5GB VRAM | Q8_0 | 98%
Gemma 3 4B | Google | 4.3GB VRAM | Q8_0 | 98%
Nemotron Mini 4B | NVIDIA | 4.7GB VRAM | Q8_0 | 98%
Danube 3 4B | H2O.ai | 4.4GB VRAM | Q8_0 | 98%
Phi-3.5 Mini 3.8B | Microsoft | 4.3GB VRAM | Q8_0 | 98%
Phi-4 Mini 3.8B | Microsoft | 4.3GB VRAM | Q8_0 | 98%
Llama 3.2 3B Instruct | Meta | 3.7GB VRAM | Q8_0 | 98%
Qwen 2.5 3B | Alibaba | 3.9GB VRAM | Q8_0 | 98%
Falcon 3 3B | TII | 3.8GB VRAM | Q8_0 | 98%
StableLM Zephyr 3B | Stability AI | 3.3GB VRAM | Q8_0 | 98%
Rocket 3B | Pansophic | 3.3GB VRAM | Q8_0 | 98%
Gemma 2 2B | Google | 3.1GB VRAM | Q8_0 | 98%
EXAONE 3.5 2.4B | LG AI | 3.1GB VRAM | Q8_0 | 98%
Granite 3.3 2B | IBM | 3.0GB VRAM | Q8_0 | 98%
SmolLM2 1.7B | HuggingFace | 2.2GB VRAM | Q8_0 | 98%
Qwen 2.5 1.5B | Alibaba | 2.3GB VRAM | Q8_0 | 98%
DeepSeek R1 Distill 1.5B | DeepSeek | 2.3GB VRAM | Q8_0 | 98%
Llama 3.2 1B Instruct | Meta | 2.8GB VRAM | FP16 | 100%
TinyLlama 1.1B | TinyLlama | 1.6GB VRAM | Q8_0 | 98%
Gemma 3 1B | Google | 1.5GB VRAM | Q8_0 | 98%
Falcon 3 1B | TII | 2.2GB VRAM | Q8_0 | 98%
Qwen 2.5 0.5B | Alibaba | 1.1GB VRAM | Q8_0 | 98%
Danube 3 500M | H2O.ai | 1.0GB VRAM | Q8_0 | 98%
SmolLM2 360M | HuggingFace | 0.9GB VRAM | Q8_0 | 98%
SmolLM2 135M | HuggingFace | 0.8GB VRAM | FP16 | 100%
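Filtering a list like the one above against a specific VRAM budget is a one-liner. A minimal sketch (the `Model` class and the four sample entries are ours, with VRAM values copied from the listing):

```python
from dataclasses import dataclass

@dataclass
class Model:
    name: str
    vram_gb: float
    quant: str

# A handful of entries copied from the Chat / LLM list above.
CHAT_MODELS = [
    Model("Llama 3.1 70B Instruct", 50.0, "Q5_K_M"),
    Model("Qwen 2.5 32B", 19.0, "Q4_K_M"),
    Model("Llama 3.1 8B Instruct", 17.0, "FP16"),
    Model("Qwen 2.5 0.5B", 1.1, "Q8_0"),
]

def models_that_fit(budget_gb: float, models: list[Model]) -> list[Model]:
    """Return models whose listed VRAM requirement fits the budget, largest first."""
    return sorted(
        (m for m in models if m.vram_gb <= budget_gb),
        key=lambda m: m.vram_gb,
        reverse=True,
    )

for m in models_that_fit(24.0, CHAT_MODELS):
    print(f"{m.name}: {m.vram_gb}GB ({m.quant})")
```

With a 24GB budget, Qwen 2.5 32B at Q4_K_M is the largest of these sample entries that fits; the 70B model does not.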
💻 Coding (16 models)
Qwen 2.5 Coder 14B | Alibaba | 15.1GB VRAM | Q8_0 | 98%
Code Llama 13B Instruct | Meta | 7.8GB VRAM | Q4_K_M | 85%
Yi Coder 9B | 01.AI | 9.2GB VRAM | Q8_0 | 98%
CodeGemma 7B | Google | 8.9GB VRAM | Q8_0 | 98%
Qwen 2.5 Coder 7B | Alibaba | 8.0GB VRAM | Q8_0 | 98%
StarCoder2 7B | BigCode | 7.6GB VRAM | Q8_0 | 98%
Code Llama 7B | Meta | 7.2GB VRAM | Q8_0 | 98%
DeepSeek Coder 6.7B | DeepSeek | 7.2GB VRAM | Q8_0 | 98%
Qwen 2.5 Coder 3B | Alibaba | 3.9GB VRAM | Q8_0 | 98%
StarCoder2 3B | BigCode | 3.5GB VRAM | Q8_0 | 98%
Stable Code 3B | Stability AI | 3.3GB VRAM | Q8_0 | 98%
CodeGemma 2B | Google | 3.0GB VRAM | Q8_0 | 98%
Qwen 2.5 Coder 1.5B | Alibaba | 2.3GB VRAM | Q8_0 | 98%
Yi Coder 1.5B | 01.AI | 2.0GB VRAM | Q8_0 | 98%
DeepSeek Coder 1.3B | DeepSeek | 1.8GB VRAM | Q8_0 | 98%
Qwen 2.5 Coder 0.5B | Alibaba | 1.1GB VRAM | Q8_0 | 98%
👁 Multimodal (6 models)
🎨 Image Generation (9 models)
FLUX.1 Schnell (GGUF) | Black Forest Labs | 14.0GB VRAM | Q5_0 | 90%
FLUX.1 Dev (GGUF) | Black Forest Labs | 14.0GB VRAM | Q5_0 | 100%
Stable Diffusion XL (CoreML) | Stability AI | 3.3GB VRAM | CoreML | 100%
SDXL Turbo (GGUF) | Stability AI | 5.0GB VRAM | Q5_0 | 85%
Stable Diffusion 3 Medium (GGUF) | Stability AI | 9.2GB VRAM | Q8_0 | 95%
Stable Diffusion 2.1 Base (CoreML) | Stability AI / Apple | 1.6GB VRAM | CoreML-Palettized | 85%
Stable Diffusion 1.5 (CoreML) | Runway | 2.5GB VRAM | CoreML-Palettized | 90%
Stable Diffusion 1.5 (GGUF) | Runway / GPUStack | 2.3GB VRAM | Q8_0 | 95%
Stable Diffusion 2.1 (GGUF) | Stability AI | 2.7GB VRAM | Q8_0 | 95%
🎤 Speech Recognition (9 models)
Whisper Large v3 | OpenAI | 3.4GB VRAM | Q8_0 | 98%
Whisper Large v3 Turbo | OpenAI | 2.0GB VRAM | Q8_0 | 95%
Whisper Medium | OpenAI | 1.9GB VRAM | Q8_0 | 92%
Distil-Whisper Large v3 | HuggingFace | 1.9GB VRAM | Q8_0 | 96%
Whisper Small | OpenAI | 0.9GB VRAM | Q8_0 | 85%
Whisper Base | OpenAI | 0.3GB VRAM | Q8_0 | 80%
Whisper Base English | OpenAI | 0.3GB VRAM | Q8_0 | 82%
Whisper Tiny English (Quantized) | OpenAI | 0.1GB VRAM | Q5_1 | 65%
Whisper Tiny | OpenAI | 0.2GB VRAM | Q8_0 | 70%
🔊 Text to Speech (14 models)
Kokoro 82M TTS | Kokoro | 0.6GB VRAM | ONNX-Q8F16 | 95%
Piper TTS - Amy (English) | Rhasspy | 0.1GB VRAM | ONNX | 85%
Piper TTS - Lessac (English) | Rhasspy | 0.1GB VRAM | ONNX | 85%
Piper TTS - LibriTTS-R (English) | Rhasspy | 0.6GB VRAM | ONNX | 80%
Piper TTS - Spanish (MLS) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - French (Siwis) | Rhasspy | 0.5GB VRAM | ONNX | 80%
Piper TTS - German (Thorsten) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Chinese (Huayan) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Japanese (Kokoro) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Korean | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Russian (Irina) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Portuguese (Faber) | Rhasspy | 0.1GB VRAM | ONNX | 80%
Piper TTS - Italian (Riccardo) | Rhasspy | 0.5GB VRAM | ONNX | 80%
Piper TTS - Arabic (Kareem) | Rhasspy | 0.1GB VRAM | ONNX | 80%
🎵 Audio Generation (1 model)
🧩 Embedding (5 models)
🔄 Reranker (2 models)
Compatible GPUs (12 with 64GB+ VRAM)
Apple M1 Max | 64GB
NVIDIA A100 80GB | 80GB | $15,000
NVIDIA H100 | 80GB | $30,000
Apple M2 Max | 96GB
Apple M4 Max | 128GB
Apple M3 Max | 128GB
Apple M1 Ultra | 128GB
AMD Instinct MI250X | 128GB | $10,000
Apple M4 Ultra | 192GB
Apple M3 Ultra | 192GB
Apple M2 Ultra | 192GB
AMD Instinct MI300X | 192GB | $15,000
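When price matters, the same list can be queried for the cheapest card that clears a VRAM floor. A small sketch (the entries are copied from the GPU list above; the helper and the use of `None` for unpriced Apple parts are our own conventions):

```python
# Entries from the GPU list above. Apple parts carry no price in the listing,
# so their price field is None (assumption: treat them as unpriced).
GPUS = [
    ("Apple M1 Max", 64, None),
    ("NVIDIA A100 80GB", 80, 15_000),
    ("NVIDIA H100", 80, 30_000),
    ("AMD Instinct MI250X", 128, 10_000),
    ("AMD Instinct MI300X", 192, 15_000),
]

def cheapest_with_vram(min_gb: int):
    """Cheapest priced GPU meeting a minimum VRAM requirement, or None."""
    priced = [g for g in GPUS if g[1] >= min_gb and g[2] is not None]
    return min(priced, key=lambda g: g[2], default=None)

print(cheapest_with_vram(128))
```

Among the priced entries here, the MI250X is the cheapest option at 128GB, despite the MI300X offering more memory.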
Frequently Asked Questions
What is the best AI model I can run with 64GB VRAM?
The largest model you can run with 64GB of VRAM is Llama 3.1 70B Instruct (70B parameters) at Q5_K_M quantization. In total, 109 models are compatible.
Can I run Llama with 64GB VRAM?
Yes. Seven Llama-family models run within 64GB of VRAM: Llama 3.1 70B Instruct (Q5_K_M), Code Llama 13B Instruct (Q4_K_M), Llama 3.1 8B Instruct (FP16), Code Llama 7B (Q8_0), Llama 3.2 3B Instruct (Q8_0), Llama 3.2 1B Instruct (FP16), and TinyLlama 1.1B (Q8_0).
What GPU has 64GB VRAM?
GPUs with 64GB or more of VRAM include the Apple M1 Max, NVIDIA A100 80GB, NVIDIA H100, Apple M2 Max, Apple M4 Max, and 7 more listed above.