Browse AI Models — Hardware Requirements & Compatibility

./models·full catalog · 145 modelssidebar drives compatibility math · pick a GPU or use detected

browse

every curated GGUF in the index

sort by grade · vram · fit · throughput · downloads · params · arch

runnable135

marginal2

no-go8

./models·compatibility matrix · 145/145sorted by fit desc

model	params	vram	fit ↓	tok/s	weights	dl/mo	ctx	arch
Whisper Tiny English (Quantized)sttfeatured OpenAI · mit	0.039B	0.1GB	██████████████████98%	10080	32.2 MB	72.9k	—	whisper
all-MiniLM-L6-v2embed Sentence Transformers · apache-2.0	0.023B	0.1GB	██████████████████98%	10080	23.0 MB	224.5M	256	bert
BGE Small EN v1.5embed BAAI · mit	0.033B	0.1GB	██████████████████98%	10080	36.8 MB	53M	512	bert
Snowflake Arctic Embed Sembed Snowflake · apache-2.0	0.033B	0.1GB	██████████████████98%	10080	36.0 MB	40.2k	512	bert
Piper TTS - Amy (English)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.1 MB	0	—	piper
Piper TTS - Lessac (English)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.2 MB	0	—	piper
Piper TTS - Spanish (MLS)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.1 MB	0	—	piper
Piper TTS - German (Thorsten)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.1 MB	0	—	piper
Piper TTS - Chinese (Huayan)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.2 MB	0	—	piper
Piper TTS - Japanese (Kokoro)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.0 MB	0	—	piper
Piper TTS - Koreantts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.0 MB	0	—	piper
Piper TTS - Russian (Irina)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.2 MB	0	—	piper
Piper TTS - Portuguese (Faber)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.2 MB	0	—	piper
Piper TTS - Arabic (Kareem)tts Rhasspy · mit	0.02B	0.1GB	██████████████████98%	10080	63.2 MB	0	—	piper
Jina Reranker Tiny EN Jina AI · apache-2.0	0.033B	0.1GB	██████████████████98%	10080	67.5 MB	8k	8.2k	bert
Whisper Tinystt OpenAI · mit	0.039B	0.2GB	██████████████████98%	10080	77.7 MB	1.5M	—	whisper
Whisper Basestt OpenAI · mit	0.074B	0.3GB	██████████████████98%	10080	148.0 MB	4.6M	—	whisper
Whisper Base Englishstt OpenAI · mit	0.074B	0.3GB	██████████████████98%	10080	148.0 MB	23.8k	—	whisper
Nomic Embed Text v1.5embedfeatured Nomic AI · apache-2.0	0.137B	0.3GB	██████████████████98%	10080	146.1 MB	17.9M	8.2k	nomic-bert
Piper TTS - French (Siwis)tts Rhasspy · mit	0.02B	0.5GB	██████████████████97%	10080	28.1 MB	0	—	piper
Piper TTS - Italian (Riccardo)tts Rhasspy · mit	0.02B	0.5GB	██████████████████97%	10080	28.1 MB	0	—	piper
Piper TTS - LibriTTS-R (English)tts Rhasspy · mit	0.02B	0.6GB	██████████████████97%	10080	78.6 MB	0	—	piper
Kokoro 82M TTSttsfeatured Kokoro · apache-2.0	0.082B	0.6GB	██████████████████97%	10080	86.0 MB	517.4k	—	kokoro
SmolLM2 135M HuggingFace · apache-2.0	0.135B	0.6GB	██████████████████97%	10080	144.8 MB	1.7M	8.2k	smollm
SmolLM2 360M HuggingFace · apache-2.0	0.36B	0.8GB	██████████████████97%	5000	270.6 MB	283.9k	8.2k	smollm
MusicGen Smalltts Meta · cc-by-nc-4.0	0.3B	0.8GB	█████████████████░97%	6000	302.4 MB	197.6k	—	musicgen
Danube 3 500M H2O.ai · apache-2.0	0.5B	0.8GB	█████████████████░97%	3600	317.9 MB	31.1k	8.2k	danube
BGE Large EN v1.5embed BAAI · mit	0.335B	0.8GB	█████████████████░97%	5373	358.2 MB	13.8M	512	bert
Whisper Smallstt OpenAI · mit	0.24B	0.9GB	█████████████████░97%	7500	487.6 MB	2.4M	—	whisper
Qwen 2.5 0.5B Alibaba · apache-2.0	0.5B	1.0GB	█████████████████░97%	3600	491.4 MB	4.2M	32.8k	qwen2
TinyLlama 1.1B TinyLlama · apache-2.0	1.1B	1.1GB	█████████████████░97%	1636	668.8 MB	2M	2k	llama
Qwen 2.5 Coder 0.5Bcode Alibaba · apache-2.0	0.5B	1.1GB	█████████████████░97%	3600	675.7 MB	99.1k	32.8k	qwen2
Llama 3.2 1B Instructfeatured Meta · llama3.2	1.24B	1.3GB	█████████████████░97%	1452	807.7 MB	7.4M	131.1k	llama
Gemma 3 1Bfeatured Google · gemma	1B	1.3GB	█████████████████░97%	1800	806.1 MB	1.8M	32.8k	gemma3
Granite 3.0 1B-A400Mmoe·0.4b act IBM · apache-2.0	1.3B	1.3GB	█████████████████░97%	4500	821.8 MB	878	4.1k	granitemoe
DeepSeek Coder 1.3Bcode DeepSeek · mit	1.3B	1.3GB	█████████████████░97%	1385	873.6 MB	43.3k	16.4k	llama
Yi Coder 1.5Bcode 01.AI · apache-2.0	1.5B	1.4GB	█████████████████░97%	1200	963.7 MB	5.1k	4.1k	yi
Qwen2-VL 2Bvlm Alibaba · apache-2.0	2.2B	1.4GB	█████████████████░97%	818	986.0 MB	3.7M	32.8k	qwen2-vl
SmolLM2 1.7B HuggingFace · apache-2.0	1.7B	1.5GB	█████████████████░97%	1059	1.06 GB	163.4k	8.2k	smollm
Falcon 3 1B TII · apache-2.0	1B	1.5GB	█████████████████░97%	1800	1.06 GB	9.9k	8.2k	falcon
Moondream 2vlmfeatured Moondream · apache-2.0	1.8B	1.5GB	█████████████████░97%	1000	1.00 GB	1.9M	2k	moondream
Qwen 2.5 1.5B Alibaba · apache-2.0	1.5B	1.5GB	█████████████████░96%	1200	1.12 GB	10.7M	32.8k	qwen2
DeepSeek R1 Distill 1.5B DeepSeek · mit	1.5B	1.5GB	█████████████████░96%	1200	1.12 GB	681.8k	131.1k	qwen2
Qwen 2.5 Coder 1.5Bcode Alibaba · apache-2.0	1.5B	1.5GB	█████████████████░96%	1200	1.12 GB	748.8k	32.8k	qwen2
Stable Diffusion 2.1 Base (CoreML)imagefeatured Stability AI / Apple · creativeml-openrail-m	0.86B	1.6GB	█████████████████░96%	2093	1.14 GB	40	—	unet-diffusion
BGE Reranker v2 M3 BAAI · mit	0.568B	1.6GB	█████████████████░96%	3169	1.16 GB	14.1M	8.2k	xlm-roberta
Distil-Whisper Large v3sttfeatured HuggingFace · mit	0.76B	1.9GB	█████████████████░96%	2368	1.52 GB	869.8k	—	whisper
Whisper Mediumsttfeatured OpenAI · mit	0.77B	1.9GB	█████████████████░96%	2338	1.53 GB	475.7k	—	whisper
Granite 3.3 2B IBM · apache-2.0	2B	1.9GB	█████████████████░96%	900	1.55 GB	21.9k	8.2k	granite
Whisper Large v3 Turbosttfeatured OpenAI · mit	0.81B	2.0GB	█████████████████░96%	2222	1.62 GB	7.7M	—	whisper
CodeGemma 2Bcode Google · gemma	2B	2.0GB	█████████████████░96%	900	1.63 GB	31k	8.2k	gemma
EXAONE 3.5 2.4B LG AI · other	2.4B	2.0GB	█████████████████░96%	750	1.64 GB	63.8k	32.8k	exaone
Gemma 2 2B Google · gemma	2.6B	2.1GB	█████████████████░96%	692	1.71 GB	315.4k	8.2k	gemma2
StableLM Zephyr 3B Stability AI · other	3B	2.1GB	█████████████████░96%	600	1.71 GB	28.4k	4.1k	stablelm
Rocket 3B Pansophic · other	3B	2.1GB	█████████████████░96%	600	1.71 GB	420	4.1k	stablelm
Stable Code 3Bcode Stability AI · other	3B	2.1GB	█████████████████░96%	600	1.71 GB	2.2k	16.4k	stablelm
MiniCPM-V 2.6vlm OpenBMB · apache-2.0	2B	2.1GB	█████████████████░96%	900	1.60 GB	151.6k	2k	minicpm-v
Stable Diffusion 1.5 (GGUF)imagefeatured Runway / GPUStack · creativeml-openrail-m	0.86B	2.1GB	█████████████████░96%	2093	1.75 GB	1.3k	—	unet-diffusion
StarCoder2 3Bcode BigCode · bigcode-openrail-m	3B	2.3GB	█████████████████░96%	600	1.89 GB	123k	16.4k	starcoder
Falcon 3 3B TII · apache-2.0	3B	2.4GB	█████████████████░96%	600	2.01 GB	6.2k	8.2k	falcon
Llama 3.2 3B Instructfeatured Meta · llama3.2	3.2B	2.4GB	█████████████████░96%	562	2.02 GB	1.4M	131.1k	llama
Granite 3.0 3B-A800Mmoe·0.8b act IBM · apache-2.0	3.4B	2.4GB	█████████████████░96%	2250	2.06 GB	3.4k	4.1k	granitemoe
Qwen 2.5 3B Alibaba · apache-2.0	3B	2.5GB	█████████████████░96%	600	2.10 GB	12.7M	32.8k	qwen2
Qwen 2.5 Coder 3Bcode Alibaba · apache-2.0	3B	2.5GB	█████████████████░96%	600	2.10 GB	229.1k	32.8k	qwen2
Stable Diffusion 1.5 (CoreML)image Runway · creativeml-openrail-m	0.86B	2.5GB	█████████████████░96%	2093	1.57 GB	1.6M	—	unet-diffusion
PaliGemma 3Bvlm Google · gemma	3B	2.5GB	█████████████████░96%	600	2.00 GB	198.8k	256	paligemma
Stable Diffusion 2.1 (GGUF)image Stability AI · creativeml-openrail-m	0.86B	2.7GB	█████████████████░95%	2093	2.32 GB	—	—	unet-diffusion
Phi-3.5 Mini 3.8Bfeatured Microsoft · mit	3.8B	2.7GB	█████████████████░95%	474	2.39 GB	901.4k	131.1k	phi3
Danube 3 4B H2O.ai · apache-2.0	4B	2.7GB	█████████████████░95%	450	2.39 GB	429	8.2k	danube
Gemma 3 4Bfeatured Google · gemma	4B	2.8GB	█████████████████░95%	450	2.49 GB	1.5M	32.8k	gemma3
Phi-4 Mini 3.8Bfeatured Microsoft · mit	3.8B	2.8GB	█████████████████░95%	474	2.49 GB	1.1M	131.1k	phi4
Nemotron Mini 4B NVIDIA · other	4B	3.0GB	█████████████████░95%	450	2.70 GB	421k	8.2k	nemotron
Phi-3.5 Visionvlmfeatured Microsoft · mit	4.2B	3.2GB	█████████████████░95%	429	2.50 GB	2M	131.1k	phi3v
Stable Diffusion XL (CoreML)imagefeatured Stability AI · creativeml-openrail-m	3.5B	3.3GB	█████████████████░95%	514	3.05 GB	1.4M	—	unet-diffusion
Whisper Large v3sttfeatured OpenAI · mit	1.55B	3.4GB	█████████████████░95%	1161	3.10 GB	5.1M	—	whisper
Yi 1.5 6B Chat 01.AI · apache-2.0	6B	3.9GB	█████████████████░94%	300	3.67 GB	5.9k	4.1k	yi
DeepSeek Coder 6.7Bcode DeepSeek · mit	6.7B	4.3GB	█████████████████░94%	269	4.08 GB	143.7k	16.4k	llama
Code Llama 7Bcode Meta · llama2	7B	4.3GB	█████████████████░94%	257	4.08 GB	244.6k	16.4k	llama
OLMoE 1B-7Bfeaturedmoe·1.3b act AI2 · apache-2.0	6.9B	4.4GB	█████████████████░94%	1385	4.21 GB	37k	4.1k	olmoe
Mistral 7B Instruct v0.3featured Mistral AI · apache-2.0	7.3B	4.6GB	█████████████████░93%	247	4.37 GB	3.1M	32.8k	mistral
OpenChat 3.5 7B OpenChat · apache-2.0	7B	4.6GB	█████████████████░93%	257	4.37 GB	4.9k	8.2k	mistral
StarCoder2 7Bcode BigCode · bigcode-openrail-m	7B	4.7GB	█████████████████░93%	257	4.46 GB	12.3k	16.4k	starcoder
OLMo 2 7B Allen AI · apache-2.0	7B	4.7GB	█████████████████░93%	257	4.47 GB	49.4k	4.1k	olmo
Qwen 2.5 Coder 7Bcodefeatured Alibaba · apache-2.0	7.6B	4.9GB	█████████████████░93%	237	4.68 GB	2.1M	32.8k	qwen2
InternLM 2.5 7B Shanghai AI Lab · apache-2.0	7.7B	4.9GB	█████████████████░93%	234	4.71 GB	109.7k	32.8k	internlm2
EXAONE 3.5 7.8B LG AI · other	7.8B	4.9GB	█████████████████░93%	231	4.77 GB	139.5k	32.8k	exaone
LLaVA 1.6 7Bvlmfeatured LLaVA · apache-2.0	7B	5.0GB	█████████████████░93%	257	4.40 GB	705.7k	4.1k	llava
Falcon 3 7B TII · apache-2.0	7B	5.0GB	█████████████████░93%	257	4.40 GB	9.7k	8.2k	falcon
SDXL Turbo (GGUF)imagefeatured Stability AI · stability-community	3.5B	5.0GB	█████████████████░93%	514	3.50 GB	783.7k	—	unet-diffusion
DeepSeek R1 Distill 8Bfeatured DeepSeek · mit	8B	5.1GB	█████████████████░93%	225	4.92 GB	439k	131.1k	llama
Llama 3.1 8B Instructfeatured Meta · llama3.1	8B	5.1GB	█████████████████░93%	225	4.92 GB	9.9M	131.1k	llama
Dolphin 3.0 Llama 3.1 8B Cognitive Computations · llama3.1	8B	5.1GB	█████████████████░93%	225	4.92 GB	360.8k	131.1k	llama
NeuralDaredevil 8B (abliterated) mlabonne · llama3	8B	5.1GB	█████████████████░93%	225	4.92 GB	13.5k	8.2k	llama
Llama 3.1 8B Instruct (abliterated) mlabonne · llama3.1	8B	5.1GB	█████████████████░93%	225	4.92 GB	4.4k	131.1k	llama
Stheno L3 8B v3.2 Sao10K · cc-by-nc-4.0	8B	5.1GB	█████████████████░93%	225	4.92 GB	13.4k	8.2k	llama
Granite 3.3 8B IBM · apache-2.0	8B	5.1GB	█████████████████░93%	225	4.94 GB	62.1k	8.2k	granite
Qwen 2.5 7B Instructfeatured Alibaba · apache-2.0	7.6B	5.3GB	█████████████████░93%	237	4.70 GB	11.9M	131.1k	qwen2
Qwen3 8B Basefeatured Alibaba · apache-2.0	8B	5.3GB	█████████████████░93%	225	4.80 GB	453.7k	32.8k	qwen3
CodeGemma 7Bcode Google · gemma	8.5B	5.5GB	█████████████████░93%	212	5.33 GB	2.6k	8.2k	gemma
Yi 1.5 9B Chat 01.AI · apache-2.0	9B	5.5GB	█████████████████░93%	200	5.33 GB	18.2k	4.1k	yi
Yi Coder 9Bcode 01.AI · apache-2.0	9B	5.5GB	█████████████████░93%	200	5.33 GB	8.9k	4.1k	yi
Gemma 2 9B Instructfeatured Google · gemma	9.2B	5.9GB	█████████████████░92%	196	5.76 GB	391k	8.2k	gemma2
Stable Audio Opentts Stability AI · stability-community	1B	6.0GB	█████████████████░92%	1800	2.50 GB	43k	—	stable-audio
Falcon 3 10B TII · apache-2.0	10B	6.4GB	████████████████░░92%	180	6.29 GB	4.4k	8.2k	falcon
Solar 10.7B Upstage · apache-2.0	10.7B	6.5GB	████████████████░░91%	168	6.46 GB	52.1k	4.1k	llama
Gemma 3 MoE 9Bmoe·2.5b act Google · gemma	9B	7.0GB	████████████████░░91%	720	5.50 GB	—	8.2k	gemma3-moe
Gemma 3 12Bfeatured Google · gemma	12B	7.3GB	████████████████░░91%	150	7.30 GB	2.6M	32.8k	gemma3
Mistral Nemo 12Bfeatured Mistral AI · apache-2.0	12B	7.5GB	████████████████░░91%	150	7.48 GB	451.4k	131.1k	mistral
Magnum v4 12B Anthracite · apache-2.0	12B	7.5GB	████████████████░░91%	150	7.48 GB	686	131.1k	mistral
Rocinante 12B v1.1 TheDrummer · other	12B	7.5GB	████████████████░░91%	150	7.48 GB	811	131.1k	mistral
Mistral Nemo Base 12Bfeatured Mistral AI · apache-2.0	12B	7.7GB	████████████████░░90%	150	7.20 GB	29.7k	131.1k	mistral
Code Llama 13B Instructcode Meta · llama2	13B	7.8GB	████████████████░░90%	138	7.87 GB	2.7k	16.4k	llama
ACE-Step 1.5XLttsfeatured ACE Studio · apache-2.0	1.5B	8.0GB	████████████████░░90%	1200	3.00 GB	—	—	acestep
Qwen 2.5 14Bfeatured Alibaba · apache-2.0	14B	8.9GB	████████████████░░89%	129	8.99 GB	1.9M	131.1k	qwen2
Qwen 2.5 Coder 14Bcodefeatured Alibaba · apache-2.0	14B	8.9GB	████████████████░░89%	129	8.99 GB	3M	32.8k	qwen2
Phi-4featured Microsoft · mit	14B	8.9GB	████████████████░░89%	129	9.05 GB	814.3k	16.4k	phi3
Stable Diffusion 3 Medium (GGUF)imagefeatured Stability AI · stability-community	2.5B	9.2GB	████████████████░░89%	720	9.29 GB	3.1k	—	mmdit-diffusion
Rocinante XL 16B v1 TheDrummer · other	16B	9.6GB	████████████████░░88%	112	9.75 GB	68	131.1k	mistral
DeepSeek MoE 16Bmoe·2.8b act DeepSeek · other	16.4B	11.0GB	████████████████░░87%	643	9.50 GB	14.3k	4.1k	deepseek-moe
TRELLIS Image Largeimagefeatured Microsoft · mit	1.2B	12.0GB	███████████████░░░86%	1500	2.40 GB	1.2M	—	trellis
Mistral Small 22B Mistral AI · apache-2.0	22B	12.9GB	███████████████░░░83%	82	13.34 GB	127.5k	32.8k	mistral
Codestral 22B (abliterated)code failspy · other	22B	12.9GB	███████████████░░░83%	82	13.34 GB	7.5k	32.8k	mistral
Magnum v4 22B Anthracite · other	22B	12.9GB	███████████████░░░83%	82	13.34 GB	248	32.8k	mistral
Dolphin 3.0 R1 Mistral 24Bfeatured Cognitive Computations · apache-2.0	24B	13.8GB	██████████████░░░░80%	75	14.33 GB	686	131.1k	mistral
Cydonia 24B v4.3 TheDrummer · other	24B	13.8GB	██████████████░░░░80%	75	14.33 GB	6k	32.8k	mistral
FLUX.1 Schnell (GGUF)imagefeatured Black Forest Labs · apache-2.0	12B	14.0GB	██████████████░░░░79%	150	12.00 GB	301.4k	—	rectified-flow
FLUX.1 Dev (GGUF)image Black Forest Labs · flux-1-dev-non-commercial	12B	14.0GB	██████████████░░░░79%	150	12.00 GB	1.1M	—	rectified-flow
Dolphin Mistral 24B (Venice Edition)featured Cognitive Computations · apache-2.0	24B	14.9GB	██████████████░░░░76%	75	14.40 GB	7.8k	32.8k	mistral
Gemma 3 27Bfeatured Google · gemma	27B	15.9GB	█████████████░░░░░73%	67	16.55 GB	1.4M	32.8k	gemma3
Wan 2.2 TI2V 5Bimagefeatured Wan-AI · apache-2.0	5B	16.0GB	█████████████░░░░░73%	360	10.00 GB	8.3k	—	wan-dit
CogVideoX 5Bimage THUDM · apache-2.0	5B	16.0GB	█████████████░░░░░73%	360	10.00 GB	16.8k	—	cogvideox
Hunyuan3D 2image Tencent · tencent-hunyuan	2.5B	16.0GB	█████████████░░░░░73%	720	5.00 GB	76.1k	—	hunyuan3d
Skyfall 31B v4.2 TheDrummer · other	31B	18.2GB	████████████░░░░░░65%	58	18.98 GB	1k	131.1k	mistral
Qwen 2.5 32B Alibaba · apache-2.0	32B	19.0GB	███████████░░░░░░░62%	56	19.85 GB	1M	131.1k	qwen2
Qwen3 30B-A3Bfeaturedmoe·3.3b act Alibaba · apache-2.0	30.5B	20.0GB	██████████░░░░░░░░58%	545	18.00 GB	—	32.8k	qwen3-moe
Phi-3.5 MoEfeaturedmoe·6.6b act Microsoft · mit	41.9B	24.1GB	███████░░░░░░░░░░░38%	272	25.35 GB	123.9k	131.1k	phimoe
Mixtral 8x7B Instructfeaturedmoe·12.9b act Mistral AI · apache-2.0	46.7B	25.1GB	██████░░░░░░░░░░░░32%	134	26.44 GB	806.7k	32.8k	mixtral
Mochi 1 Previewimage Genmo · apache-2.0	10B	30.0GB	████░░░░░░░░░░░░░░21%	offload	20.00 GB	3.5k	—	asymdit
Llama 3.1 70B Instructfeatured Meta · llama3.1	70B	40.1GB	██░░░░░░░░░░░░░░░░10%	offload	42.52 GB	630.4k	131.1k	llama
Euryale L3.3 70B v2.3featured Sao10K · llama3	70B	40.1GB	██░░░░░░░░░░░░░░░░10%	offload	42.52 GB	1.4k	131.1k	llama
Llama 3.1 70B (lorablated) mlabonne · llama3.1	70B	40.1GB	██░░░░░░░░░░░░░░░░10%	offload	42.52 GB	57	131.1k	llama
Magnum v4 72B Anthracite · apache-2.0	72B	44.7GB	██░░░░░░░░░░░░░░░░9%	offload	47.42 GB	764	131.1k	qwen2
HunyuanVideo 13Bimagefeatured Tencent · tencent-hunyuan	13B	60.0GB	█░░░░░░░░░░░░░░░░░5%	offload	26.00 GB	924	—	hunyuan-dit
Qwen3 235B-A22Bmoe·22b act Alibaba · apache-2.0	235B	144.0GB	░░░░░░░░░░░░░░░░░░2%	offload	140.00 GB	—	32.8k	qwen3-moe
Mixtral 8x22B Instructmoe·39.1b act Mistral AI · apache-2.0	141B	88.0GB	░░░░░░░░░░░░░░░░░░2%	offload	85.00 GB	32.6k	65.5k	mixtral