Can RTX 4060 Ti run Mistral Small 22B?

Yes — runs locally

~0 tok/sec · Cannot run — insufficient VRAM

Your VRAM

8 GB

Model size

22B

Best quant

Q4_K_M

VRAM needed

12.9 GB

The verdict

The RTX 4060 Ti (8 GB VRAM) handles Mistral Small 22B comfortably using the Q4_K_M quantization, which fits in 12.9 GB. Expected throughput is around 0 tokens/second, which feels Cannot run — insufficient VRAM in interactive use. 22B parameter model. Strong reasoning and multilingual. Needs 16GB+ RAM.

How to run it

1. Install Ollama or LM Studio.
2. Pull the Q4_K_M GGUF — best balance of quality and speed on 8 GB.
3. Start chatting. Expect ~0 tok/sec on first-token, faster after warmup.

See full Mistral Small 22B setup →

Other models that run great on RTX 4060 Ti

FAQ (20)

What GPU do I need to run Mistral Small 22B?

To run Mistral Small 22B, you need a GPU with at least 12.9 GB of VRAM, such as an NVIDIA RTX 3080 or higher.

Is Mistral Small 22B good for coding?

Mistral Small 22B is well-suited for coding tasks due to its strong reasoning capabilities and multilingual support, making it effective for code generation and documentation.

Mistral Small 22B vs Llama 3.1 8B?

Mistral Small 22B has more parameters (22B vs 8B), offering better performance in complex reasoning and multilingual tasks, but requires more VRAM and RAM.

Can I run Mistral Small 22B on a Mac?

Yes, you can run Mistral Small 22B on a Mac with a compatible GPU and sufficient VRAM, but ensure your system meets the minimum requirements of 16GB+ RAM and 12.9 GB VRAM.

How much VRAM does Mistral Small 22B need?

Mistral Small 22B requires at least 12.9 GB of VRAM, depending on the quantization level used.

Is Mistral Small 22B censored?

Mistral Small 22B is not inherently censored, but it may include content filters to prevent harmful or inappropriate output.

Is Mistral Small 22B commercial-use allowed?

Yes, Mistral Small 22B is licensed under Apache-2.0, allowing commercial use as long as you comply with the license terms.

Mistral Small 22B context length?

Mistral Small 22B supports a context length of up to 32,768 tokens, which is significantly longer than many other models.

Want personalized recommendations for your exact setup? Detect my hardware →