Can RTX 4070 Ti SUPER run Codestral 22B (abliterated)?
Yes — runs locally
~0 tok/sec · Cannot run — model too large for this GPU
The verdict
The RTX 4070 Ti SUPER (16 GB VRAM) handles Codestral 22B (abliterated) comfortably using the Q4_K_M quantization, which fits in 12.9 GB. Expected throughput is around 0 tokens/second, which feels Cannot run — model too large for this GPU in interactive use. Mistral Codestral with refusal direction ablated. Code-specialized model without the 'I can't help with that' filter. Inherits Codestral's non-commercial license.
How to run it
- 1. Install Ollama or LM Studio.
- 2. Pull the
Q4_K_MGGUF — best balance of quality and speed on 16 GB. - 3. Start chatting. Expect ~0 tok/sec on first-token, faster after warmup.
Other models that run great on RTX 4070 Ti SUPER
FAQ (20)
What GPU do I need to run Codestral 22B (abliterated)?
To run Codestral 22B (abliterated), you will need a GPU with at least 12.9 GB of VRAM for the lowest quantization level, up to 44.5 GB for the highest precision.
Is Codestral 22B (abliterated) good for coding?
Yes, Codestral 22B (abliterated) is specialized for coding tasks and can provide high-quality code generation and assistance without the 'I can't help with that' filter.
Codestral 22B (abliterated) vs Llama 3.1 8B?
Codestral 22B (abliterated) has 22 billion parameters, making it significantly larger than Llama 3.1 8B, which may result in better performance for complex tasks but requires more VRAM.
Can I run Codestral 22B (abliterated) on a Mac?
Yes, you can run Codestral 22B (abliterated) on a Mac, provided your Mac has a compatible GPU with sufficient VRAM to handle the model's requirements.
How much VRAM does Codestral 22B (abliterated) need?
Codestral 22B (abliterated) requires between 12.9 GB and 44.5 GB of VRAM, depending on the quantization level used.
Is Codestral 22B (abliterated) censored?
No, Codestral 22B (abliterated) has had its refusal direction ablated, meaning it does not include the 'I can't help with that' filter and is less likely to refuse requests.
Is Codestral 22B (abliterated) commercial-use allowed?
No, Codestral 22B (abliterated) operates under a non-commercial license, which means it cannot be used for commercial purposes.
Codestral 22B (abliterated) context length?
Codestral 22B (abliterated) supports a context length of 32,768 tokens, allowing for handling very long sequences of text.
Want personalized recommendations for your exact setup? Detect my hardware →