~/runthismodel
daemon okbuild 5a3c91d00:00:00Z

Can M4 Max run TRELLIS Image Large?

S

Yes — runs locally

~102 tok/sec · Instant — feels like typing. No noticeable delay.

Your VRAM
128 GB
Model size
1.2B
Best quant
FP16
VRAM needed
12.0 GB

The verdict

The M4 Max (128 GB VRAM) handles TRELLIS Image Large comfortably using the FP16 quantization, which fits in 12.0 GB. Expected throughput is around 102 tokens/second, which feels Instant — feels like typing. No noticeable delay. in interactive use. Image-to-3D model that produces textured meshes. Runs in ~12 GB VRAM and outputs glTF.

Setup tutorial: TRELLIS Image Large on M4 Max

AI-generated, GPU-specific. Verified commands for your exact hardware.

TL;DR

Run TRELLIS Image Large on an Apple M4 Max with Grade S performance at ~265 tok/sec using the FP16 quantization. Requires 12.0GB VRAM, leaving ample headroom.

Prerequisites

Before starting, ensure you have at least 128GB of free disk space, macOS 12.3 or later, and Xcode Command Line Tools installed. You can install Xcode CLT by running `xcode-select --install` in your terminal.

Expected performance

You can expect the model to run at approximately 265 tokens per second, utilizing 12.0GB of VRAM. With 116.0GB of remaining VRAM, you can achieve a practical context window that is significantly larger than typical models, enhancing the quality and detail of the generated 3D models.

1. Install runtimeOllama (preferred on Apple Silicon)

brew install ollama
ollama init

2. Download the model

Download the FP16 quantized version of TRELLIS Image Large (2.4GB) from Hugging Face.

ollama pull JeffreyXiang/TRELLIS-image-large

3. Run it

ollama run TRELLIS-image-large
ollama interactive TRELLIS-image-large

4. Optimize for M4 Max

To optimize performance on the Apple M4 Max, use the Metal/MLX backend to leverage the GPU's 128GB of unified memory. Ensure that MPS layers are enabled to take full advantage of the GPU's capabilities. With 128GB of VRAM, you have 116GB of headroom after allocating 12.0GB for the model, allowing for large context windows and efficient processing.

Troubleshooting

The model fails to load due to insufficient VRAM.

Ensure that no other applications are using significant amounts of VRAM. Close any unnecessary apps and try running the model again.

Performance is below the expected 265 tok/sec.

Check that the Metal/MLX backend is enabled and that MPS layers are properly configured. You can also try restarting your machine to clear any background processes that might be affecting performance.

The model crashes during inference.

Increase the swap space or reduce the batch size to prevent out-of-memory errors. You can also try running the model with a different runtime, such as LM Studio, to see if the issue persists.

Alternative runtimes

While Ollama is the preferred runtime for Apple Silicon, you can also use alternatives like LM Studio, llama.cpp, or MLX. LM Studio offers a more graphical interface and is useful for users who prefer a visual setup. llama.cpp is a lightweight option for command-line users, while MLX provides advanced features for fine-tuning and optimizing models on Apple hardware. Choose the runtime based on your specific needs and preferences.

Other models that run great on M4 Max

FAQ (20)

What GPU do I need to run TRELLIS Image Large?

To run TRELLIS Image Large, you need a GPU with at least 12 GB of VRAM. NVIDIA RTX 3060 or higher is recommended.

Is TRELLIS Image Large good for coding?

TRELLIS Image Large is primarily designed for generating 3D models from images, not for coding tasks. It is not suitable for code generation or programming assistance.

TRELLIS Image Large vs Llama 3.1 8B?

TRELLIS Image Large has 1.2 billion parameters and specializes in image-to-3D conversion, while Llama 3.1 8B is a text-based model with 8 billion parameters, making it better suited for language tasks.

Can I run TRELLIS Image Large on a Mac?

Yes, you can run TRELLIS Image Large on a Mac with a compatible GPU that has at least 12 GB of VRAM, such as an AMD Radeon Pro W5700X or higher.

How much VRAM does TRELLIS Image Large need?

TRELLIS Image Large requires 12 GB of VRAM to run effectively, regardless of quantization.

Is TRELLIS Image Large censored?

TRELLIS Image Large is not inherently censored, but its outputs may be influenced by the training data and any filters applied by the user or platform.

Is TRELLIS Image Large commercial-use allowed?

Yes, TRELLIS Image Large is licensed under the MIT License, which allows for commercial use without additional restrictions.

TRELLIS Image Large context length?

The context length for TRELLIS Image Large is unknown, as it primarily focuses on image-to-3D conversion rather than text processing.

Want personalized recommendations for your exact setup? Detect my hardware →