~/runthismodel
daemon okbuild 5a3c91d00:00:00Z
./models/browse/hunyuan-video
Tencent · video-gen
HunyuanVideo 13B
Tencent's flagship open video model. 720p quality but requires 60+ GB VRAM (or aggressive offloading + tiling).
13b paramshunyuan-dittencent-hunyuan6060 GB vram
about·model card

HunyuanVideo is Tencent's answer to closed-source video generation. The output quality rivals commercial tools but the hardware bar is steep — comfortable inference wants an A100 80 GB or H100. Aggressive Q8 quants and tiled VAE bring this down to ~24 GB at the cost of quality.

probe://hardware·which quants fit your rig
we auto-detect via WebGL/WebGPU. select manually if your GPU isn't recognized.
./quantizations·1 variants
QuantizationBitsFile SizeVRAM NeededRAM NeededQuality
FP161626 GB60 GB64 GB
100%

How to run HunyuanVideo 13B

Pick a runtime — copy & paste. Commands are pre-filled with this model’s repo.

GUI. Browse → download → chat. MLX on Apple Silicon.

LM Studio home →
  1. 1

    Open LM Studio

    Go to the 🔍 Search tab.

  2. 2

    Search for

    tencent/HunyuanVideo
  3. 3

    Download

    Pick the FP16 quant — best balance of size vs. quality.

  4. 4

    Chat

    Hit ▶ Load Model and start chatting. Toggle 'Local Server' to expose an OpenAI-compatible API on :1234.

Community benchmarks

Real seconds-per-image reports from people running HunyuanVideo 13B on actual hardware.

No community runs yet for this model. Be the first to submit your numbers.

faq·common questions
how much VRAM do I need to run HunyuanVideo 13B?

HunyuanVideo 13B requires 60 GB VRAM minimum with FP16 quantization. For full precision you need 60 GB.

which quant should I pick?

Q4_K_M is the best quality/VRAM balance — ~92% of FP16 quality at ~25% the footprint. Q8_0 is near-lossless if you have the headroom.

faq://ai-curated·20 entries
What GPU do I need to run HunyuanVideo 13B?

To run HunyuanVideo 13B, you need a GPU with at least 60 GB of VRAM. NVIDIA A100 or V100 are recommended for optimal performance.

Is HunyuanVideo 13B good for coding?

HunyuanVideo 13B is primarily designed for video generation and may not be the best choice for coding tasks. Consider models like Codex or CodeLlama for coding-specific applications.

HunyuanVideo 13B vs Llama 3.1 8B?

HunyuanVideo 13B has 13 billion parameters and focuses on video generation, while Llama 3.1 8B is smaller and more versatile, suitable for a wider range of tasks including text generation.

Can I run HunyuanVideo 13B on a Mac?

Running HunyuanVideo 13B on a Mac is challenging due to the high VRAM requirement. You would need a Mac with an external GPU setup or consider cloud-based solutions.

How much VRAM does HunyuanVideo 13B need?

HunyuanVideo 13B requires 60 GB of VRAM to run efficiently. Aggressive offloading and tiling techniques can help reduce this requirement.

Is HunyuanVideo 13B censored?

HunyuanVideo 13B is not explicitly censored, but its outputs are governed by the training data and the model's architecture, which may include content filters.

Is HunyuanVideo 13B commercial-use allowed?

HunyuanVideo 13B is licensed under the tencent-hunyuan license, which allows commercial use but includes specific terms and conditions. Review the license for details.

HunyuanVideo 13B context length?

The context length for HunyuanVideo 13B is currently unknown. For detailed information, refer to the official documentation or contact the developers.

Does HunyuanVideo 13B support function calling?

HunyuanVideo 13B does not natively support function calling as it is primarily designed for video generation tasks. Function calling is more common in text-based models.

HunyuanVideo 13B quantization options?

HunyuanVideo 13B supports various quantization options, including INT8 and FP16, to reduce memory usage and improve inference speed.

Can HunyuanVideo 13B run on CPU?

While HunyuanVideo 13B can technically run on a CPU, it is highly inefficient due to the large model size and high computational requirements. A GPU is strongly recommended.

HunyuanVideo 13B fine-tuning?

HunyuanVideo 13B can be fine-tuned for specific video generation tasks using a dataset tailored to your needs. Fine-tuning requires significant computational resources and expertise.

HunyuanVideo 13B system requirements?

To run HunyuanVideo 13B, you need a system with at least 60 GB of VRAM, a powerful CPU, and ample RAM. SSD storage is also recommended for faster data loading.

HunyuanVideo 13B performance benchmark?

HunyuanVideo 13B can generate 720p videos at a rate of approximately 5-10 frames per second on a high-end GPU like the NVIDIA A100. Performance varies based on hardware and optimization techniques.

HunyuanVideo 13B for RAG?

HunyuanVideo 13B is not designed for Retrieval-Augmented Generation (RAG). It is optimized for video generation tasks and may not perform well in RAG scenarios.

HunyuanVideo 13B for agents?

HunyuanVideo 13B can be used to generate video content for agents, but it is not designed to handle the complex interactions and decision-making required for agent behavior.

HunyuanVideo 13B for coding vs general?

HunyuanVideo 13B is specialized for video generation and is not suitable for coding or general-purpose tasks. Use it for creating high-quality video content.

HunyuanVideo 13B vs ChatGPT?

HunyuanVideo 13B is focused on video generation, while ChatGPT is a text-based conversational model. They serve different purposes and are not directly comparable.

HunyuanVideo 13B download size?

The download size of HunyuanVideo 13B is approximately 26 GB for the base model. Additional files and dependencies may increase the total size.

Best quant for HunyuanVideo 13B?

For HunyuanVideo 13B, FP16 quantization is often the best balance between performance and accuracy. INT8 can further reduce memory usage but may impact video quality.