The latest model releases, hardware launches, and software updates relevant to running AI locally.
Alibaba Cloud releases the Qwen 3.5 family spanning 0.5B to 72B parameters, setting new benchmarks for open-weight language models across reasoning, coding, and multilingual tasks.
Google DeepMind releases Gemma 4 in 2B, 4B, 9B, and 27B sizes with improved instruction following, extended context, and native tool-use capabilities.
After months of limited stock, the RTX 5090 with 32GB GDDR7 is reaching wider availability. Its massive VRAM pool makes running 70B models, at aggressive low-bit quantization, feasible on a single consumer GPU for the first time.
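A quick rule of thumb makes the 32GB claim concrete: weight memory is roughly parameters times bits per weight, with the KV cache and runtime adding a few gigabytes on top. The sketch below uses approximate bits-per-weight figures for common llama.cpp quantization levels; the exact numbers vary slightly by model architecture.

```python
def estimate_weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Rough GGUF weight size: parameters x bits per weight, in gigabytes.
    Ignores KV cache and runtime overhead, which add a few GB more."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

# A 70B model at common quantization levels (approximate bits per weight):
for name, bits in [("Q8_0", 8.5), ("Q4_K_M", 4.85), ("Q3_K_M", 3.9), ("IQ2_XS", 2.3)]:
    print(f"{name}: ~{estimate_weights_gb(70, bits):.0f} GB")
```

Only the sub-4-bit quantizations land under 32GB once overhead is counted, which is why 70B on a 5090 means aggressive quantization rather than Q4 or Q8.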
DeepSeek releases the full R1 reasoning model under MIT license with official GGUF quantizations. The distilled variants from 1.5B to 70B offer breakthrough reasoning at every hardware tier.
Meta's Llama 3.3 70B matches the original Llama 3.1 405B on key benchmarks at roughly one-sixth the size, making it runnable on high-end consumer hardware.
Stability AI and the llama.cpp community deliver GGUF-quantized Stable Diffusion 3 Medium, cutting VRAM requirements from 8GB to under 4GB and enabling image generation on budget GPUs.
New optimizations bring FLUX.1 Schnell generation time under 3 seconds on an RTX 4090, making high-quality AI image generation feel instant on consumer hardware.
OpenAI releases Whisper Large v3 Turbo, a distilled version that runs 8 times faster than Large v3 while retaining 99 percent of its accuracy across 100 languages.
Apple's M4 Ultra chip with up to 192GB of unified memory can run 405B parameter models locally. We analyze its AI inference performance and how it compares to discrete GPUs.
Our model database has grown to 109 models across 8 categories, including LLMs, image generation, speech recognition, TTS, and more. Every entry lists GGUF file sizes verified against Hugging Face.
The GGUF format has emerged as the dominant standard for distributing quantized AI models. Major model providers now publish official GGUF files, and every local inference tool supports it natively.
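One reason GGUF spread so widely is that the format is simple to inspect: per the GGUF specification, every file opens with a fixed little-endian header of the 4-byte magic "GGUF", a uint32 version, a uint64 tensor count, and a uint64 metadata key-value count. The sketch below parses that header from raw bytes, demonstrated on a synthetic header rather than a real model file.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file

def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed GGUF header: magic, version, tensor count,
    metadata key-value count (all little-endian, per the GGUF spec v3)."""
    if data[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file")
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    return {"version": version, "tensors": tensor_count, "metadata_kvs": kv_count}

# Demo on a synthetic header (version 3, 291 tensors, 24 metadata entries):
fake = GGUF_MAGIC + struct.pack("<IQQ", 3, 291, 24)
print(read_gguf_header(fake))
```

The metadata that follows the header (architecture, context length, tokenizer) is what lets inference tools load any GGUF file without side-channel configuration.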
Increased GPU supply and competition have driven cloud GPU rental prices down sharply. RTX 4090 instances now start at $0.25/hour and A100 80GB at $1.20/hour.
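At those rates, the rent-versus-buy break-even is easy to work out: divide the purchase price by the hourly rate. The sketch below assumes a roughly $1,800 RTX 4090 street price, which is an illustrative figure, not a quoted one.

```python
def breakeven_hours(purchase_price: float, hourly_rate: float) -> float:
    """Hours of cloud rental that cost as much as buying the card outright."""
    return purchase_price / hourly_rate

# Assuming an illustrative ~$1,800 RTX 4090 street price vs $0.25/hour rental:
hours = breakeven_hours(1800, 0.25)
print(f"{hours:.0f} hours (~{hours / 24:.0f} days of continuous use)")
```

Roughly 300 days of continuous use before buying wins, which is why renting now makes sense for bursty workloads even for people who run models locally day to day.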