AI Models
186 models · 50 new in 60d
- ▾Gemma 4 31B DenseOpen
Google · 256K tokens · self-host
Best for: Self-hosted multimodal production, commercial use, multilingual apps
How: Dense 31B — fits on a single A100 or 2x RTX 4090. Apache 2.0 = fully commercial. Supports images and video natively.
Example: Deploy as a private multimodal assistant that reads screenshots, logs, and video clips.
LMSYS Arena #3 textMMLU ~82%multimodalimages + video35+ languagesApache 2.0dense architectureHardware to self-hostVRAM: 20GB (quantized) / 62GB (FP16)GPU: 1× A100 80GB or 2× RTX 4090 24GBRAM: 32GB+ system RAM31B dense. Native multimodal (images + video) increases compute cost vs text-only.
API: Ollama, vLLM, Hugging Face, Vertex AI. ollama run gemma4:31b
Brand new (Apr 2026). Ranked #3 on LMSYS Arena text leaderboard at launch.
- ▾Gemma 4 27B MoEOpen
Google · 128K tokens · self-host
Best for: Faster self-hosted inference, cost-efficient multimodal
How: MoE variant — faster inference than the 31B dense. Same multimodal capabilities.
Example: Process image-based monitoring alerts faster than the dense variant at the same quality.
LMSYS Arena #6 textMoE efficiencymultimodalimages + videoApache 2.0Hardware to self-hostVRAM: 18GB (quantized) / 54GB (FP16)GPU: RTX 4090 24GB or 1× A100 40GBRAM: 32GB+ system RAM27B total MoE — faster inference than the 31B dense thanks to sparse activations.
API: Ollama, vLLM, Hugging Face. ollama run gemma4:27b-moe
- ▾Gemma 4 E4BOpen
Google · 128K tokens · self-host
Best for: Edge, mobile, IoT, on-device AI with multimodal input
How: 4B params — runs on any hardware. Supports images, video, AND native audio input.
Example: Run on a Raspberry Pi to process security camera feeds with voice commands.
tinyon-devicemultimodal + audioApache 2.0Hardware to self-hostVRAM: 3GB (quantized) / 8GB (FP16)GPU: Any — CPU, phone, Jetson, Raspberry Pi 5, integrated GPURAM: 4-8GB system RAM4B params. Edge-first design: runs on phones, SBCs, IoT devices.
API: Ollama, Hugging Face. Runs on phones and Raspberry Pi.
- ▾DeepSeek V3.2Open
DeepSeek · 164K tokens · self-host
Best for: Long-context coding, upgraded V3 deployments
How: Drop-in upgrade from V3. Uses Dynamic Sparse Attention for better long-context performance.
Example: Feed your entire microservice codebase and get cross-service dependency analysis.
HumanEval 94.0%codingmathsparse attention (DSA)MIT licenseimproved contextHardware to self-hostVRAM: 350GB (quantized)GPU: 8× H100 80GBRAM: 512GB+ system RAMSame hardware footprint as V3 — 671B with sparse attention.
API: api.deepseek.com OR self-host via vLLM. Same OpenAI-compatible API.
- ▾Mistral Large 3Open
Mistral · 256K tokens · self-host
Best for: European deployments, agent workflows, long-context multilingual apps
How: Major upgrade from Large 2. MoE architecture with 41B active params. Same API, just change model ID.
Example: Build a multi-tool agent that queries DBs, calls APIs, and generates reports in 30+ languages.
MoE 41B active / 675B totalmultilingualfunction calling256K contextHardware to self-hostVRAM: 350GB (quantized)GPU: 8× H100 80GBRAM: 512GB+ system RAM675B MoE (41B active). Datacenter class — most users go via api.mistral.ai.
API: api.mistral.ai OR self-host via vLLM. OpenAI-compatible.
- ▾Ministral 3 (3B/8B/14B)Open
Mistral · 128K tokens · self-host
Best for: Edge deployment, on-device AI, lightweight vision tasks
How: 3B fits on phones, 8B on laptops, 14B on dev GPUs. All have vision support.
Example: Run 8B on a Jetson to classify manufacturing defects from camera feeds.
edge-friendlyvisiondense3 sizesHardware to self-hostVRAM: 2GB (3B) / 6GB (8B) / 10GB (14B quantized)GPU: Phone/CPU (3B) · Laptop GPU (8B) · RTX 3060+ (14B)RAM: 8-16GB system RAMAll three sizes are dense with vision. 3B runs on phones, 8B on laptops, 14B on dev GPUs.
API: Ollama, vLLM, Hugging Face. Also on Mistral API.
- ▾Wan 2.2Open
Alibaba · N/A · self-host
Best for: Best-in-class open-source video. The 5B variant runs on a single 24GB consumer card.
How: ComfyUI nodes ship official support. Or `python generate.py --task t2v-A14B --prompt '...'` from the WanX repo.
Example: python generate.py --task t2v-A14B --prompt 'a corgi running on the beach at sunset' --resolution 720P
MoE video architectureopen weightsT2V + I2V5B small variant for consumer GPUs720p outputHardware to self-hostVRAM: 24GB (5B variant) / 80GB (A14B)GPU: RTX 4090 for 5B · H100 for A14BRAM: 32–64GB system RAM5B model fits a single 4090. A14B MoE delivers Sora-class quality but needs an H100 or 2× 4090 with offload.
API: huggingface.co/Wan-AI/Wan2.2-T2V-A14B · Wan-AI/Wan2.2-T2V-5B
- ▾Llama 4 MaverickOpen
Meta · 1M tokens · self-host
Best for: Self-hosted production deployments, privacy-sensitive workloads
How: ollama run llama4-maverick OR deploy on vLLM with tensor parallelism. Also available hosted on Together/Groq.
Example: Deploy on 2x A100 GPUs behind your API gateway for private code review.
MMLU 88.4%HumanEval 84.8%multilingualmultimodalMoE architecture17B active / 400B totalHardware to self-hostVRAM: 200GB (quantized)GPU: 2× H100 80GB or 4× A100 80GBRAM: 256GB system RAM400B total params (17B active). FP16 needs ~800GB, FP8 ~400GB, INT4 ~200GB.
API: Self-host via vLLM, Ollama, or use via Together, Fireworks, Groq
- ▾Llama 4 ScoutOpen
Meta · 10M tokens · self-host
Best for: Processing entire codebases, very long documents, single-GPU deployments
How: Fits on a single H100. Best open model for extreme context lengths.
Example: Feed your entire monorepo into context and ask about cross-service dependencies.
MMLU 86.2%longest context (10M)MoE 17B active / 109B totalfits single H100Hardware to self-hostVRAM: 80GBGPU: 1× H100 80GBRAM: 128GB system RAM17B active params, fits in a single H100 at FP8.
API: Same as Maverick — vLLM, Ollama, Together, Fireworks
- ▾Qwen 3 235BOpen
Alibaba · 128K tokens · self-host
Best for: Flexible thinking control, commercial self-hosting, multilingual
How: Supports /think and /no_think tags to toggle reasoning on/off per request. Apache 2.0 = fully commercial.
Example: Use /no_think for fast classification, /think for complex debugging — same model.
AIME 2024 85.7%HumanEval 90.2%hybrid thinkingMoE 22B activeApache 2.0multilingualHardware to self-hostVRAM: 140GB (quantized)GPU: 4× A100 80GB or 2× H100RAM: 256GB+ system RAM235B total (22B active). MoE architecture — only 22B params active per forward pass.
API: Self-host via vLLM/SGLang or use via Together, Fireworks. Also on Alibaba Cloud.
- ▾Qwen 3 30BOpen
Alibaba · 128K tokens · self-host
Best for: Local development, laptop-friendly reasoning, privacy
How: Excellent for local dev. MoE means only 3B params active — fast on consumer hardware.
Example: Run on your dev machine as a private coding assistant with reasoning.
AIME 2024 66.7%MoE 3B active / 30B totalruns on consumer GPUhybrid thinkingHardware to self-hostVRAM: 20GB (quantized) / 60GB (FP16)GPU: RTX 4090 24GB (quantized) or 1× A100RAM: 32GB+ system RAM30B total (3B active). The 3B active params make inference fast on consumer hardware.
API: ollama run qwen3:30b — fits on RTX 4090 (24GB)
- ▾Gemma 3 27BOpen
Google · 128K tokens · self-host
Best for: On-device/edge deployment, multimodal at small scale
How: ollama run gemma3:27b. Fits on RTX 3090/4090. Good multimodal + tool use at small size.
Example: Run on a dev server to process screenshots and generate bug reports.
MMLU 75.6%HumanEval 78.0%compactmultimodalruns on single GPUfunction callingHardware to self-hostVRAM: 18GB (quantized) / 54GB (FP16)GPU: RTX 3090/4090 24GB or 1× A100 40GBRAM: 32GB+ system RAM27B dense. Fits on a single high-end consumer GPU with quantization.
API: Ollama, vLLM, Hugging Face. Also on Vertex AI.
- ▾Nomic Embed Text v2-MoEOpen
Nomic AI · 8K tokens · self-host
Best for: Self-hosted RAG, privacy-first search, zero-cost embeddings
How: Self-host for zero cost. Comparable quality to OpenAI embeddings.
Example: Run alongside pgvector on the same server — full RAG pipeline with zero API costs.
MoE embeddingmatryoshkaApache 2.0self-hostableHardware to self-hostVRAM: 2GB or CPU-onlyGPU: Any — runs on CPU at reasonable speedRAM: 4-8GB system RAMTiny MoE embedding model. CPU inference is fast enough for most use cases.
API: pip install nomic OR Ollama. Also hosted on Nomic Atlas.
- ▾Step-Video-T2VOpen
StepFun · N/A · self-host
Best for: Highest-quality open-source video model when you have the hardware to run it.
How: Clone stepfun-ai/Step-Video-T2V repo, install requirements, run sample.py with your prompt.
Example: python sample_video.py --prompt 'underwater coral reef, schools of fish' --num-frames 204 --resolution 544x992
30B paramsMIT licensecompetitive with Sora-class qualitydeep compression video VAEHardware to self-hostVRAM: 80GB (FP16) / 40GB (FP8)GPU: H100 80GB · or 2× A100 40GB · or A100 40GB with FP8RAM: 128GB system RAM30B is genuinely heavy but the MIT license + quality tradeoff is uniquely permissive in the open-source video space.
API: huggingface.co/stepfun-ai/stepvideo-t2v
- ▾DeepSeek R1Open
DeepSeek · 128K tokens · self-host
Best for: Budget reasoning, self-hosted chain-of-thought, research
How: API is OpenAI-compatible. Self-host the 70B distill on 2x A100. MIT license = no restrictions.
Example: Run the 14B distill locally for debugging complex distributed system issues.
AIME 2024 79.8%SWE-bench 49.2%GPQA Diamond 71.5%reasoningmathcodingMIT licensedistillableHardware to self-hostVRAM: 10GB (14B distill) / 48GB (70B distill) / 1TB+ (full 671B)GPU: RTX 4090 (14B) · 2× A100 (70B) · 8× H100 (full)RAM: Full model needs 256GB+ system RAMFull 671B MoE is massive. Distilled versions (14B, 32B, 70B) are far more practical.
API: api.deepseek.com ($0.55/M in, $2.19/M out) OR self-host via vLLM/Ollama
- ▾Codestral 25.01Open
Mistral · 256K tokens · self-host
Best for: Code completion, inline suggestions, editor integration
How: Supports FIM for inline completion. Integrate with any editor via LSP or Continue.dev.
Example: Deploy as your team's FIM-capable completion server behind an LSP proxy.
HumanEval 91.0%code completionFIM (fill-in-middle)80+ languagesHardware to self-hostVRAM: 16GB (quantized) / 45GB (FP16)GPU: RTX 4090 24GB or 1× A100 40GBRAM: 32GB+ system RAM22B dense. Fits on a single consumer GPU with quantization.
API: codestral.mistral.ai — dedicated code endpoint
- ▾Llama 3.3 70BOpen
Meta · 128K tokens · self-host
Best for: Proven workhorse for self-hosted deployments, fine-tuning base
How: ollama run llama3.3:70b. For production: vLLM on 2x A100 or 4x A10G.
Example: Fine-tune on your internal docs for a private knowledge base chatbot.
MMLU 86.0%HumanEval 88.4%mature ecosystemfine-tuning friendlywide hardware supportHardware to self-hostVRAM: 40GB (4-bit) / 140GB (FP16)GPU: 2× A100 80GB or 4× A10G 24GBRAM: 64GB+ system RAM70B dense. Widely supported — runs on Ollama with quantization on 48GB VRAM.
API: Ollama, vLLM, TGI, or hosted (Together $0.60/M, Groq, Fireworks)
- ▾DeepSeek V3Open
DeepSeek · 128K tokens · self-host
Best for: Cost-sensitive production APIs, coding tasks, math-heavy pipelines
How: Cheapest top-tier API. OpenAI-compatible. Self-host needs 8x A100.
Example: Replace GPT-4 in your CI pipeline for automated code review at 1/10th the cost.
HumanEval 92.1%MMLU 88.5%codingmathMoE 37B active / 671B totalMIT licenseHardware to self-hostVRAM: 350GB (quantized) / 1.3TB (FP16)GPU: 8× H100 80GB or 8× A100 80GBRAM: 512GB+ system RAM671B total (37B active). Most users rent via API — self-hosting needs datacenter hardware.
API: api.deepseek.com ($0.27/M in, $1.10/M out) OR self-host
- ▾Phi-4Open
Microsoft · 16K tokens · self-host
Best for: Edge deployment, STEM tasks, embedded AI in products
How: ollama run phi4. MIT license — embed in commercial products freely.
Example: Embed in a CI pipeline to validate config files and Terraform plans.
GPQA Diamond 56.2%MATH 80.4%14B paramsSTEM reasoningMIT licenseruns on laptopHardware to self-hostVRAM: 9GB (quantized) / 28GB (FP16)GPU: Any 8GB+ GPU (RTX 3060, laptop 4050, etc.)RAM: 16GB system RAM14B dense. Runs locally on most developer laptops with quantization.
API: Ollama, Hugging Face, Azure AI
- ▾HunyuanVideoOpen
Tencent · N/A · self-host
Best for: Self-hosted video generation, research, building custom pipelines.
How: Clone repo, install diffusers, run sample.py with your prompt. Or use ComfyUI workflows.
Example: python sample_video.py --prompt 'a cat surfing at sunset' --video-length 129 --infer-steps 50
fully open weights13B paramscompetitive qualityfine-tunableHardware to self-hostVRAM: 60GBGPU: H100 80GB or 2× RTX 4090 (with offload)RAM: 64GB system RAMQuantized to 8-bit fits on a single A100 40GB. Comfy workflows offload UNet to CPU at the cost of speed.
API: huggingface.co/tencent/HunyuanVideo
- ▾Qwen 2.5 Coder 32BOpen
Alibaba · 128K tokens · self-host
Best for: Private code completion, self-hosted Copilot replacement
How: ollama run qwen2.5-coder:32b. Plug into Continue.dev or Copilot alternatives.
Example: Set up as your team's private code completion backend — zero data leaves your infra.
HumanEval 92.7%LiveCodeBench 48.5%code completioncode generationApache 2.0Hardware to self-hostVRAM: 20GB (quantized) / 64GB (FP16)GPU: RTX 4090 24GB or 1× A100 40GBRAM: 32GB+ system RAM32B dense. Fits on a single consumer GPU with 4-bit quantization.
API: Ollama, vLLM, or hosted on Together/Fireworks
- ▾LTX-VideoOpen
Lightricks · N/A · self-host
Best for: When you need quick turnaround — prototypes, drafts, dataset generation. Speed-first open-source video.
How: diffusers LTXPipeline. Generates a 5-second 768×512 clip in seconds on an H100; a few minutes on a 4090.
Example: from diffusers import LTXPipeline; pipe = LTXPipeline.from_pretrained('Lightricks/LTX-Video'); pipe(prompt='falling autumn leaves').frames
real-time generation on H1002B params (small footprint)fast iteration loop13B variant for higher qualityHardware to self-hostVRAM: 12GB (2B) / 24GB (13B)GPU: RTX 3090 / 4090 for 2B · H100 for 13BRAM: 32GB system RAMSmallest/fastest of the open-source video models — great for iterating on prompts before committing GPU time to bigger models.
API: huggingface.co/Lightricks/LTX-Video · Lightricks/LTX-Video-13B
- ▾Mochi 1Open
Genmo · N/A · self-host
Best for: Open-source video where commercial use matters — the Apache 2.0 license is unrestricted.
How: Diffusers pipeline or the official genmoai/models repo. ComfyUI workflows are well-documented.
Example: python -m mochi_preview.cli --prompt 'time lapse of a city street at golden hour' --num-frames 84
Apache 2.0 (commercial-friendly)10B paramshigh motion fidelityactive community fine-tunesHardware to self-hostVRAM: 60GB (full precision) / 24GB with quantizationGPU: H100 80GB · or RTX 4090 with FP8 quant + offloadRAM: 64GB system RAMMemory-hungry at full precision but FP8 / GGUF quants from the community fit a single 4090.
API: huggingface.co/genmo/mochi-1-preview
- ▾CogVideoX-5BOpen
THUDM / Zhipu AI · N/A · self-host
Best for: Pioneer open-source T2V — solid baseline for self-hosted experimentation and fine-tuning.
How: pip install diffusers; CogVideoXPipeline.from_pretrained('THUDM/CogVideoX-5b') and run with a text prompt.
Example: from diffusers import CogVideoXPipeline; pipe = CogVideoXPipeline.from_pretrained('THUDM/CogVideoX-5b'); pipe(prompt='a panda playing piano').frames
fully open weights5B paramsfits on a single 24GB carddiffusers integrationHardware to self-hostVRAM: 18GB (with CPU offload) / 24GB nativeGPU: RTX 4090 24GBRAM: 32GB system RAMQuantized + offload tricks let it run on 12GB. Slower than newer entries but the most fine-tuned-on open video model.
API: huggingface.co/THUDM/CogVideoX-5b
- ▾FastContext 1.0 4B SFTNewOpen
microsoft · self-host
Best for: Trending on HuggingFace (114 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("microsoft/FastContext-1.0-4B-SFT")
transformerssafetensorsqwen3text-generationExplorer SubAgentAPI: huggingface.co/microsoft/FastContext-1.0-4B-SFT
Auto-discovered from HuggingFace trending. 114 likes, 13 downloads.
- ▾MiMo V2.5 Pro FP4 DFlashNewOpen
XiaomiMiMo · self-host
Best for: Trending on HuggingFace (115 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash")
transformerssafetensorsmimo_v2text-generationagentAPI: huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash
Auto-discovered from HuggingFace trending. 115 likes, 4K downloads.
- ▾Gemma 4 12B Coder Fable5 Composer2.5 V1 GGUFNewOpen
yuxinlu1 · self-host
Best for: Trending on HuggingFace (736 likes this week)
How: Available on Hugging Face. 20K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF")
ggufgemma4codingcodereasoningAPI: huggingface.co/yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF
Auto-discovered from HuggingFace trending. 736 likes, 20K downloads.
- ▾Rio 3.5 Open 397BNewOpen
prefeitura-rio · self-host
Best for: Trending on HuggingFace (304 likes this week)
How: Available on Hugging Face. 189K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("prefeitura-rio/Rio-3.5-Open-397B")
transformerssafetensorsqwen3_5_moeimage-text-to-textconversationalAPI: huggingface.co/prefeitura-rio/Rio-3.5-Open-397B
Auto-discovered from HuggingFace trending. 304 likes, 189K downloads.
- ▾Qwopus3.6 27B Coder MTP GGUFNewOpen
Jackrong · self-host
Best for: Trending on HuggingFace (203 likes this week)
How: Available on Hugging Face. 62K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus3.6-27B-Coder-MTP-GGUF")
transformersggufllama.cppimage-text-to-textvisionAPI: huggingface.co/Jackrong/Qwopus3.6-27B-Coder-MTP-GGUF
Auto-discovered from HuggingFace trending. 203 likes, 62K downloads.
- ▾SCAIL 2NewOpen
zai-org · self-host
Best for: Trending on HuggingFace (191 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("zai-org/SCAIL-2")
diffuserscharacter-animationvideo-generationpose-drivendiffusionAPI: huggingface.co/zai-org/SCAIL-2
Auto-discovered from HuggingFace trending. 191 likes, 0 downloads.
- ▾MiniMax M3NewOpen
MiniMaxAI · self-host
Best for: Trending on HuggingFace (857 likes this week)
How: Available on Hugging Face. 14K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("MiniMaxAI/MiniMax-M3")
transformerssafetensorsminimax_m3_vlimage-text-to-textmultimodalAPI: huggingface.co/MiniMaxAI/MiniMax-M3
Auto-discovered from HuggingFace trending. 857 likes, 14K downloads.
- ▾Kimi K2.7 CodeNewOpen
moonshotai · self-host
Best for: Trending on HuggingFace (756 likes this week)
How: Available on Hugging Face. 57K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("moonshotai/Kimi-K2.7-Code")
transformerssafetensorskimi_k25image-feature-extractioncompressed-tensorsAPI: huggingface.co/moonshotai/Kimi-K2.7-Code
Auto-discovered from HuggingFace trending. 756 likes, 57K downloads.
- ▾Gemma 4 26B A4B It Qat GGUFNewOpen
unsloth · self-host
Best for: Trending on HuggingFace (144 likes this week)
How: Available on Hugging Face. 129K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/gemma-4-26B-A4B-it-qat-GGUF")
transformersggufgemma4image-text-to-textunslothAPI: huggingface.co/unsloth/gemma-4-26B-A4B-it-qat-GGUF
Auto-discovered from HuggingFace trending. 144 likes, 129K downloads.
- ▾Diffusiongemma 26B A4B It GGUFNewOpen
unsloth · self-host
Best for: Trending on HuggingFace (277 likes this week)
How: Available on Hugging Face. 107K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/diffusiongemma-26B-A4B-it-GGUF")
ggufgemma4unslothgemmagoogleAPI: huggingface.co/unsloth/diffusiongemma-26B-A4B-it-GGUF
Auto-discovered from HuggingFace trending. 277 likes, 107K downloads.
- ▾Nex N2 MiniNewOpen
nex-agi · self-host
Best for: Trending on HuggingFace (220 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nex-agi/Nex-N2-mini")
transformerssafetensorsqwen3_5_moeimage-text-to-texttext-generationAPI: huggingface.co/nex-agi/Nex-N2-mini
Auto-discovered from HuggingFace trending. 220 likes, 8K downloads.
- ▾Diffusiongemma 26B A4B ItNewOpen
google · self-host
Best for: Trending on HuggingFace (895 likes this week)
How: Available on Hugging Face. 312K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("google/diffusiongemma-26B-A4B-it")
transformerssafetensorsdiffusion_gemmaimage-text-to-textconversationalAPI: huggingface.co/google/diffusiongemma-26B-A4B-it
Auto-discovered from HuggingFace trending. 895 likes, 312K downloads.
- ▾Gemma 4 12B OBLITERATEDNewOpen
OBLITERATUS · self-host
Best for: Trending on HuggingFace (326 likes this week)
How: Available on Hugging Face. 71K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("OBLITERATUS/Gemma-4-12B-OBLITERATED")
transformerssafetensorsggufgemma4_unifiedimage-text-to-textAPI: huggingface.co/OBLITERATUS/Gemma-4-12B-OBLITERATED
Auto-discovered from HuggingFace trending. 326 likes, 71K downloads.
- ▾Nex N2 ProNewOpen
nex-agi · self-host
Best for: Trending on HuggingFace (288 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nex-agi/Nex-N2-Pro")
transformerssafetensorsqwen3_5_moeimage-text-to-texttext-generationAPI: huggingface.co/nex-agi/Nex-N2-Pro
Auto-discovered from HuggingFace trending. 288 likes, 4K downloads.
- ▾North Mini Code 1.0NewOpen
CohereLabs · self-host
Best for: Trending on HuggingFace (394 likes this week)
How: Available on Hugging Face. 11K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("CohereLabs/North-Mini-Code-1.0")
transformerssafetensorscohere2_moetext-generationconversationalAPI: huggingface.co/CohereLabs/North-Mini-Code-1.0
Auto-discovered from HuggingFace trending. 394 likes, 11K downloads.
- ▾NVIDIA Nemotron 3 Ultra 550B A55B NVFP4NewOpen
nvidia · self-host
Best for: Trending on HuggingFace (160 likes this week)
How: Available on Hugging Face. 91K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4")
transformerssafetensorsnemotron_htext-generationnvidiaAPI: huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4
Auto-discovered from HuggingFace trending. 160 likes, 91K downloads.
- ▾MisoTTSNewOpen
MisoLabs · self-host
Best for: Trending on HuggingFace (188 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("MisoLabs/MisoTTS")
pytorchsafetensorstext-to-speechspeech-synthesisvoiceAPI: huggingface.co/MisoLabs/MisoTTS
Auto-discovered from HuggingFace trending. 188 likes, 0 downloads.
- ▾NVIDIA Nemotron 3 Ultra 550B A55B BF16NewOpen
nvidia · self-host
Best for: Trending on HuggingFace (189 likes this week)
How: Available on Hugging Face. 59K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16")
transformerssafetensorsnemotron_htext-generationnvidiaAPI: huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
Auto-discovered from HuggingFace trending. 189 likes, 59K downloads.
- ▾Higgs Audio V3 Tts 4bNewOpen
bosonai · self-host
Best for: Trending on HuggingFace (446 likes this week)
How: Available on Hugging Face. 38K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("bosonai/higgs-audio-v3-tts-4b")
transformerssafetensorshiggs_multimodal_qwen3text-generationtext-to-speechAPI: huggingface.co/bosonai/higgs-audio-v3-tts-4b
Auto-discovered from HuggingFace trending. 446 likes, 38K downloads.
- ▾Nemotron 3.5 Asr Streaming 0.6bNewOpen
nvidia · self-host
Best for: Trending on HuggingFace (424 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nvidia/nemotron-3.5-asr-streaming-0.6b")
nemospeech-recognitioncache-aware ASRautomatic-speech-recognitionstreaming-asrAPI: huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b
Auto-discovered from HuggingFace trending. 424 likes, 5K downloads.
- ▾Ideogram 4 Nf4NewOpen
ideogram-ai · self-host
Best for: Trending on HuggingFace (334 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ideogram-ai/ideogram-4-nf4")
diffuserssafetensorstext-to-imageimage-generationdiffusionAPI: huggingface.co/ideogram-ai/ideogram-4-nf4
Auto-discovered from HuggingFace trending. 334 likes, 3K downloads.
- ▾Ideogram 4 Fp8NewOpen
ideogram-ai · self-host
Best for: Trending on HuggingFace (548 likes this week)
How: Available on Hugging Face. 11K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ideogram-ai/ideogram-4-fp8")
diffuserssafetensorstext-to-imageimage-generationdiffusionAPI: huggingface.co/ideogram-ai/ideogram-4-fp8
Auto-discovered from HuggingFace trending. 548 likes, 11K downloads.
- ▾Gemma 4 12b It GGUFNewOpen
unsloth · self-host
Best for: Trending on HuggingFace (599 likes this week)
How: Available on Hugging Face. 926K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/gemma-4-12b-it-GGUF")
ggufgemma4unslothgemmagoogleAPI: huggingface.co/unsloth/gemma-4-12b-it-GGUF
Auto-discovered from HuggingFace trending. 599 likes, 926K downloads.
- ▾Mellum2 12B A2.5B ThinkingNewOpen
JetBrains · self-host
Best for: Trending on HuggingFace (274 likes this week)
How: Available on Hugging Face. 18K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("JetBrains/Mellum2-12B-A2.5B-Thinking")
transformerssafetensorsmellumtext-generationconversationalAPI: huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking
Auto-discovered from HuggingFace trending. 274 likes, 18K downloads.
- ▾Bonsai Image Ternary 4B Gemlite 2bitNewOpen
prism-ml · self-host
Best for: Trending on HuggingFace (92 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("prism-ml/bonsai-image-ternary-4B-gemlite-2bit")
diffuserssafetensorsternary1.58-bitgemliteAPI: huggingface.co/prism-ml/bonsai-image-ternary-4B-gemlite-2bit
Auto-discovered from HuggingFace trending. 92 likes, 0 downloads.
- ▾MOSS TTS V1.5NewOpen
OpenMOSS-Team · self-host
Best for: Trending on HuggingFace (95 likes this week)
How: Available on Hugging Face. 19K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("OpenMOSS-Team/MOSS-TTS-v1.5")
safetensorsmoss_tts_delaytext-to-speechcustom_codezhAPI: huggingface.co/OpenMOSS-Team/MOSS-TTS-v1.5
Auto-discovered from HuggingFace trending. 95 likes, 19K downloads.
- ▾Qwen3.6 35B A3B NVFP4NewOpen
nvidia · self-host
Best for: Trending on HuggingFace (193 likes this week)
How: Available on Hugging Face. 822K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nvidia/Qwen3.6-35B-A3B-NVFP4")
Model Optimizersafetensorsqwen3_5_moenvidiaModelOptAPI: huggingface.co/nvidia/Qwen3.6-35B-A3B-NVFP4
Auto-discovered from HuggingFace trending. 193 likes, 822K downloads.
- ▾PaddleOCR VL 1.6NewOpen
PaddlePaddle · self-host
Best for: Trending on HuggingFace (269 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("PaddlePaddle/PaddleOCR-VL-1.6")
PaddleOCRsafetensorspaddleocr_vlERNIE4.5PaddlePaddleAPI: huggingface.co/PaddlePaddle/PaddleOCR-VL-1.6
Auto-discovered from HuggingFace trending. 269 likes, 9K downloads.
- ▾LFM2.5 8B A1B GGUFNewOpen
LiquidAI · self-host
Best for: Trending on HuggingFace (177 likes this week)
How: Available on Hugging Face. 87K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("LiquidAI/LFM2.5-8B-A1B-GGUF")
ggufliquidlfm2edgellama.cppAPI: huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF
Auto-discovered from HuggingFace trending. 177 likes, 87K downloads.
- ▾Qwen3.6 27B OBLITERATEDNewOpen
OBLITERATUS · self-host
Best for: Trending on HuggingFace (120 likes this week)
How: Available on Hugging Face. 17K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("OBLITERATUS/Qwen3.6-27B-OBLITERATED")
transformerssafetensorsggufqwen3_5_texttext-generationAPI: huggingface.co/OBLITERATUS/Qwen3.6-27B-OBLITERATED
Auto-discovered from HuggingFace trending. 120 likes, 17K downloads.
- ▾Step 3.7 FlashNewOpen
stepfun-ai · self-host
Best for: Trending on HuggingFace (359 likes this week)
How: Available on Hugging Face. 47K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("stepfun-ai/Step-3.7-Flash")
transformerssafetensorsstep3p7text-generationvision-languageAPI: huggingface.co/stepfun-ai/Step-3.7-Flash
Auto-discovered from HuggingFace trending. 359 likes, 47K downloads.
- ▾LFM2.5 8B A1BNewOpen
LiquidAI · self-host
Best for: Trending on HuggingFace (551 likes this week)
How: Available on Hugging Face. 135K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("LiquidAI/LFM2.5-8B-A1B")
transformerssafetensorslfm2_moetext-generationliquidAPI: huggingface.co/LiquidAI/LFM2.5-8B-A1B
Auto-discovered from HuggingFace trending. 551 likes, 135K downloads.
- ▾LocateAnything 3BOpen
nvidia · self-host
Best for: Trending on HuggingFace (2063 likes this week)
How: Available on Hugging Face. 87K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("nvidia/LocateAnything-3B")
transformerssafetensorslocateanythingimage-feature-extractionnvidiaAPI: huggingface.co/nvidia/LocateAnything-3B
Auto-discovered from HuggingFace trending. 2063 likes, 87K downloads.
- ▾Lens TurboNewOpen
microsoft · self-host
Best for: Trending on HuggingFace (125 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("microsoft/Lens-Turbo")
diffuserssafetensorstext-to-imageenarxiv:2605.21573API: huggingface.co/microsoft/Lens-Turbo
Auto-discovered from HuggingFace trending. 125 likes, 1K downloads.
- ▾LensNewOpen
microsoft · self-host
Best for: Trending on HuggingFace (138 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("microsoft/Lens")
diffuserssafetensorstext-to-imageenarxiv:2605.21573API: huggingface.co/microsoft/Lens
Auto-discovered from HuggingFace trending. 138 likes, 1K downloads.
- ▾Qwopus3.6 27B V2 MTP GGUFNewOpen
Jackrong · self-host
Best for: Trending on HuggingFace (178 likes this week)
How: Available on Hugging Face. 125K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus3.6-27B-v2-MTP-GGUF")
transformersggufllama.cppimage-text-to-textvisionAPI: huggingface.co/Jackrong/Qwopus3.6-27B-v2-MTP-GGUF
Auto-discovered from HuggingFace trending. 178 likes, 125K downloads.
- ▾MiniCPM5 1BNewOpen
openbmb · self-host
Best for: Trending on HuggingFace (776 likes this week)
How: Available on Hugging Face. 101K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("openbmb/MiniCPM5-1B")
transformerssafetensorsllamatext-generationminicpmAPI: huggingface.co/openbmb/MiniCPM5-1B
Auto-discovered from HuggingFace trending. 776 likes, 101K downloads.
- ▾Qwopus3.6 27B V2 GGUFNewOpen
Jackrong · self-host
Best for: Trending on HuggingFace (183 likes this week)
How: Available on Hugging Face. 29K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus3.6-27B-v2-GGUF")
transformersggufllama.cppimage-text-to-textvisionAPI: huggingface.co/Jackrong/Qwopus3.6-27B-v2-GGUF
Auto-discovered from HuggingFace trending. 183 likes, 29K downloads.
- ▾SANA WM_bidirectionalNewOpen
Efficient-Large-Model · self-host
Best for: Trending on HuggingFace (86 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Efficient-Large-Model/SANA-WM_bidirectional")
diffuserssafetensorstext-to-videoimage-to-videocamera-controlAPI: huggingface.co/Efficient-Large-Model/SANA-WM_bidirectional
Auto-discovered from HuggingFace trending. 86 likes, 0 downloads.
- ▾Command A Plus 05 2026 Bf16NewOpen
CohereLabs · self-host
Best for: Trending on HuggingFace (126 likes this week)
How: Available on Hugging Face. 14K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("CohereLabs/command-a-plus-05-2026-bf16")
transformerssafetensorscohere2_visionimage-text-to-textconversationalAPI: huggingface.co/CohereLabs/command-a-plus-05-2026-bf16
Auto-discovered from HuggingFace trending. 126 likes, 14K downloads.
- ▾Intern S2 PreviewNewOpen
internlm · self-host
Best for: Trending on HuggingFace (86 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("internlm/Intern-S2-Preview")
transformerssafetensorsintern_s2_previewimage-text-to-textconversationalAPI: huggingface.co/internlm/Intern-S2-Preview
Auto-discovered from HuggingFace trending. 86 likes, 2K downloads.
- ▾Command A Plus 05 2026 W4a4NewOpen
CohereLabs · self-host
Best for: Trending on HuggingFace (213 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("CohereLabs/command-a-plus-05-2026-w4a4")
transformerssafetensorscohere2_visionimage-text-to-textconversationalAPI: huggingface.co/CohereLabs/command-a-plus-05-2026-w4a4
Auto-discovered from HuggingFace trending. 213 likes, 8K downloads.
- ▾Ring 2.6 1TNewOpen
inclusionAI · self-host
Best for: Trending on HuggingFace (89 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("inclusionAI/Ring-2.6-1T")
transformerssafetensorsbailing_hybridtext-generationconversationalAPI: huggingface.co/inclusionAI/Ring-2.6-1T
Auto-discovered from HuggingFace trending. 89 likes, 3K downloads.
- ▾HRM Text 1BNewOpen
sapientinc · self-host
Best for: Trending on HuggingFace (751 likes this week)
How: Available on Hugging Face. 135K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("sapientinc/HRM-Text-1B")
transformerssafetensorshrm_texttext-generationhrmAPI: huggingface.co/sapientinc/HRM-Text-1B
Auto-discovered from HuggingFace trending. 751 likes, 135K downloads.
- ▾Qwopus3.5 9B Coder GGUFNewOpen
Jackrong · self-host
Best for: Trending on HuggingFace (181 likes this week)
How: Available on Hugging Face. 39K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus3.5-9B-Coder-GGUF")
transformersgguftext-generation-inferenceunslothqwen3_5API: huggingface.co/Jackrong/Qwopus3.5-9B-Coder-GGUF
Auto-discovered from HuggingFace trending. 181 likes, 39K downloads.
- ▾Qwen3.6 35B A3B MTP GGUFNewOpen
unsloth · self-host
Best for: Trending on HuggingFace (393 likes this week)
How: Available on Hugging Face. 628K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/Qwen3.6-35B-A3B-MTP-GGUF")
transformersggufunslothqwenqwen3_5_moeAPI: huggingface.co/unsloth/Qwen3.6-35B-A3B-MTP-GGUF
Auto-discovered from HuggingFace trending. 393 likes, 628K downloads.
- ▾Qwen3.6 27B MTP GGUFNewOpen
unsloth · self-host
Best for: Trending on HuggingFace (613 likes this week)
How: Available on Hugging Face. 983K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/Qwen3.6-27B-MTP-GGUF")
transformersggufunslothqwenqwen3_5API: huggingface.co/unsloth/Qwen3.6-27B-MTP-GGUF
Auto-discovered from HuggingFace trending. 613 likes, 983K downloads.
- ▾Qwen3.6 27B MTP UD GGUFNewOpen
havenoammo · self-host
Best for: Trending on HuggingFace (87 likes this week)
How: Available on Hugging Face. 43K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("havenoammo/Qwen3.6-27B-MTP-UD-GGUF")
transformersggufunslothqwenqwen3_5API: huggingface.co/havenoammo/Qwen3.6-27B-MTP-UD-GGUF
Auto-discovered from HuggingFace trending. 87 likes, 43K downloads.
- ▾Supertonic 3NewOpen
Supertone · self-host
Best for: Trending on HuggingFace (771 likes this week)
How: Available on Hugging Face. 58K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Supertone/supertonic-3")
supertoniconnxtext-to-speechspeech-synthesisttsAPI: huggingface.co/Supertone/supertonic-3
Auto-discovered from HuggingFace trending. 771 likes, 58K downloads.
- ▾Qwopus3.6 35B A3B V1 GGUFNewOpen
Jackrong · self-host
Best for: Trending on HuggingFace (117 likes this week)
How: Available on Hugging Face. 67K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus3.6-35B-A3B-v1-GGUF")
transformersgguftext-generation-inferenceunslothqwen3_6API: huggingface.co/Jackrong/Qwopus3.6-35B-A3B-v1-GGUF
Auto-discovered from HuggingFace trending. 117 likes, 67K downloads.
- ▾Sulphur 2 BaseNewOpen
SulphurAI · self-host
Best for: Trending on HuggingFace (1537 likes this week)
How: Available on Hugging Face. 1666K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("SulphurAI/Sulphur-2-base")
diffusersgguftext-to-videobase_model:Lightricks/LTX-2.3base_model:quantized:Lightricks/LTX-2.3API: huggingface.co/SulphurAI/Sulphur-2-base
Auto-discovered from HuggingFace trending. 1537 likes, 1.7M downloads.
- ▾Scenema AudioOpen
ScenemaAI · self-host
Best for: Trending on HuggingFace (101 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ScenemaAI/scenema-audio")
scenema-audioaudio-generationdiffusiontext-to-audiovoice-cloningAPI: huggingface.co/ScenemaAI/scenema-audio
Auto-discovered from HuggingFace trending. 101 likes, 237 downloads.
- ▾Deepseek V4 GgufOpen
antirez · self-host
Best for: Trending on HuggingFace (139 likes this week)
How: Available on Hugging Face. 284K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("antirez/deepseek-v4-gguf")
ggufquantizeddeepseekdeepseek-v4deepseek-v4-flashAPI: huggingface.co/antirez/deepseek-v4-gguf
Auto-discovered from HuggingFace trending. 139 likes, 284K downloads.
- ▾DramaboxOpen
ResembleAI · self-host
Best for: Trending on HuggingFace (239 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ResembleAI/Dramabox")
ltx-audio-ttsdramabox-ttsttsvoice-cloningaudio-generationAPI: huggingface.co/ResembleAI/Dramabox
Auto-discovered from HuggingFace trending. 239 likes, 1K downloads.
- ▾MiniCPM V 4.6Open
openbmb · self-host
Best for: Trending on HuggingFace (1084 likes this week)
How: Available on Hugging Face. 445K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("openbmb/MiniCPM-V-4.6")
transformerssafetensorsminicpmv4_6image-text-to-textminicpm-vAPI: huggingface.co/openbmb/MiniCPM-V-4.6
Auto-discovered from HuggingFace trending. 1084 likes, 445K downloads.
- ▾NVIDIA Nemotron 3 Nano Omni 30B A3B Reasoning GGUFOpen
unsloth · self-host
Best for: Trending on HuggingFace (100 likes this week)
How: Available on Hugging Face. 45K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF")
ggufnvidiaunslothnemotron-3multimodalAPI: huggingface.co/unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF
Auto-discovered from HuggingFace trending. 100 likes, 45K downloads.
- ▾Z AnimeOpen
SeeSee21 · self-host
Best for: Trending on HuggingFace (418 likes this week)
How: Available on Hugging Face. 16K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("SeeSee21/Z-Anime")
diffuserssafetensorsggufz-animetext-to-imageAPI: huggingface.co/SeeSee21/Z-Anime
Auto-discovered from HuggingFace trending. 418 likes, 16K downloads.
- ▾Ling 2.6 1TOpen
inclusionAI · self-host
Best for: Trending on HuggingFace (111 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("inclusionAI/Ling-2.6-1T")
transformerssafetensorsbailing_hybridtext-generationconversationalAPI: huggingface.co/inclusionAI/Ling-2.6-1T
Auto-discovered from HuggingFace trending. 111 likes, 642 downloads.
- ▾LTX 2.3 WorkflowsOpen
RuneXX · self-host
Best for: Trending on HuggingFace (564 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("RuneXX/LTX-2.3-Workflows")
ltxltx-2comfyuicomfyggufAPI: huggingface.co/RuneXX/LTX-2.3-Workflows
Auto-discovered from HuggingFace trending. 564 likes, 0 downloads.
- ▾Fara 7BOpen
microsoft · self-host
Best for: Trending on HuggingFace (593 likes this week)
How: Available on Hugging Face. 15K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("microsoft/Fara-7B")
transformerssafetensorsqwen2_5_vlimage-text-to-textmultimodalAPI: huggingface.co/microsoft/Fara-7B
Auto-discovered from HuggingFace trending. 593 likes, 15K downloads.
- ▾NeedleOpen
Cactus · self-host
Best for: function-calling tasks
How: run Needle on consumer devices
Example: function-calling (tool use) model
26M parameter modelruns at 6000 tok/s prefill1200 tok/s decode on consumer devicesAuto-discovered from news articles.
- ▾Qwen3.6 27B Heretic Uncensored FINETUNE NEO CODE Di IMatrix MAX GGUFOpen
DavidAU · self-host
Best for: Trending on HuggingFace (105 likes this week)
How: Available on Hugging Face. 144K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF")
transformersggufunslothhereticuncensoredAPI: huggingface.co/DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF
Auto-discovered from HuggingFace trending. 105 likes, 144K downloads.
- ▾LTX2.3 10ErosOpen
TenStrip · self-host
Best for: Trending on HuggingFace (281 likes this week)
How: Available on Hugging Face. 136K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("TenStrip/LTX2.3-10Eros")
diffusersimage-to-videoregion:usAPI: huggingface.co/TenStrip/LTX2.3-10Eros
Auto-discovered from HuggingFace trending. 281 likes, 136K downloads.
- ▾Granite 4.1 30bOpen
ibm-granite · self-host
Best for: Trending on HuggingFace (100 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ibm-granite/granite-4.1-30b")
transformerssafetensorsgranitetext-generationlanguageAPI: huggingface.co/ibm-granite/granite-4.1-30b
Auto-discovered from HuggingFace trending. 100 likes, 6K downloads.
- ▾Granite 4.1 8bOpen
ibm-granite · self-host
Best for: Trending on HuggingFace (157 likes this week)
How: Available on Hugging Face. 20K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("ibm-granite/granite-4.1-8b")
transformerssafetensorsgranitetext-generationlanguageAPI: huggingface.co/ibm-granite/granite-4.1-8b
Auto-discovered from HuggingFace trending. 157 likes, 20K downloads.
- ▾Ling 2.6 FlashOpen
inclusionAI · self-host
Best for: Trending on HuggingFace (456 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("inclusionAI/Ling-2.6-flash")
safetensorsbailing_hybridtext-generationconversationalcustom_codeAPI: huggingface.co/inclusionAI/Ling-2.6-flash
Auto-discovered from HuggingFace trending. 456 likes, 1K downloads.
- ▾OmniVoiceOpen
k2-fsa · self-host
Best for: Trending on HuggingFace (872 likes this week)
How: Available on Hugging Face. 2236K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("k2-fsa/OmniVoice")
omnivoicesafetensorszero-shotmultilingualvoice-cloningAPI: huggingface.co/k2-fsa/OmniVoice
Auto-discovered from HuggingFace trending. 872 likes, 2.2M downloads.
- ▾CyberSecQwen-4BOpen
Hugging Face · 128K tokens · self-host
Best for: defensive cyber tasks
How: use the model for specialized cyber defense tasks
Example: model can be used for detecting and preventing cyber threats
defensive cyberspecializedlocally-runnableAuto-discovered from news articles.
- ▾GAIAOpen
AMD · self-host
Best for: local AI applications on Windows and Linux
How: integrate with Lemonade SDK
Example: use GAIA for local AI tasks on Windows and Linux systems
easy to useleverages Lemonade SDKAuto-discovered from news articles.
- ▾Laguna XS.2Open
poolside · self-host
Best for: Trending on HuggingFace (228 likes this week)
How: Available on Hugging Face. 14K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("poolside/Laguna-XS.2")
transformerssafetensorslagunatext-generationlaguna-xs.2API: huggingface.co/poolside/Laguna-XS.2
Auto-discovered from HuggingFace trending. 228 likes, 14K downloads.
- ▾Qwen3.6 27B DFlashOpen
z-lab · self-host
Best for: Trending on HuggingFace (262 likes this week)
How: Available on Hugging Face. 29K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("z-lab/Qwen3.6-27B-DFlash")
transformerssafetensorsqwen3feature-extractiondflashAPI: huggingface.co/z-lab/Qwen3.6-27B-DFlash
Auto-discovered from HuggingFace trending. 262 likes, 29K downloads.
- ▾Qwen3.6 35B A3B DFlashOpen
z-lab · self-host
Best for: Trending on HuggingFace (165 likes this week)
How: Available on Hugging Face. 27K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("z-lab/Qwen3.6-35B-A3B-DFlash")
transformerssafetensorsqwen3feature-extractiondflashAPI: huggingface.co/z-lab/Qwen3.6-35B-A3B-DFlash
Auto-discovered from HuggingFace trending. 165 likes, 27K downloads.
- ▾Qwen3.6 27B Uncensored HauhauCS AggressiveOpen
HauhauCS · self-host
Best for: Trending on HuggingFace (265 likes this week)
How: Available on Hugging Face. 303K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Aggressive")
ggufuncensoredqwen3.6visionmultimodalAPI: huggingface.co/HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Aggressive
Auto-discovered from HuggingFace trending. 265 likes, 303K downloads.
- ▾Hy3 PreviewOpen
tencent · self-host
Best for: Trending on HuggingFace (189 likes this week)
How: Available on Hugging Face. 14K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("tencent/Hy3-preview")
transformerssafetensorshy_v3text-generationconversationalAPI: huggingface.co/tencent/Hy3-preview
Auto-discovered from HuggingFace trending. 189 likes, 14K downloads.
- ▾MiMo V2.5 ProOpen
XiaomiMiMo · self-host
Best for: Trending on HuggingFace (506 likes this week)
How: Available on Hugging Face. 40K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("XiaomiMiMo/MiMo-V2.5-Pro")
safetensorsmimo_v2text-generationagentlong-contextAPI: huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro
Auto-discovered from HuggingFace trending. 506 likes, 40K downloads.
- ▾Qwen3.6 27B FP8Open
Qwen · self-host
Best for: Trending on HuggingFace (160 likes this week)
How: Available on Hugging Face. 745K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-27B-FP8")
transformerssafetensorsqwen3_5image-text-to-textconversationalAPI: huggingface.co/Qwen/Qwen3.6-27B-FP8
Auto-discovered from HuggingFace trending. 160 likes, 745K downloads.
- ▾DeepSeek V4 FlashOpen
deepseek-ai · self-host
Best for: Trending on HuggingFace (1371 likes this week)
How: Available on Hugging Face. 3525K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-V4-Flash")
transformerssafetensorsconversationallicense:miteval-resultsAPI: huggingface.co/deepseek-ai/DeepSeek-V4-Flash
Auto-discovered from HuggingFace trending. 1371 likes, 3.5M downloads.
- ▾DeepSeek V4 ProOpen
deepseek-ai · self-host
Best for: Trending on HuggingFace (4867 likes this week)
How: Available on Hugging Face. 2935K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-V4-Pro")
transformerssafetensorsdeepseek_v4text-generationconversationalAPI: huggingface.co/deepseek-ai/DeepSeek-V4-Pro
Auto-discovered from HuggingFace trending. 4867 likes, 2.9M downloads.
- ▾Qwen3.6 35B A3B Claude 4.6 Opus Reasoning Distilled GGUFOpen
hesamation · self-host
Best for: Trending on HuggingFace (200 likes this week)
How: Available on Hugging Face. 129K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("hesamation/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GGUF")
ggufllama.cppqwenqwen3.6qwen3_5_moeAPI: huggingface.co/hesamation/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
Auto-discovered from HuggingFace trending. 200 likes, 129K downloads.
- ▾Qwen3.6 27B GGUFOpen
unsloth · self-host
Best for: Trending on HuggingFace (633 likes this week)
How: Available on Hugging Face. 1355K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/Qwen3.6-27B-GGUF")
transformersggufunslothqwenqwen3_5API: huggingface.co/unsloth/Qwen3.6-27B-GGUF
Auto-discovered from HuggingFace trending. 633 likes, 1.4M downloads.
- ▾Qwen3.6 27BOpen
Qwen · self-host
Best for: Trending on HuggingFace (1554 likes this week)
How: Available on Hugging Face. 5064K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-27B")
transformerssafetensorsqwen3_5image-text-to-textconversationalAPI: huggingface.co/Qwen/Qwen3.6-27B
Auto-discovered from HuggingFace trending. 1554 likes, 5.1M downloads.
- ▾Qwopus GLM 18B Merged GGUFOpen
Jackrong · self-host
Best for: Trending on HuggingFace (201 likes this week)
How: Available on Hugging Face. 70K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jackrong/Qwopus-GLM-18B-Merged-GGUF")
ggufmergefrankenmergeqwen3.5reasoningAPI: huggingface.co/Jackrong/Qwopus-GLM-18B-Merged-GGUF
Auto-discovered from HuggingFace trending. 201 likes, 70K downloads.
- ▾Kimi K2.6Open
moonshotai · self-host
Best for: Trending on HuggingFace (1197 likes this week)
How: Available on Hugging Face. 825K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("moonshotai/Kimi-K2.6")
transformerssafetensorskimi_k25feature-extractioncompressed-tensorsAPI: huggingface.co/moonshotai/Kimi-K2.6
Auto-discovered from HuggingFace trending. 1197 likes, 825K downloads.
- ▾Qwen3.6 35B A3B FP8Open
Qwen · self-host
Best for: Trending on HuggingFace (158 likes this week)
How: Available on Hugging Face. 490K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-35B-A3B-FP8")
transformerssafetensorsqwen3_5_moeimage-text-to-textconversationalAPI: huggingface.co/Qwen/Qwen3.6-35B-A3B-FP8
Auto-discovered from HuggingFace trending. 158 likes, 490K downloads.
- ▾Qwen3.6 35B A3B Uncensored HauhauCS AggressiveOpen
HauhauCS · self-host
Best for: Trending on HuggingFace (1860 likes this week)
How: Available on Hugging Face. 2698K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive")
ggufuncensoredqwen3.6moevisionAPI: huggingface.co/HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
Auto-discovered from HuggingFace trending. 1860 likes, 2.7M downloads.
- ▾ERNIE Image Turbo GGUFOpen
unsloth · self-host
Best for: Trending on HuggingFace (180 likes this week)
How: Available on Hugging Face. 30K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/ERNIE-Image-Turbo-GGUF")
ggmlgguftext-to-imageunslothbase_model:baidu/ERNIE-Image-TurboAPI: huggingface.co/unsloth/ERNIE-Image-Turbo-GGUF
Auto-discovered from HuggingFace trending. 180 likes, 30K downloads.
- ▾Gemma 4 31B It NVFP4 TurboOpen
LilaRest · self-host
Best for: Trending on HuggingFace (247 likes this week)
How: Available on Hugging Face. 105K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("LilaRest/gemma-4-31B-it-NVFP4-turbo")
transformerssafetensorsgemma4text-generationgemma-4-31b-itAPI: huggingface.co/LilaRest/gemma-4-31B-it-NVFP4-turbo
Auto-discovered from HuggingFace trending. 247 likes, 105K downloads.
- ▾Supergemma4 26b Uncensored Mlx 4bit V2Open
Jiunsong · self-host
Best for: Trending on HuggingFace (172 likes this week)
How: Available on Hugging Face. 14K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jiunsong/supergemma4-26b-uncensored-mlx-4bit-v2")
mlxsafetensorsgemma4uncensoredapple-siliconAPI: huggingface.co/Jiunsong/supergemma4-26b-uncensored-mlx-4bit-v2
Auto-discovered from HuggingFace trending. 172 likes, 14K downloads.
- ▾Gemma 4 E4B It OBLITERATEDOpen
OBLITERATUS · self-host
Best for: Trending on HuggingFace (526 likes this week)
How: Available on Hugging Face. 128K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("OBLITERATUS/gemma-4-E4B-it-OBLITERATED")
safetensorsggufgemma4abliterateduncensoredAPI: huggingface.co/OBLITERATUS/gemma-4-E4B-it-OBLITERATED
Auto-discovered from HuggingFace trending. 526 likes, 128K downloads.
- ▾Gemma 4 31B JANG_4M CRACKOpen
dealignai · self-host
Best for: Trending on HuggingFace (1487 likes this week)
How: Available on Hugging Face. 170K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("dealignai/Gemma-4-31B-JANG_4M-CRACK")
mlxsafetensorsgemma4abliterateduncensoredAPI: huggingface.co/dealignai/Gemma-4-31B-JANG_4M-CRACK
Auto-discovered from HuggingFace trending. 1487 likes, 170K downloads.
- ▾ERNIE Image TurboOpen
baidu · self-host
Best for: Trending on HuggingFace (344 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("baidu/ERNIE-Image-Turbo")
diffuserssafetensorstext-to-image8Blicense:apache-2.0API: huggingface.co/baidu/ERNIE-Image-Turbo
Auto-discovered from HuggingFace trending. 344 likes, 6K downloads.
- ▾Qwen3.6 35B A3B GGUFOpen
unsloth · self-host
Best for: Trending on HuggingFace (966 likes this week)
How: Available on Hugging Face. 2500K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("unsloth/Qwen3.6-35B-A3B-GGUF")
transformersggufunslothqwenqwen3_5_moeAPI: huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF
Auto-discovered from HuggingFace trending. 966 likes, 2.5M downloads.
- ▾Supergemma4 26b Uncensored Gguf V2Open
Jiunsong · self-host
Best for: Trending on HuggingFace (627 likes this week)
How: Available on Hugging Face. 267K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Jiunsong/supergemma4-26b-uncensored-gguf-v2")
ggufgemma4uncensoredfastllama.cppAPI: huggingface.co/Jiunsong/supergemma4-26b-uncensored-gguf-v2
Auto-discovered from HuggingFace trending. 627 likes, 267K downloads.
- ▾GLM 5.1Open
zai-org · self-host
Best for: Trending on HuggingFace (1472 likes this week)
How: Available on Hugging Face. 171K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("zai-org/GLM-5.1")
transformerssafetensorsglm_moe_dsatext-generationconversationalAPI: huggingface.co/zai-org/GLM-5.1
Auto-discovered from HuggingFace trending. 1472 likes, 171K downloads.
- ▾ERNIE ImageOpen
baidu · self-host
Best for: Trending on HuggingFace (550 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("baidu/ERNIE-Image")
diffuserssafetensorstext-to-image8Blicense:apache-2.0API: huggingface.co/baidu/ERNIE-Image
Auto-discovered from HuggingFace trending. 550 likes, 7K downloads.
- ▾HY Embodied 0.5Open
tencent · self-host
Best for: Trending on HuggingFace (897 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("tencent/HY-Embodied-0.5")
transformerssafetensorshunyuan_vl_motimage-text-to-texthunyuanAPI: huggingface.co/tencent/HY-Embodied-0.5
Auto-discovered from HuggingFace trending. 897 likes, 2K downloads.
- ▾Qwen3.6 35B A3BOpen
Qwen · self-host
Best for: Trending on HuggingFace (1803 likes this week)
How: Available on Hugging Face. 5477K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-35B-A3B")
transformerssafetensorsqwen3_5_moeimage-text-to-textconversationalAPI: huggingface.co/Qwen/Qwen3.6-35B-A3B
Auto-discovered from HuggingFace trending. 1803 likes, 5.5M downloads.
- ▾MiniMax M2.7Open
MiniMaxAI · self-host
Best for: Trending on HuggingFace (1052 likes this week)
How: Available on Hugging Face. 469K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("MiniMaxAI/MiniMax-M2.7")
transformerssafetensorsminimax_m2text-generationconversationalAPI: huggingface.co/MiniMaxAI/MiniMax-M2.7
Auto-discovered from HuggingFace trending. 1052 likes, 469K downloads.
- ▾Nucleus ImageOpen
NucleusAI · self-host
Best for: Trending on HuggingFace (213 likes this week)
How: Available on Hugging Face.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("NucleusAI/Nucleus-Image")
diffuserssafetensorsmoesparse-moediffusionAPI: huggingface.co/NucleusAI/Nucleus-Image
Auto-discovered from HuggingFace trending. 213 likes, 2K downloads.
- ▾Gemma 4 31B ItOpen
google · self-host
Best for: Trending on HuggingFace (2640 likes this week)
How: Available on Hugging Face. 9794K downloads.
Example: from transformers import AutoModelForCausalLM; model = AutoModelForCausalLM.from_pretrained("google/gemma-4-31B-it")
transformerssafetensorsgemma4image-text-to-textconversationalAPI: huggingface.co/google/gemma-4-31B-it
Auto-discovered from HuggingFace trending. 2640 likes, 9.8M downloads.