RTX A6000 (1x)
Models that run well on the RTX A6000's 48 GB of VRAM. For models spanning 2x cards with NVLink, see the other collection.
black-forest-labs/FLUX.2-klein-9B Image-to-Image • Updated about 15 hours ago • 2.41k • 108
Small Models (8B max)
GTX 1660 Super. Must run at 4-bit quantization (Quant 4) or higher, at 15 tokens per second (tps) or higher. Full precision vs. quantized not benchmarked.
MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF Text Generation • 8B • Updated Dec 6, 2025 • 59.1k • 4
unsloth/Qwen3-4B-Instruct-2507-GGUF 4B • Updated Aug 20, 2025 • 56.5k • 133
unsloth/SmolLM3-3B-128K-GGUF 3B • Updated Jul 8, 2025 • 2.6k • 37