bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF Text Generation • 33B • Updated Jan 22, 2025 • 16.9k • 300
Running 3.82k The Ultra-Scale Playbook 🌌 3.82k The ultimate guide to training LLM on large GPU Clusters