Article • Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs • Apr 3 • 8
Article • 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 • 30 days ago • 3
cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit • Text Generation • 5B • Updated 3 days ago • 55k • 31
cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit • Text Generation • 84B • Updated 3 days ago • 50 • 5