hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LR-Retrained Text Generation • 3B • Updated Dec 2, 2025 • 3
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LR-Retrained Text Generation • 3B • Updated Dec 2, 2025 • 3
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-R1-QwQ-Seed-42-MLR Text Generation • 3B • Updated Nov 30, 2025 • 2
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-R1-QwQ-Seed-42-MLR Text Generation • 3B • Updated Nov 30, 2025 • 2
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-ROC-Seed-42 Text Generation • 3B • Updated Nov 26, 2025 • 3
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMe Text Generation • 3B • Updated Nov 25, 2025 • 7
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-R1-MC-Seed-42 Text Generation • 3B • Updated Nov 25, 2025 • 3
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-R1-MC-Seed-42 Text Generation • 3B • Updated Nov 25, 2025 • 3
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-Seed-42-ROC-Seed-42 Text Generation • 3B • Updated Nov 26, 2025 • 3
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI Paper • 2511.07885 • Published Nov 11, 2025 • 7
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LRDRMe Text Generation • 3B • Updated Nov 25, 2025 • 7
Cartridges: Lightweight and general-purpose long context representations via self-study Paper • 2506.06266 • Published Jun 6, 2025 • 7
Archon: An Architecture Search Framework for Inference-Time Techniques Paper • 2409.15254 • Published Sep 23, 2024 • 1