T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models Paper • 2504.04718 • Published Apr 7, 2025 • 43
Article Introducing OptiMind, a research model designed for optimization 4 days ago • 22
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published May 14, 2025 • 99
Article Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data Mar 26, 2025 • 9
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 254
Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day Dec 8, 2025 • 48
Tiny-A2D Collection Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 14
Article Transformers v5: Simple model definitions powering the AI ecosystem Dec 1, 2025 • 274
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE Paper • 2502.06282 • Published Feb 10, 2025 • 6
Article Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio Jul 31, 2025 • 60
Article ✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Jan 3, 2025 • 22