Working Notes on Late Interaction Dynamics: Analyzing Targeted Behaviors of Late Interaction Models Paper • 2603.26259 • Published Mar 27 • 8
view article Article DeepSeek-V4: a million-token context that agents can actually use 8 days ago • 39
Running on CPU Upgrade 231 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 231 Explore synthetic data experiments on a virtual bookshelf
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 55
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 310
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published Oct 22, 2025 • 38
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published Oct 22, 2025 • 38
Running on CPU Upgrade Featured 3.14k The Smol Training Playbook 📚 3.14k The secrets to building world-class LLMs
Holo1.5 Collection Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35
view reply My hands are full at the moment, so I'll have to pass sorry @ariG23498 !But I'll be more than happy to further discuss VLM-related research and training tricks on X (I think we already follow each other anyway 😉).