lancer

lancer001010

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Memory in the Age of AI Agents

upvoted an article about 1 month ago

Continuous batching from first principles

upvoted an article 2 months ago

Supercharge your OCR Pipelines with Open Models

View all activity

Organizations

None yet

upvoted a paper 16 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 19 days ago • 125

upvoted an article about 1 month ago

Article

Continuous batching from first principles

Nov 25, 2025

•

291

upvoted an article 2 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025

•

289

upvoted an article 3 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

upvoted an article 4 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

upvoted a paper 4 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228

upvoted a paper 6 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 157

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

580

upvoted 2 articles 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

267

Article

I trained a Language Model to schedule events with GRPO!

Apr 29, 2025

•

upvoted a paper 9 months ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 133

lancer

AI & ML interests

Recent Activity

Organizations

lancer001010's activity

Continuous batching from first principles

Supercharge your OCR Pipelines with Open Models

mem-agent: Equipping LLM Agents with Memory Using RL

From GRPO to DAPO and GSPO: What, Why, and How

Vision Language Models (Better, faster, stronger)

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

I trained a Language Model to schedule events with GRPO!