16 2

Zhichao Lin

chaosmyth

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 5 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 7 days ago • 116

upvoted 4 papers about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 165

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management

Paper • 2510.06727 • Published Oct 8, 2025 • 4

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

upvoted 4 papers 2 months ago

Reinforcement Learning Foundations for Deep Research Systems: A Survey

Paper • 2509.06733 • Published Sep 8, 2025 • 32

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 99

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 100

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

upvoted 5 papers 3 months ago

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published Oct 14, 2025 • 13

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published Oct 7, 2025 • 31

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 79

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published Aug 11, 2025 • 51

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

upvoted a collection 3 months ago

Daily Papers

1 item • Updated Oct 26, 2023 • 82

upvoted an article 9 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 269