2 13

Yuxiang Zhang

TokerZ

AI & ML interests

LLM-based Agent, RL, Large Reasoning Model

Recent Activity

updated a model 4 days ago

TokerZ/7B-E2

updated a model 4 days ago

TokerZ/7B-E4

updated a model 5 days ago

TokerZ/7B-E3

View all activity

Organizations

None yet

updated 2 models 4 days ago

TokerZ/7B-E2

Updated 4 days ago

TokerZ/7B-E4

8B • Updated 4 days ago • 13

updated a model 5 days ago

TokerZ/7B-E3

Updated 5 days ago

published 3 models 5 days ago

updated 2 models 7 days ago

TokerZ/sr1-7b

Updated 7 days ago

TokerZ/sr1_4b

Updated 7 days ago

published 2 models 7 days ago

TokerZ/sr1-7b

Updated 7 days ago

TokerZ/sr1_4b

Updated 7 days ago

upvoted a paper 15 days ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 50

upvoted a paper about 1 month ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 78

upvoted a paper 2 months ago

Solving a Million-Step LLM Task with Zero Errors

Paper • 2511.09030 • Published Nov 12, 2025 • 20

upvoted 3 papers 3 months ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30, 2025 • 27

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 87

Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI

Paper • 2510.16720 • Published Oct 19, 2025 • 7

authored a paper 3 months ago

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Paper • 2510.12635 • Published Oct 14, 2025 • 16

upvoted a paper 3 months ago

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Paper • 2510.12635 • Published Oct 14, 2025 • 16

commented a paper 3 months ago

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Paper • 2510.12635 • Published Oct 14, 2025 • 16 •

upvoted a paper 6 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 19

Yuxiang Zhang

AI & ML interests

Recent Activity

Organizations

TokerZ's activity