Wenhao Yu

wyu1

https://wyu97.github.io/

wyu97

AI & ML interests

None yet

Recent Activity

submitted a paper 23 days ago

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Organizations

upvoted a paper 16 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published 17 days ago • 17

upvoted a paper 23 days ago

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Paper • 2512.10284 • Published 23 days ago • 25

upvoted a paper about 1 month ago

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 50

upvoted a paper 2 months ago

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Paper • 2510.14438 • Published Oct 16, 2025 • 13

upvoted 2 papers 3 months ago

Don't Throw Away Your Pretrained Model

Paper • 2510.09913 • Published Oct 10, 2025 • 4

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 36

upvoted 3 papers 4 months ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18, 2025 • 33

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 101

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

upvoted 2 papers 5 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 129

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6, 2025 • 161

upvoted a paper 9 months ago

Towards Trustworthy GUI Agents: A Survey

Paper • 2503.23434 • Published Mar 30, 2025 • 21

upvoted a paper 11 months ago

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Paper • 2501.15427 • Published Jan 26, 2025 • 6

upvoted 2 papers over 1 year ago

LEOPARD: A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 26

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 67

upvoted an article over 1 year ago

Article

BigCodeBench: The Next Generation of HumanEval

Jun 18, 2024
