Xinyu Zhu

TianHongZXY

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

upvoted a paper 25 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

updated a model 29 days ago

TianHongZXY/Qwen3-4B-NSR

published a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

View all activity

Organizations

upvoted a paper 25 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 25 days ago • 128

updated a model 29 days ago

TianHongZXY/Qwen3-4B-NSR

4B • Updated 29 days ago • 10

published a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

0.5B • Updated Nov 5, 2025

updated a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

0.5B • Updated Nov 5, 2025

upvoted a paper 3 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

authored a paper 3 months ago

RAST: Reasoning Activation in LLMs via Small-model Transfer

Paper • 2506.15710 • Published May 30, 2025

updated a dataset 4 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4, 2025 • 2.16k • 7.72k

published a dataset 4 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4, 2025 • 2.16k • 7.72k

upvoted a paper 4 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 347

updated a dataset 4 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28, 2025 • 2.16k • 9.4k

published a dataset 4 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28, 2025 • 2.16k • 9.4k

liked a dataset 4 months ago

cais/hle

Viewer • Updated Sep 10, 2025 • 2.5k • 20.2k • 644

liked a dataset 5 months ago

nvidia/OpenScienceReasoning-2

Viewer • Updated Jul 31, 2025 • 803k • 480 • 52

liked a model 5 months ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17, 2025 • 27.5k • • 390

liked a dataset 5 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 9.75k • 170

upvoted a collection 5 months ago

RLVR-Decomposed

Collection

The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" • 9 items • Updated Jun 1, 2025 • 3

updated a model 5 months ago

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28, 2025 • 6

updated a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12, 2025

published a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12, 2025

updated a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8, 2025

Xinyu Zhu

AI & ML interests

Recent Activity

Organizations

TianHongZXY's activity