arxiv:2510.12831
taicheng guo
taicheng
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
liked a model 29 days ago
meta-llama/Llama-3.2-3B
upvoted a paper about 2 months ago
Glyph: Scaling Context Windows via Visual-Text Compression