arxiv:2604.14683
Qianqian Xie
mistletoe111
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation
upvoted a paper 2 days ago
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models
updated a dataset 3 days ago
NJU-LINK/DR3-Eval