How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published about 1 month ago • 54
How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity Paper • 2511.08487 • Published Nov 11, 2025 • 2
Rectifying LLM Thought from Lens of Optimization Paper • 2512.01925 • Published about 1 month ago • 23
deepseek-ai/DeepSeek-V3.2-Speciale Text Generation • 685B • Updated about 1 month ago • 25.4k • 627
deepseek-ai/DeepSeek-V3.2 Text Generation • 685B • Updated about 1 month ago • 116k • 1.05k
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning Paper • 2511.14366 • Published Nov 18, 2025 • 16
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 38 items • Updated Nov 6, 2025 • 57
NVIDIA Nemotron V2 Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 8 days ago • 100
Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis Paper • 2508.15754 • Published Aug 21, 2025 • 4
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 259