Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2604.18486

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

Interesting work but not directly related

VOID: Video Object and Interaction Deletion

Paper • 2604.02296 • Published about 1 month ago • 54
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 13 days ago • 90
WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 24 days ago • 245
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

Paper • 2604.19734 • Published 12 days ago • 29

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 53
Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published about 1 month ago • 500
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 22 days ago • 78
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 19 days ago • 88

Ai papper for efficient ai model

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published 16 days ago • 23
Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 16 days ago • 57
REFRAG: Rethinking RAG based Decoding

Paper • 2509.01092 • Published Sep 1, 2025 • 9
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 13 days ago • 90

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 4
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published Mar 1 • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published Feb 28
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

Ai papper for efficient ai model

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published 16 days ago • 23
Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 16 days ago • 57
REFRAG: Rethinking RAG based Decoding

Paper • 2509.01092 • Published Sep 1, 2025 • 9
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 13 days ago • 90

Interesting work but not directly related

VOID: Video Object and Interaction Deletion

Paper • 2604.02296 • Published about 1 month ago • 54
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 13 days ago • 90
WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 24 days ago • 245
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

Paper • 2604.19734 • Published 12 days ago • 29

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 4
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published Mar 1 • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published Feb 28
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 53
Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published about 1 month ago • 500
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 22 days ago • 78
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 19 days ago • 88

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs