🏗️ Building on HF

Dipankar Sarkar PRO

dipankarsarkar

https://www.dipankar.cc

AI & ML interests

Building the AI-native stack. Agents as infrastructure, safety as architecture, performance as plumbing. I publish the receipts: papers, datasets, demos.

Recent Activity

upvoted a paper 16 minutes ago

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

upvoted a paper 16 minutes ago

Autonomous Scientific Discovery via Iterative Meta-Reflection

upvoted a paper 17 minutes ago

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

View all activity

Organizations

upvoted 2 papers 16 minutes ago

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

Paper • 2606.32029 • Published 2 days ago • 3

Autonomous Scientific Discovery via Iterative Meta-Reflection

Paper • 2607.01131 • Published 1 day ago • 3

upvoted 2 papers 17 minutes ago

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

Paper • 2607.00466 • Published 1 day ago • 15

MemSyco-Bench: Benchmarking Sycophancy in Agent Memory

Paper • 2607.01071 • Published 1 day ago • 16

upvoted a paper about 15 hours ago

Hierarchical Experimentalist Agents

Paper • 2606.29315 • Published 4 days ago • 1

upvoted a paper about 16 hours ago

Are We Measuring Strategy or Phrasing? The Gap Between Surface- and Approach-Level Diversity in LLM Math Reasoning

Paper • 2606.29985 • Published 3 days ago • 15

upvoted 3 papers about 17 hours ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published May 29 • 123

AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets

Paper • 2512.10971 • Published Dec 1, 2025 • 12

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 3 days ago • 6

upvoted a paper about 18 hours ago

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Paper • 2601.07853 • Published Jan 9 • 11

upvoted 7 papers about 19 hours ago

Who judges the judges? Governance from metrics: a runtime framework for continuous LLM compliance monitoring

Paper • 2605.24737 • Published May 23 • 2

Replayable Financial Agents: A Determinism-Faithfulness Assurance Harness for Tool-Using LLM Agents

Paper • 2601.15322 • Published Mar 7 • 1

TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories

Paper • 2604.07223 • Published Apr 8 • 1

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Paper • 2604.10577 • Published Apr 12 • 27

upvoted 3 papers about 21 hours ago

DataEvolver: Self-Evolving Multi-Agent Data Construction for Text-Rich Image Generation

Paper • 2606.31537 • Published 2 days ago • 17

Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction

Paper • 2606.28186 • Published 6 days ago • 6

Xiaomi-GUI-0 Technical Report

Paper • 2606.31410 • Published 2 days ago • 10

Dipankar Sarkar PRO

AI & ML interests

Recent Activity

Organizations

dipankarsarkar's activity