Zhenhailong Wang

mikewang

https://mikewangwzhl.github.io/

AI & ML interests

NLP, Computer Vision

Recent Activity

upvoted 2 papers about 2 months ago

MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks

Paper • 2502.17832 • Published Feb 25, 2025 • 6

EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities

Paper • 2510.27545 • Published Oct 31, 2025 • 48

upvoted a paper 2 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 221

upvoted a paper 3 months ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published Oct 10, 2025 • 4

commented on a paper 3 months ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published Oct 10, 2025 • 4

upvoted a paper 3 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29, 2025 • 11

upvoted a paper 4 months ago

FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games

Paper • 2509.01052 • Published Sep 1, 2025 • 21

upvoted a paper 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 266

upvoted a paper 6 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8, 2025 • 47

commented on a paper 6 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8, 2025 • 47

upvoted a paper 6 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2, 2025 • 69

New activity in mikewang/PVD-160K 7 months ago

Add image-to-text task category

#2 opened 7 months ago by nielsr

New activity in mikewang/PVD-160k-Mistral-7b 7 months ago

Add library name and pipeline tag

#1 opened 7 months ago by nielsr

published a model 8 months ago

mikewang/DyMU

Updated Apr 11, 2025

upvoted 3 papers 8 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 98

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 79

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs

Paper • 2504.17040 • Published Apr 23, 2025 • 13

upvoted a paper 9 months ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16, 2025 • 48

updated a model 9 months ago

mikewang/DyMU

Updated Apr 11, 2025

upvoted a paper 10 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3, 2025 • 29