MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper โข 2603.09652 โข Published Mar 10 โข 15
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper โข 2603.09652 โข Published Mar 10 โข 15
Running Agents 2 MiniAppBench Leaderboard ๐ 2 Submit MiniApp model results and view the leaderboard
Running Agents 2 MiniAppBench Leaderboard ๐ 2 Submit MiniApp model results and view the leaderboard