WM/W2R trajectories of WebShop TM-WM (step60/80/92) with Qwen3-8B and Qwen3-32B agents at 40k context.
YOULING HUANG
Ricardo-H
·
AI & ML interests
None yet
Recent Activity
updated a dataset 4 days ago
Ricardo-H/ws-w2w-llama-3.1-8b-webshop-wm-w2r-qwen3-32b published a dataset 4 days ago
Ricardo-H/ws-w2w-llama-3.1-8b-webshop-wm-w2r-qwen3-32b updated a collection 4 days ago
WS TM-WM Sweep - Qwen3 Agents (40k)Organizations
None yet
models 161
Ricardo-H/tw-wm-token-match-llama-step171
8B • Updated • 35
Ricardo-H/tw-wm-tm-0501-step-170
8B • Updated • 21
Ricardo-H/ws-qwen-webshop-token-match-0430-step-84
8B • Updated • 9
Ricardo-H/ws-llama-webshop-token-match-0429-step-92
8B • Updated • 19
Ricardo-H/ws-llama-webshop-token-match-0429-step-80
8B • Updated • 25
Ricardo-H/ws-llama-webshop-token-match-0429-step-60
8B • Updated • 16
Ricardo-H/ocar-gigpo-observe-alfworld-1.5b
2B • Updated • 21
Ricardo-H/ocar-grpo-observe-alfworld-1.5b
2B • Updated • 24
Ricardo-H/ocar-v3-alfworld-7b
8B • Updated • 18
Ricardo-H/ocar-grpo-observe-alfworld-7b
8B • Updated • 15
datasets 23
Ricardo-H/ws-w2w-llama-3.1-8b-webshop-wm-w2r-qwen3-32b
Updated • 18
Ricardo-H/ws-step92-webshop-wm-w2r-qwen3-32b-40k
Updated • 40
Ricardo-H/ws-step92-webshop-wm-w2r-qwen3-8b-40k
Updated • 19
Ricardo-H/ws-step80-webshop-wm-w2r-qwen3-32b-40k
Updated • 50
Ricardo-H/ws-step80-webshop-wm-w2r-qwen3-8b-40k
Updated • 71
Ricardo-H/ws-step60-webshop-wm-w2r-qwen3-32b-40k
Updated • 71
Ricardo-H/ws-step60-webshop-wm-w2r-qwen3-8b-40k
Updated • 60
Ricardo-H/tw-behr-llama-3.1-8b-textworld-wm-w2r-qwen3-32b
Updated • 87
Ricardo-H/tw-behr-llama-3.1-8b-textworld-wm-w2r-qwen3-8b
Updated • 103
Ricardo-H/tw-step171-llama-textworld-wm-w2r-qwen3-32b
Updated • 106