Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
26
1
Jiarui Yao
FlippyDora
Follow
manh-linh's profile picture
research4pan's profile picture
Antigonish's profile picture
3 followers
·
25 following
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents
authored
a paper
2 days ago
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
authored
a paper
2 days ago
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
View all activity
Organizations
FlippyDora
's datasets
117
Sort: Recently updated
FlippyDora/Mistral_math500_8_orm
Viewer
•
Updated
Jan 22, 2025
•
500
•
4
FlippyDora/orm_qwen25_math
Viewer
•
Updated
Jan 10, 2025
•
200k
•
2
FlippyDora/orm_deepseek_math
Viewer
•
Updated
Jan 10, 2025
•
200k
•
2
FlippyDora/dpo_pair_data_filtered_length15_ans
Viewer
•
Updated
Dec 24, 2024
•
41.2k
•
3
FlippyDora/deepseek_dpo_pair_data_filtered_length15
Viewer
•
Updated
Dec 24, 2024
•
25k
•
3
FlippyDora/deepseek_dpo_pair_data_filtered
Viewer
•
Updated
Dec 24, 2024
•
29.8k
•
5
FlippyDora/deepseek_dpo_pair_data
Viewer
•
Updated
Dec 24, 2024
•
67.1k
•
5
FlippyDora/gsm8k_deepseek_pair_traj_actions_ds
Viewer
•
Updated
Dec 20, 2024
•
32.8k
•
3
•
1
FlippyDora/math_deepseek_pair_traj_actions_ds
Viewer
•
Updated
Dec 20, 2024
•
54.4k
•
3
FlippyDora/dpo_pair_data_filtered_length15
Viewer
•
Updated
Dec 19, 2024
•
41.2k
•
4
FlippyDora/dpo_pair_data_traj
Viewer
•
Updated
Dec 16, 2024
•
4.28k
•
4
FlippyDora/dpo_pair_data_filtered
Viewer
•
Updated
Dec 4, 2024
•
52.2k
•
6
FlippyDora/dpo_pair_data
Viewer
•
Updated
Dec 3, 2024
•
149k
•
4
FlippyDora/dpo_pair_data_part1
Viewer
•
Updated
Dec 3, 2024
•
82.4k
•
8
FlippyDora/pair_traj_actions_ds
Viewer
•
Updated
Dec 3, 2024
•
166k
•
3
FlippyDora/REFUEL
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_sampled_h_from_sampled_len_ckp_4
Viewer
•
Updated
Nov 4, 2024
•
1
•
4
FlippyDora/REFUEL_sampled_h_from_sampled_len_ckp_3
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_sampled_h_from_sampled_len_ckp_2
Viewer
•
Updated
Nov 4, 2024
•
1
•
7
FlippyDora/REFUEL_sampled_h_from_sampled_len_ckp_1
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_sampled_h_from_sampled_len_ckp_0
Viewer
•
Updated
Nov 4, 2024
•
1
•
4
FlippyDora/REFUEL_5_turns_only
Viewer
•
Updated
Nov 4, 2024
•
1
•
2
FlippyDora/REFUEL_5_turns_only_ckp_4
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_5_turns_only_ckp_3
Viewer
•
Updated
Nov 4, 2024
•
1
•
4
FlippyDora/REFUEL_5_turns_only_ckp_2
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_5_turns_only_ckp_1
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
FlippyDora/REFUEL_5_turns_only_ckp_0
Viewer
•
Updated
Nov 4, 2024
•
1
•
3
Previous
1
2
3
4
Next