Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
18
1
Jiarui Yao
FlippyDora
Follow
research4pan's profile picture
1 follower
·
20 following
AI & ML interests
None yet
Recent Activity
updated
a model
16 days ago
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step300
updated
a model
16 days ago
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step250
updated
a model
17 days ago
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step400
View all activity
Organizations
FlippyDora
's models
62
Sort: Recently updated
FlippyDora/Qwen2.5-Math-1.5B-raft-vanilla_numina_math-step_20
2B
•
Updated
Mar 14, 2025
•
6
FlippyDora/Qwen2.5-Math-1.5B-raft-pp_numina_math-step_20
2B
•
Updated
Mar 14, 2025
•
6
FlippyDora/Qwen1.5B-Inst_numina_raft1_orig_eos
Text Generation
•
2B
•
Updated
Mar 6, 2025
•
5
FlippyDora/qwen_sft_1
Text Generation
•
8B
•
Updated
Mar 4, 2025
•
5
FlippyDora/qwen_sft_2
Text Generation
•
8B
•
Updated
Mar 4, 2025
•
3
FlippyDora/Qwen_numina_raft3_orig_eos
Text Generation
•
8B
•
Updated
Mar 1, 2025
•
4
FlippyDora/Qwen_numina_raft2_orig_eos
Text Generation
•
8B
•
Updated
Mar 1, 2025
•
4
FlippyDora/3B_rpr_mixtureBT_criteria_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24, 2025
•
6
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k5
3B
•
Updated
Feb 24, 2025
•
5
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24, 2025
•
6
FlippyDora/3B_mixtureBT_rpr_criteria_k5_epoch5_loadBalance0.5
3B
•
Updated
Feb 22, 2025
•
4
FlippyDora/3B_mixtureBT_helpsteer2_pkusafe_attr_heads6_loadBalance0.5
3B
•
Updated
Feb 12, 2025
•
5
FlippyDora/3B_mixtureBT_rpr_criteria_epoch5_loadBalance0.5
3B
•
Updated
Feb 10, 2025
•
5
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8, 2025
•
5
FlippyDora/3B_helpsteer2_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8, 2025
•
6
FlippyDora/CoT_Translator
7B
•
Updated
Feb 6, 2025
•
6
FlippyDora/CoT_Prover
7B
•
Updated
Feb 4, 2025
•
4
FlippyDora/dpo_rm
3B
•
Updated
Jan 21, 2025
•
5
FlippyDora/dpo_remove
3B
•
Updated
Jan 19, 2025
•
4
FlippyDora/origin_preference700k
3B
•
Updated
Jan 18, 2025
•
5
FlippyDora/MixtureBT_preference700k_LoadBalance0.5
3B
•
Updated
Jan 18, 2025
•
4
FlippyDora/MathLLM-StatementTranslator-7B-v0.1
7B
•
Updated
Jan 17, 2025
•
4
FlippyDora/MixtureBT_Helpsteer2_LoadBalance0.5
3B
•
Updated
Jan 16, 2025
•
4
FlippyDora/step_dpo_mistral_lr1e-7_step200
7B
•
Updated
Dec 5, 2024
•
8
FlippyDora/step_dpo_mistral_lr1e-7_step100
7B
•
Updated
Dec 5, 2024
•
4
FlippyDora/mdpo
3B
•
Updated
Nov 21, 2024
•
3
FlippyDora/mdpo_guess_cities
3B
•
Updated
Nov 21, 2024
•
4
FlippyDora/dpo-rm-translate
Updated
Nov 17, 2024
FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo
Updated
Oct 23, 2024
FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo
Updated
Oct 22, 2024
Previous
1
2
3
Next