mradermacher commited on
Commit
b496d05
·
verified ·
1 Parent(s): 3d9ffc6

auto-patch README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -7,6 +7,10 @@ model_name: R-PRM-7B-DPO
7
  mradermacher:
8
  readme_rev: 1
9
  quantized_by: mradermacher
 
 
 
 
10
  ---
11
  ## About
12
 
 
7
  mradermacher:
8
  readme_rev: 1
9
  quantized_by: mradermacher
10
+ tags:
11
+ - reinforcement-learning
12
+ - reward-model
13
+ - dpo
14
  ---
15
  ## About
16