Text Classification
Transformers
Safetensors
English
Chinese
internlm2
feature-extraction
reward model
custom_code
Instructions to use internlm/internlm2-20b-reward with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use internlm/internlm2-20b-reward with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="internlm/internlm2-20b-reward", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("internlm/internlm2-20b-reward", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Great reward model, what dataset did you use to train?
#1
by zolicsaki - opened
Specifically I was wondering if you trained it on lmsys chatbot arena conversations, because your model is performing so well when evaluated on those preferences. Thanks for the help!
https://huggingface.co/datasets/lmsys/chatbot_arena_conversations
Sorry for the late reply. We did use a portion of this dataset. We performed data cleaning and filtering, including removing toxic and unsafe data, to ensure quality and safety.