pszemraj/unified-mcqa
Viewer • Updated • 683k • 68
How to use pszemraj/deberta-v3-base-unified-mcqa-2-choice with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultipleChoice
tokenizer = AutoTokenizer.from_pretrained("pszemraj/deberta-v3-base-unified-mcqa-2-choice")
model = AutoModelForMultipleChoice.from_pretrained("pszemraj/deberta-v3-base-unified-mcqa-2-choice")If finetuning this further on downstream tasks (classification, etc) you may need to pass
ignore_mismatched_sizes=Truewhen loading this model.
This model is a fine-tuned version of microsoft/deberta-v3-base on the pszemraj/unified-mcqa dataset. It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Input Tokens Seen |
|---|---|---|---|---|---|
| 0.3639 | 0.4579 | 400 | 0.3093 | 0.8700 | 2444384 |
| 0.308 | 0.9157 | 800 | 0.2792 | 0.8920 | 4861008 |
| 0.2517 | 1.3732 | 1200 | 0.2989 | 0.8890 | 7293776 |
| 0.2399 | 1.8310 | 1600 | 0.2796 | 0.8990 | 9723744 |
| 0.1082 | 2.2885 | 2000 | 0.3557 | 0.9030 | 12157608 |
| 0.1269 | 2.7463 | 2400 | 0.3486 | 0.8980 | 14592584 |
Base model
microsoft/deberta-v3-base