nyu-mll/glue
Viewer • Updated • 1.49M • 484k • 498
How to use pszemraj/deberta-v3-xsmall-CoLA with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="pszemraj/deberta-v3-xsmall-CoLA") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("pszemraj/deberta-v3-xsmall-CoLA")
model = AutoModelForSequenceClassification.from_pretrained("pszemraj/deberta-v3-xsmall-CoLA")This model is a fine-tuned version of microsoft/deberta-v3-xsmall on the GLUE COLA dataset. It achieves the following results on the evaluation set:
Trying to find a decent optimum between accuracy/quality and inference speed.
{
"epoch": 3.0,
"eval_loss": 0.423,
"eval_matthews_correlation": 0.589,
"eval_runtime": 5.0422,
"eval_samples": 1043,
"eval_samples_per_second": 206.853,
"eval_steps_per_second": 51.763
}
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
|---|---|---|---|---|
| 0.3945 | 1.0 | 67 | 0.4323 | 0.5778 |
| 0.3214 | 2.0 | 134 | 0.4237 | 0.5895 |
| 0.3059 | 3.0 | 201 | 0.4636 | 0.5795 |
Base model
microsoft/deberta-v3-xsmall