copenlu/mm-framing
Viewer • Updated • 633k • 92 • 4
RoBERTa topic classifier for topic injection into the Longformer Framing Classifier. Classifies input text into one of 19 discrete topics:
These were derived empirically by consolidating the unstructured gpt_topic field from the mm_framing silver dataset into
discrete categories based on similarity.
Achieved a 76.4% validation accuracy on 64,000 examples, which was deemed sufficient for assisting domain-specific reasoning in downstream model.