view article Article A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model Jan 20 • 23
Gated Linear Attention Transformers with Hardware-Efficient Training Paper • 2312.06635 • Published Dec 11, 2023 • 9
distilbert/distilbert-base-uncased-finetuned-sst-2-english Text Classification • 67M • Updated Dec 19, 2023 • 3.27M • • 860