Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 136
Article Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 265
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 224
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix • Nov 3, 2025 • 54
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers • Sep 11, 2025 • 176
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Paper • 2411.05007 • Published Nov 7, 2024 • 22
Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels • Aug 18, 2025 • 88
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published Aug 14, 2025 • 145
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Paper • 2508.04324 • Published Aug 6, 2025 • 11
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 14, 2025 • 60
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published Aug 13, 2025 • 53
Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 14 days ago • 88
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 442
Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks • Aug 11, 2025 • 75
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 195