Raushan Turganbay

RaushanTurganbay

·

zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

updated a model about 6 hours ago

RaushanTurganbay/audio-flamingo-3-hf-lora-finetuned

upvoted a paper 3 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 12 days ago • 137

upvoted a paper 15 days ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published 29 days ago • 21

upvoted an article 18 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • Mar 10 • 122

upvoted a paper about 1 month ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 170

upvoted an article about 2 months ago

Custom Kernels for All from Codex and Claude

Article • Feb 13 • 73

upvoted an article 3 months ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 307

upvoted a paper 3 months ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published Dec 17, 2025 • 69

upvoted 3 articles 4 months ago

Text-to-image Architectural Experiments

Article • Nov 13, 2025 • 56

We’re open-sourcing our text-to-image model and the process behind it

Article • Nov 12, 2025 • 96

Context Parallelism

Article • Aug 13, 2024 • 30

upvoted a paper 4 months ago

Common Diffusion Noise Schedules and Sample Steps are Flawed

Paper • 2305.08891 • Published May 15, 2023 • 14

upvoted 2 papers 5 months ago

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30, 2025 • 58

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 128

upvoted 2 articles 6 months ago

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

Article • Sep 4, 2025 • 30

ModernVBERT: Towards Smaller Visual Document Retrievers

Article • Oct 3, 2025 • 46

upvoted 2 articles 7 months ago

There is no such thing as a tokenizer-free lunch

Article • Sep 25, 2025 • 95

The Annotated Diffusion Model

Article • Jun 7, 2022 • 339

upvoted a paper 7 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17, 2025 • 37

upvoted an article 7 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • Sep 11, 2025 • 186

upvoted a collection 7 months ago

👁️ LFM2-VL

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated 2 days ago • 64