Zhukov's picture

Zhukov

Geximus

·

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

MiniMaxAI/MiniMax-M2.7:Prevent whitespace leakage in beginning of prompt

upvoted an article 17 days ago

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

new activity 24 days ago

demon-zombie/MiniMax-M2.7-AWQ-4bit:These are NOT actual AWQ-quantized models.

View all activity

Organizations

None yet

New activity in MiniMaxAI/MiniMax-M2.7 6 days ago

Prevent whitespace leakage in beginning of prompt

#22 opened 22 days ago by

upvoted an article 17 days ago

Article

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Apr 3

•

8

New activity in demon-zombie/MiniMax-M2.7-AWQ-4bit 24 days ago

These are NOT actual AWQ-quantized models.

#1 opened 24 days ago by

New activity in MiniMaxAI/MiniMax-M2.7 25 days ago

MiniMax-M2.7 is highly verbose and slow

#18 opened 25 days ago by

New activity in cyankiwi/MiniMax-M2.7-AWQ-4bit 25 days ago

thanks for 4bit awq!

#1 opened 26 days ago by

upvoted an article 26 days ago

Article

2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5

30 days ago

•

3

liked a model 26 days ago

cyankiwi/MiniMax-M2.7-AWQ-4bit

Text Generation • 37B • Updated 26 days ago • 248k • 31

New activity in cyankiwi/MiniMax-M2.5-AWQ-4bit 27 days ago

Is minimax 2.7 on the way?

#3 opened 27 days ago by

New activity in MiniMaxAI/MiniMax-M2.5 29 days ago

Minimax 2.7???

#53 opened about 2 months ago by

New activity in togethercomputer/Aurora-Spec-Minimax-M2.5 about 1 month ago

Perfomance question

#4 opened about 1 month ago by

liked a model 2 months ago

cyankiwi/Qwen3.5-122B-A10B-AWQ-8bit

Image-Text-to-Text • 39B • Updated Mar 26 • 4.64k • 5

liked a model 3 months ago

cyankiwi/Qwen3-Coder-Next-AWQ-4bit

Text Generation • 14B • Updated Mar 26 • 112k • 28

New activity in Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice 3 months ago

Low generation speed and low GPU utilization (~12%) during inference

#18 opened 3 months ago by

liked a model 3 months ago

cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Text Generation • 5B • Updated 3 days ago • 55k • 31

liked a model 4 months ago

ai-sage/GigaAM-v3

Automatic Speech Recognition • Updated Nov 19, 2025 • 108k • 98

New activity in black-forest-labs/FLUX.2-dev 4 months ago

Why is Flux 2 so slow in Img2Img even though everything is in CUDA?

#22 opened 5 months ago by

New activity in cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit 5 months ago

Perfomance of this model is one of the best

#13 opened 5 months ago by

liked a Space 5 months ago

Qwen TTS Clone Demo

Create a custom voice clone and synthesize speech

New activity in cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit 6 months ago

why recently re-uploaded the core?

#7 opened 6 months ago by

liked a model 6 months ago

cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit

Text Generation • 84B • Updated 3 days ago • 50 • 5