This is the best quant version in the world,better than FP8

#2
by kq - opened

After my testing work, i am so amazed how this model could outperform official FP8 version.
thank you QuantTrio, thanks your great work!

perfect svg generated:
ScreenShot_2026-03-10_102923_641
ScreenShot_2026-03-10_102939_449

很强的一个模型,致敬并感谢作者!

Have you tested how much it can improved from fp8 version?


有人测过对比官方的FP8版本,TTFT和TPS能提升多少吗?

I agree, I tested multiple versions, this is the only one that is fast, scores the highest in my personal benchmark and fits into my rtx5090 👌

QuantTrio org

Thank you for your support.

Sign up or log in to comment