NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper β’ 2508.14444 β’ Published Aug 20, 2025 β’ 40
view article Article TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell 11 days ago β’ 11