NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 40
Article TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell 15 days ago • 11