NeMo Accelerates MoE Fine-Tuning
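Hugging Face introduces NVIDIA NeMo AutoModel, which speeds up mixture-of-experts (MoE) fine-tuning on Transformers v5 through expert parallelism, DeepEP, and TransformerEngine. According to the announcement, throughput improves by 3.4x to 3.7x and GPU memory usage drops by 29% to 32%, while the from_pretrained interface remains compatible.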
Event timeline (1 report)
- 2026-06-25 00:00, Hugging Face Blog: Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel