Enhancing Training Efficiency Using Packing with Flash Attention Paper • 2407.09105 • Published Jul 12, 2024 • 17
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 74