Optimizing Distributed Training on Frontier for Large Language Models Paper • 2312.12705 • Published Dec 20, 2023 • 1