Efficient Training Algorithms for Transformer-based Language Models
-
arxiv.org
Clear