2-Bit Quantization of Large Language Models
arxiv.org