2-Bit Quantization of Large Language Models - arxiv.org

Clear