Last updated: 2025-03-15 (Sat) 03:13:06 (38d)

AWQ

Activation-aware Weight Quantization for LLM Compression and Acceleration

https://github.com/mit-han-lab/llm-awq