Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonOpenGVLab/OmniQuant

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

50.9/100
891Forks: 78
View on GitHub
Loading report...

Similar Projects

Chinese-LLaMA-Alpaca

89

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python18.9K

LightCompress

68

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python710

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python70.5K

langextract

91

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python35.8K
Back to List