Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · OpenGVLab/OmniQuant

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

53.1/100
Stars: 890 · Forks: 76

Similar Projects

LightCompress

56

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python · 685

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 68.0K

langextract

88

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python · 34.4K

LightRAG

92

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python · 29.1K