Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++ | vllm-project/vllm-ascend

vllm-ascend

Community-maintained hardware plugin for vLLM on Ascend

78.3/100
2.2K | Forks: 1.4K

Similar Projects

Mooncake

84

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ | 5.5K

ZhiLight

58

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ | 905

ScaleLLM

61

A high-performance inference system for large language models, designed for production environments.

C++ | 500

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python | 82.4K