Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++kvcache-ai/Mooncake

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

84.2/100
5.5KForks: 838
View on GitHubHomepage →
Loading report...

Similar Projects

vllm-ascend

78

Community maintained hardware plugin for vLLM on Ascend

C++2.2K

runanywhere-sdks

80

Production ready toolkit to run AI locally

C++10.3K

tiny-vllm

52

Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM

C++776

ScaleLLM

61

A high-performance inference system for large language models, designed for production environments.

C++500
Back to List