Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++kvcache-ai/Mooncake

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

84.1/100
5.2KForks: 694
View on GitHubHomepage →
Loading report...

Similar Projects

vllm-ascend

78

Community maintained hardware plugin for vLLM on Ascend

C++2.0K

runanywhere-sdks

80

Production ready toolkit to run AI locally

C++10.3K

whisper.cpp

85

Port of OpenAI's Whisper model in C/C++

C++48.9K

MNN

94

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++15.0K
Back to List