Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++mozilla-ai/llamafile

llamafile

Distribute and run LLMs with a single file.

88.5/100
24.8KForks: 1.4K
View on GitHubHomepage →
Loading report...

Similar Projects

RCLI

62

Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG

C++1.5K

lucebox-hub

74

Fast LLM speculative inference server for consumer hardware.

C++2.4K

whisper.cpp

85

Port of OpenAI's Whisper model in C/C++

C++50.6K

PowerInfer

65

High-speed Large Language Model Serving for Local Deployment

C++9.5K
Back to List