⚠ Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

Python · toverainc/willow-inference-server

willow-inference-server

An open-source, local, self-hosted, and highly optimized language inference server supporting ASR/STT, TTS, and LLM over WebRTC, REST, and WebSocket (WS).

48.3/100

★ 508 · Forks: 60

Similar Projects

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · ★ 85.3K

sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python · ★ 29.9K

OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python · ★ 12.4K

FunClip

FunASR-powered video transcription, subtitle generation, and LLM-assisted clipping tool with a local Gradio UI.

Python · ★ 5.9K
