Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · gpustack/gpustack

gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

84.1/100
5.1K · Forks: 546

Similar Projects

sglang

91

SGLang is a high-performance serving framework for large language models and multimodal models.

Python · 28.9K

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 82.4K

unsloth

93

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python · 66.1K

CowAgent

95

Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (formerly chatgpt-on-wechat)

Python · 45.2K