Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonsgl-project/sglang

sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

90.7/100
28.9KForks: 6.4K
View on GitHubHomepage →
Loading report...

Similar Projects

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python82.4K

gpustack

84

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python5.1K

tokenspeed

74

TokenSpeed is a speed-of-light LLM inference engine.

Python1.4K

Automodel

73

🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python572
Back to List