Nano vLLM
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
LLM training code for Databricks foundation models
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.