Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python | OpenRLHF/OpenRLHF

OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

89.2/100
9.6K | Forks: 965

Similar Projects

ml-engineering

72

Machine Learning Engineering Open Book

Python | 18.1K

PaLM-rlhf-pytorch

83

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python | 7.9K

train-llm-from-scratch

76

A straightforward method for training your LLM, from downloading data to generating text.

Python | 4.9K

LLM-RL-Visualized

70

🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》! 💥 (100+ LLM/RL Algorithm Maps)

Python | 4.5K