Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonmbzuai-oryx/Awesome-LLM-Post-training

Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

55.8/100
2.4KForks: 157
View on GitHub
Loading report...

Similar Projects

OpenRLHF

89

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python9.4K

Skywork-R1V

59

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python3.2K

verl-agent

65

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python1.8K

safe-rlhf

51

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python1.6K
Back to List