โ† Back to List
โš 
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonsail-sg/oat

oat

๐ŸŒพ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

61.6/100
โ˜… 660Forks: 63
View on GitHub โ†’
Loading report...

Similar Projects

OpenJudge

76

OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards

Pythonโ˜… 654

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Pythonโ˜… 72.0K

alignment-handbook

71

Robust recipes to align language models with human and AI preferences

Pythonโ˜… 5.6K

MedicalGPT

84

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. ่ฎญ็ปƒๅŒป็–—ๅคงๆจกๅž‹๏ผŒๅฎž็Žฐไบ†ๅŒ…ๆ‹ฌๅขž้‡้ข„่ฎญ็ปƒ(PT)ใ€ๆœ‰็›‘็ฃๅพฎ่ฐƒ(SFT)ใ€RLHFใ€DPOใ€ORPOใ€GRPOใ€‚

Pythonโ˜… 5.5K
โ† Back to List