最終更新:2025-05-13 (火) 20:48:44 (31d)  

TRL
Top / TRL

Transformer Reinforcement Learning

A comprehensive library to post-train foundation models

https://github.com/huggingface/trl