An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Discover insights about your collaboration patterns
We're continuously adding new repositories and metrics to build a more comprehensive picture of open source collaboration. Your contributions help the entire community learn and improve.