TRL documentation
DeepSpeed Integration
Getting started
Conceptual Guides
How-to guides
Command Line Interface (CLI)Customizing the TrainingReducing Memory UsageSpeeding Up TrainingUsing Trained Models
Integrations
Examples
Example OverviewCommunity TutorialsSentiment TuningTraining StackLlamaDetoxifying a Language ModelLearning to Use ToolsMulti Adapter RLHF
API
You are viewing v0.15.2 version. A newer version v1.4.0 is available.