@@ -1155,20 +1155,22 @@ Objectives
 
 LLM post-training requires specialized loss functions that are adapted to the unique characteristics of language models.
 
-GRPO
-~~~~
-
-The :class:`~torchrl.objectives.llm.GRPOLoss` class is a thin wrapper around the :class:`~torchrl.objectives.PPOLoss` class
-that codes the LLM-specific functionalities.
+GRPO, DAPO, CISPO
+^^^^^^^^^^^^^^^^^
 
 .. currentmodule:: torchrl.objectives.llm
 
 .. autosummary::
     :toctree: generated/
     :template: rl_template.rst
 
+    LLMLossOutput
     GRPOLoss
     GRPOLossOutput
+    CISPOLoss
+    CISPOLossOutput
+    DAPO
+    DAPOLossOutput
     MCAdvantage
 
 SFT