Skip to content

Commit 1d9fd2d

Browse files
committed
Update (base update)
[ghstack-poisoned]
1 parent ae87a4e commit 1d9fd2d

File tree

1 file changed

+7
-5
lines changed

1 file changed

+7
-5
lines changed

docs/source/reference/llms.rst

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1155,20 +1155,22 @@ Objectives
11551155

11561156
LLM post-training requires specialized loss functions that are adapted to the unique characteristics of language models.
11571157

1158-
GRPO
1159-
~~~~
1160-
1161-
The :class:`~torchrl.objectives.llm.GRPOLoss` class is a thin wrapper around the :class:`~torchrl.objectives.PPOLoss` class
1162-
that codes the LLM-specific functionalities.
1158+
GRPO, DAPO, CISPO
1159+
^^^^^^^^^^^^^^^^^
11631160

11641161
.. currentmodule:: torchrl.objectives.llm
11651162

11661163
.. autosummary::
11671164
:toctree: generated/
11681165
:template: rl_template.rst
11691166

1167+
LLMLossOutput
11701168
GRPOLoss
11711169
GRPOLossOutput
1170+
CISPOLoss
1171+
CISPOLossOutput
1172+
DAPO
1173+
DAPOLossOutput
11721174
MCAdvantage
11731175

11741176
SFT

0 commit comments

Comments
 (0)