Skip to content

Commit 045f308

Browse files
fix trl, release
1 parent 7f46f98 commit 045f308

File tree

6 files changed

+41
-5
lines changed

6 files changed

+41
-5
lines changed
Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
task: llm-orpo
2+
base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
3+
project_name: autotrain-smallm2-orpo
4+
log: tensorboard
5+
backend: local
6+
7+
data:
8+
path: argilla/distilabel-capybara-dpo-7k-binarized
9+
train_split: train
10+
valid_split: null
11+
chat_template: chatml
12+
column_mapping:
13+
text_column: chosen
14+
rejected_text_column: rejected
15+
prompt_text_column: prompt
16+
17+
params:
18+
block_size: 1024
19+
model_max_length: 2048
20+
max_prompt_length: 512
21+
epochs: 3
22+
batch_size: 2
23+
lr: 3e-5
24+
peft: true
25+
quantization: int4
26+
target_modules: all-linear
27+
padding: right
28+
optimizer: adamw_torch
29+
scheduler: linear
30+
gradient_accumulation: 4
31+
mixed_precision: fp16
32+
33+
hub:
34+
username: ${HF_USERNAME}
35+
token: ${HF_TOKEN}
36+
push_to_hub: false

src/autotrain/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@
4545
warnings.filterwarnings("ignore", category=UserWarning, module="huggingface_hub")
4646

4747
logger = Logger().get_logger()
48-
__version__ = "0.8.30.dev0"
48+
__version__ = "0.8.30"
4949

5050

5151
def is_colab():

src/autotrain/trainers/clm/train_clm_dpo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -109,7 +109,7 @@ def train(config):
109109
ref_model=model_ref,
110110
train_dataset=train_data,
111111
eval_dataset=valid_data if config.valid_split is not None else None,
112-
tokenizer=tokenizer,
112+
processing_class=tokenizer,
113113
peft_config=peft_config if config.peft else None,
114114
)
115115

src/autotrain/trainers/clm/train_clm_orpo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ def train(config):
4848
**trainer_args,
4949
train_dataset=train_data,
5050
eval_dataset=valid_data if config.valid_split is not None else None,
51-
tokenizer=tokenizer,
51+
processing_class=tokenizer,
5252
peft_config=peft_config if config.peft else None,
5353
)
5454

src/autotrain/trainers/clm/train_clm_reward.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -116,7 +116,7 @@ def train(config):
116116
train_dataset=train_data,
117117
eval_dataset=valid_data if config.valid_split is not None else None,
118118
peft_config=peft_config if config.peft else None,
119-
tokenizer=tokenizer,
119+
processing_class=tokenizer,
120120
)
121121

122122
trainer.remove_callback(PrinterCallback)

src/autotrain/trainers/clm/train_clm_sft.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ def train(config):
4848
train_dataset=train_data,
4949
eval_dataset=valid_data if config.valid_split is not None else None,
5050
peft_config=peft_config if config.peft else None,
51-
tokenizer=tokenizer,
51+
processing_class=tokenizer,
5252
)
5353

5454
trainer.remove_callback(PrinterCallback)

0 commit comments

Comments
 (0)