This repository was archived by the owner on Sep 23, 2025. It is now read-only.

[Finetune] Integrate Chat template #178

Open

minmingzhu wants to merge 24 commits into intel:main from minmingzhu:chat_template

Contributor

minmingzhu commented Apr 8, 2024

No description provided.

carsonwang suggested changes

View reviewed changes

Contributor

carsonwang left a comment •

edited

Loading

Thanks for the work! Summarized the changes to update as we discussed offline:

Remove the added is_base_model parameter in the finetuning yaml file.
Allow user configuring chat_template in the yaml file. In most of the case, people don't configure it. Priority order: user configured chat_template > model's chat_template > our default template
Write the default template by following other models' (such as llama2 chat), that is, check roles in the message, etc.
The original data format needs to convert to chat format first, before applying the chat template.
Add unit tests to test the result after applying chat template, covering all use cases.
Support chat format as finetuning dataset format. Please follow openAI's format. We can support this in a separate PR.

minmingzhu force-pushed the chat_template branch from 3e6ccac to 6a0bf63 Compare

April 11, 2024 01:24

harborn reviewed

View reviewed changes

docs/finetune_parameters.md

    
              |lora_config|task_type: CAUSAL_LM<br>r: 8<br>lora_alpha: 32<br>lora_dropout: 0.1|Will be passed to the LoraConfig `__init__()` method, then it'll be used as config to build Peft model object.|

              |deltatuner_config|"algo": "lora"<br>"denas": True<br>"best_model_structure": "/path/to/best_structure_of_deltatuner_model"|Will be passed to the DeltaTunerArguments `__init__()` method, then it'll be used as config to build [Deltatuner model](https://github.com/intel/e2eAIOK/tree/main/e2eAIOK/deltatuner) object.|

              |enable_gradient_checkpointing|False|enable gradient checkpointing to save GPU memory, but will cost more compute runtime|

              |chat_template|None|User-defined chat template.|

Contributor

harborn Apr 18, 2024

Have you compared the impact of different templates on fine-tuning performance?

Contributor Author

minmingzhu Apr 22, 2024

not yet

minmingzhu force-pushed the chat_template branch from 42825d3 to 4fa89cc Compare

April 22, 2024 06:17

minmingzhu force-pushed the chat_template branch from 46733e7 to b5383ec Compare

May 9, 2024 06:41

minmingzhu and others added 20 commits

May 16, 2024 12:00


          implement fine-tuning chat template function

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

7f7d404

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

a3ce22f

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

b10cda3

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          integrate gbt for transformer 4.26.0

049304a

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

63a1217

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

58c9584

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          1. remove is_base_model tag

e2193ca

2. modify chat template

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

1090bf0

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          1. update doc/finetune_parameters.md

6bdd664

2. add unit test

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

4f0d118

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          Support latest Ray 2.10 release (intel#158)

e08a93c

* update

* fix blocking

* update

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* update

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* fix setup and getting started

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* update

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* update

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* nit

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* Add dependencies for tests and update pyproject.toml

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* Update dependencies and test workflow

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* Update dependencies and fix torch_dist.py

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

* Update OpenAI SDK installation and start ray cluster

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>

---------

Signed-off-by: Wu, Xiaochang <xiaochang.wu@intel.com>


          [Tests] Add query single test (intel#156)

1bbaf22

* single test

* single test

* single test

* single test

* fix hang error


          format

9498efe

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          [Finetune] use base model mpt-7b instead of mpt-7b-chat (intel#181)

115c513

* use base model mpt-7b instead of mpt-7b-chat

Signed-off-by: minmingzhu <minming.zhu@intel.com>

* manual setting specify tokenizer

Signed-off-by: minmingzhu <minming.zhu@intel.com>

* update

Signed-off-by: minmingzhu <minming.zhu@intel.com>

* update doc/finetune_parameters.md

Signed-off-by: minmingzhu <minming.zhu@intel.com>

---------

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          fix license issues

cfa3064

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          Update finetune.yaml

c0e4d2d


          refactor datap rocesser

b24c9f0

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

f0d94d1


          update

6075c2c

Signed-off-by: minmingzhu <minming.zhu@intel.com>

minmingzhu force-pushed the chat_template branch from c43a192 to 6075c2c Compare

May 16, 2024 06:27

minmingzhu added 4 commits

May 17, 2024 14:22


          update

c17ce45

Signed-off-by: minmingzhu <minming.zhu@intel.com>


          update

678d6e2


          update

294161d


          update

c104a3e

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet