add fine-tuning codebase toward StableLM-JP-Tuned-Alpha #4

Merged: 20 commits merged into Stability-AI:jp-stable on May 30, 2023

Conversation

@fujiki-saij commented May 19, 2023

TODOs

  • refactor the initial codebase according to the codebase used for StableLM Tuned Alpha.
  • put configs in a separate file
  • add inference code

@fujiki-saij self-assigned this on May 19, 2023
@fujiki-saij marked this pull request as draft on May 19, 2023 02:08

@leemengtw (Collaborator) left a comment

@fujiki-saij except that some small changes are needed, generally LGTMeng. Feel free to merge after considering my comments, thanks!

data_path: "fujiki/japanese_alpaca_data"
train_size: 0.98
trainer: "text"
max_text_len": 1024

Collaborator:
let's remove the additional " here.

logging_steps: 100
save_strategy: "steps"
save_steps: 100
save_total_limit": 2

Collaborator:
Currently, passing this key to the training config will raise an error. Could you double-check whether we need to rename or remove it?

Author:
OK, let me double-check these configurations. (I just manually translated the config into this YAML format, so there might be some mistakes.)
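
For reference, here is a corrected sketch of the two config excerpts above, assuming the stray trailing quotes are the only problem (whether save_total_limit needs to be renamed or removed still depends on what the training config actually accepts):

```yaml
# Corrected sketch of the excerpts above (hypothetical; verify against the actual training config schema)
data_path: "fujiki/japanese_alpaca_data"
train_size: 0.98
trainer: "text"
max_text_len: 1024        # was: max_text_len": 1024

logging_steps: 100
save_strategy: "steps"
save_steps: 100
save_total_limit: 2       # was: save_total_limit": 2
```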

Comment on lines 22 to 29
if "rinna" in config["tokenizer_path"]:
tokenizer = AutoTokenizer.from_pretrained(
config["tokenizer_path"], use_fast=False, cache_dir=config.get("cache_dir", None),
)
else:
tokenizer = AutoTokenizer.from_pretrained(
config["tokenizer_path"], cache_dir=config.get("cache_dir", None),
)

Collaborator:
Two changes might be needed here:

  1. change config["tokenizer_path"] to config["tokenizer"]["tokenizer_name_or_path"] to match what you have in sample.yaml
  2. maybe remove the if statement and just use a general-purpose one:

tokenizer = AutoTokenizer.from_pretrained(
    config["tokenizer"]["tokenizer_name_or_path"], use_fast=config.get("use_fast", False), cache_dir=config.get("cache_dir", None),
)
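
sample.yaml itself is not shown in this thread; the suggestion above assumes a nested tokenizer section roughly like the following (a hypothetical sketch, with use_fast and cache_dir at the top level to match the config.get(...) calls):

```yaml
# Hypothetical shape of sample.yaml assumed by the suggested snippet above
use_fast: false                      # e.g. rinna tokenizers need the slow SentencePiece tokenizer
cache_dir: null
tokenizer:
  tokenizer_name_or_path: "<tokenizer name or local path>"
```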

Author:
I thought I had already committed and pushed this change, but I had only staged it and forgot to commit and push it: 0691ffa
Thanks for the review!

Comment on lines +55 to +62
# TODO: clean up comment outed lines
# # Data expected in prompt response pairs
# if os.path.exists(cache_name + "inputids.pt"):
#     print("using cached dataset")
#     self.input_ids = torch.load(cache_name + "inputids.pt")
#     self.attn_masks = torch.load(cache_name + "attnmask.pt")
#     self.labels = torch.load(cache_name + "inputids.pt")
#     return

Collaborator:
Oh yep, maybe clean it up a little and add back the cache feature. Right now the cache_name param is not used, and the cache would be helpful for debugging purposes 👍
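
A minimal sketch of how the cache feature could be restored inside the dataset class, assuming it tokenizes into self.input_ids, self.attn_masks, and self.labels as in the commented-out code (the class and method names here are hypothetical):

```python
import os

import torch


class _CacheMixinSketch:
    # Hypothetical helpers, intended to live on the dataset class itself.

    def _load_cache(self, cache_name: str) -> bool:
        """Return True if cached tensors were loaded and tokenization can be skipped."""
        if cache_name and os.path.exists(cache_name + "inputids.pt"):
            print("using cached dataset")
            self.input_ids = torch.load(cache_name + "inputids.pt")
            self.attn_masks = torch.load(cache_name + "attnmask.pt")
            # labels equal input_ids for causal-LM fine-tuning, so reuse the same file
            self.labels = torch.load(cache_name + "inputids.pt")
            return True
        return False

    def _save_cache(self, cache_name: str) -> None:
        """Persist tokenized tensors so later runs can reuse them (handy for debugging)."""
        if cache_name:
            torch.save(self.input_ids, cache_name + "inputids.pt")
            torch.save(self.attn_masks, cache_name + "attnmask.pt")
```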

# remove parenthesis that might be introduced by some NMTs
new_example = {}
for k, v in example.items():
    if example[k].startswith("「") and example[k].endswith("」"):

Collaborator:
JFYI, when I ran the program, this if statement was never triggered.

Author:
That's right. I've already updated the dataset on HF by applying this processing, so we don't need this functionality here anymore.
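
For the record, the full preprocessing that the truncated snippet above performs is roughly this (a sketch of the step applied to the HF dataset; the function name is made up here):

```python
def strip_nmt_brackets(example: dict) -> dict:
    """Remove corner brackets 「…」 that some machine-translation systems wrap around a field."""
    new_example = {}
    for k, v in example.items():
        if v.startswith("「") and v.endswith("」"):
            new_example[k] = v[1:-1]
        else:
            new_example[k] = v
    return new_example
```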

@fujiki-saij changed the title from [WIP] add fine-tuning codebase toward StableLM-JP-Tuned-Alpha to add fine-tuning codebase toward StableLM-JP-Tuned-Alpha on May 25, 2023
@fujiki-saij marked this pull request as ready for review on May 25, 2023 09:36
### 応答:
{response}"""

NO_INPUT_PROMPT = """以下は、タスクを説明する指示と、文脈のある入力の組み合わせです。要求を適切に満たす応答を書きなさい。

Collaborator:
@fujiki-saij I think we should modify this instruction: since there is no 文脈のある入力 ("input that provides context"), the phrase 文脈のある入力の組み合わせ ("combination with an input that provides context") is incorrect, and we might get better results with an improved instruction.

Author:
OK. Let me add prompt template versioning functionality to experiment with different templates (in this PR or another PR).
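
A minimal sketch of what such prompt-template versioning could look like (the registry, version keys, and the revised v1 wording are hypothetical, not part of this PR):

```python
# Hypothetical template registry keyed by version; a config value such as
# prompt_version: "v1" would select which set of templates to use.
PROMPT_TEMPLATES = {
    "v0": {
        # current wording, which mentions an accompanying contextual input
        "no_input": "以下は、タスクを説明する指示と、文脈のある入力の組み合わせです。要求を適切に満たす応答を書きなさい。",
    },
    "v1": {
        # candidate rewording for the no-input case (instruction only)
        "no_input": "以下は、タスクを説明する指示です。要求を適切に満たす応答を書きなさい。",
    },
}


def get_prompt_template(version: str, kind: str) -> str:
    """Look up a prompt template by version ("v0", "v1", ...) and kind ("no_input", ...)."""
    return PROMPT_TEMPLATES[version][kind]
```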

@fujiki-saij merged commit 0a471e8 into Stability-AI:jp-stable on May 30, 2023
leemengtw pushed a commit that referenced this pull request on Jun 7, 2023: Get EOS ID from SentencePiece directly