Fine-tuning on custom data #3

siamakzd · 2022-03-30T03:32:41Z

Thank you for sharing your great work!

If I want to fine-tune on a custom dataset, what should be the steps? i.e.

-Which scripts we need to modify?

Thanks in advance!

jpWang · 2022-03-30T07:21:53Z

Hi,
I think the main steps should be:

Organize your dataset into the format of FUNSD/XFUND, depending on your dataset is monolingual/multilingual.
Put YourDataset.py under LiLTfinetune/data/datasets/. You can refer to funsd.py/xfun.py.
Put run_YourDataset_YourTask.py under examples/. You can refer to run_funsd.py/run_xfun_re.py/run_xfun_ser.py.

If you want to do something beyond training/evaluating, You can add your code to the lines after the model makes predictions, such as https://github.com/jpWang/LiLT/blob/main/examples/run_funsd.py#L345 in run_funsd.py.

NielsRogge · 2022-11-21T18:33:35Z

Hi,

hamzabchiri · 2023-05-14T22:51:59Z

Hello,

Could you let me know when you have a Custom dataset and how to organize your dataset into the format of FUNSD/XFUND?

and do you recommend any tutorial for this step?

Thank you in advance.

jpWang pinned this issue Apr 6, 2022

hamzabchiri mentioned this issue May 16, 2023

dataset format of FUNSD/XFUND #45

Open

Provide feedback