support ORPO algorithm #854

hjh0119 · 2024-04-30T09:35:39Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Bug Fix

Training and inference for internvl-chat with GPUs that do not support flash attention
Problem with internVL-1.5 inference on V100 #870
Adding timm dependency requirements for several VL models
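
The first fix above concerns GPUs that cannot run flash attention, such as the V100. A minimal sketch of that kind of fallback check follows. It is not taken from this PR's diff: the helper name `pick_attention_backend`, the backend labels, and the `(8, 0)` threshold (assuming FlashAttention-2, which targets Ampere or newer) are all illustrative assumptions. In practice the capability tuple would come from something like `torch.cuda.get_device_capability()`.

```python
# Hypothetical helper for illustration only; not part of the swift codebase.
# Assumption: FlashAttention-2 needs compute capability 8.0 (Ampere) or newer.
FLASH_ATTN_MIN_CAPABILITY = (8, 0)


def pick_attention_backend(capability, flash_attn_installed=True):
    """Return 'flash' if the GPU can run flash attention, otherwise 'eager'.

    capability: (major, minor) compute capability of the GPU.
    flash_attn_installed: whether the flash-attn package is importable.
    """
    if flash_attn_installed and tuple(capability) >= FLASH_ATTN_MIN_CAPABILITY:
        return "flash"
    return "eager"


if __name__ == "__main__":
    # V100 has compute capability 7.0, A100 has 8.0.
    print("V100:", pick_attention_backend((7, 0)))
    print("A100:", pick_attention_backend((8, 0)))
```

Keeping the check as a pure function of the capability tuple makes it testable on machines without a GPU.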

New Feature

More Models or Datasets Support

dataset

…va-llama

* commit 'eb940b1a1e8ac1adcf79f5e566ba3d68b545f069':
  support ORPO algorithm (modelscope#854)
  fix dataset info deepcopy (modelscope#871)
  fix dataset_test_ratio=1 (modelscope#869)

# Conflicts:
#   README.md
#   README_CN.md

* main:
  add llava-llama (modelscope#873)
  support ORPO algorithm (modelscope#854)
  fix dataset info deepcopy (modelscope#871)
  fix dataset_test_ratio=1 (modelscope#869)

* main: (24 commits)
  fix pre-commit traindataset exception message (modelscope#859)
  Feat/pack (modelscope#881)
  fix swift cli exit code if subprocess is failed (modelscope#879)
  support Deepseek-V2-Chat and InternVL-Chat-V1.5-int8 model (modelscope#876)
  add llava-llama (modelscope#873)
  support ORPO algorithm (modelscope#854)
  fix dataset info deepcopy (modelscope#871)
  fix dataset_test_ratio=1 (modelscope#869)
  Refactor dataset (modelscope#802)
  Feat/loras (modelscope#865)
  Update tuner docs (modelscope#853)
  update docs (modelscope#850)
  Fix code format and docs (modelscope#847)
  update (modelscope#846)
  fix xcomposer device_map (modelscope#844)
  fix merge_lora_dtype (modelscope#842)
  Fix infer default dtype (modelscope#834)
  fix ui (modelscope#830)
  support Internvl-chat-v1.5 model (modelscope#824)
  ...

# Conflicts:
#   docs/source/LLM/自定义与拓展.md
#   docs/source_en/LLM/Customization.md
#   examples/pytorch/llm/custom.py
#   scripts/benchmark/exp_utils.py
#   scripts/utils/run_dataset_info.py
#   swift/aigc/diffusers/train_controlnet.py
#   swift/aigc/diffusers/train_controlnet_sdxl.py
#   swift/aigc/diffusers/train_text_to_image.py
#   swift/aigc/diffusers/train_text_to_image_lora.py
#   swift/aigc/diffusers/train_text_to_image_lora_sdxl.py
#   swift/aigc/diffusers/train_text_to_image_sdxl.py
#   swift/llm/__init__.py
#   swift/llm/deploy.py
#   swift/llm/dpo.py
#   swift/llm/export.py
#   swift/llm/infer.py
#   swift/llm/sft.py
#   swift/llm/tuner.py
#   swift/llm/utils/__init__.py
#   swift/llm/utils/argument.py
#   swift/llm/utils/client_utils.py
#   swift/llm/utils/dataset.py
#   swift/llm/utils/model.py
#   swift/llm/utils/preprocess.py
#   swift/llm/utils/template.py
#   swift/llm/utils/utils.py
#   swift/trainers/trainers.py
#   swift/tuners/base.py
#   swift/ui/llm_infer/llm_infer.py
#   swift/ui/llm_infer/runtime.py
#   swift/ui/llm_train/dataset.py
#   swift/ui/llm_train/llm_train.py
#   tests/llm/test_run.py

jinghan added 30 commits April 30, 2024 11:33

init

f197ebc

update

e462d08

init orpo argument

c27552d

merge origin

58fdec4

lint

ec10daa

update

ac8386c

update orpo main

02c1517

lint

2ef494a

update

053ca5a

fix

a9c1e3b

fix

04a1256

fix

34db90e

fix

198229f

fix trainer args

738ea8f

fix args'

dbd5cd9

fix

6ce2eb8

fix

9e79ff0

update

1e602bd

fix args

75f098d

fix args

7e2156a

update args

f0f358f

fix args

ef42ebb

fix

a708b59

update base model default template

9f6f4d2

fix

fe5829c

update doc/cli

55ed332

update

d0abca8

update

5eaf013

update dataset

83e85f0

merge main

7ec2048

jinghan added 3 commits May 6, 2024 19:55

update

52d357b

update

5f2ad96

update

d6c636f

hjh0119 changed the title ~~[WIP] support ORPO algorithm~~ support ORPO algorithm May 6, 2024

jinghan added 8 commits May 7, 2024 09:59

update doc

83bea24

fix internvl v100

daa19bb

lint

0e3f059

update

e268ed6

update

9ddf7d7

fix

1f87175

update internvl requirement

958025c

update

b4773a3

tastelikefeet approved these changes May 7, 2024

hjh0119 merged commit eb940b1 into modelscope:main May 7, 2024
1 of 2 checks passed

BIGBALLON mentioned this pull request May 7, 2024

how to run internvl1.5-8bit with nvidia v100 OpenGVLab/InternVL#144

Open

hjh0119 deleted the orpo branch May 7, 2024 11:14
