Model sampling/text generation #160

joshlk · 2021-03-05T15:38:40Z

Text generation is working with no output (unconditional) and with an input file. By "working" I mean it is loading a model, creating output and saving it to file. The output is currently utter gibberish as I'm testing it on basically a randomly initialised model.

Steps to test:

Generate a model:

first change the save-interval to 10 or something small in the model config.
Run a model (note this now saves the model config in the checkpoint directory so you can easily determine what the model data corresponds to):

./deepy.py pretrain_gpt2.py -d configs small.yml eleutherai_cluster.yml and wait for a checkpoint to save.

To test unconditional text generation:

Make sure "text-gen-type": "unconditional" is set in the text_generation.yml config
Run an text generation job loading the generated model:

./deepy.py text_gen_gpt2.py -d configs small.yml eleutherai_cluster.yml text_generation.yml

Output will be found in /mnt/ssd-cluster/output/text_generation.txt

To test input-file text generation:

Make sure "text-gen-type": "input-file" is set in the text_generation.yml config
Create an new line delineated input file at /mnt/ssd-cluster/output/sample_input.txt
Run a text generation job:

./deepy.py text_gen_gpt2.py -d configs small.yml eleutherai_cluster.yml text_generation.yml

Output will be found in /mnt/ssd-cluster/output/sample_output.txt

Todo:

# Conflicts: # Dockerfile # requirements.txt

commit 43be6ce Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 10:19:12 2021 +0000 Remove debugging commit 450dfb9 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 10:04:23 2021 +0000 Test `input-file` commit 3d0f562 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:52:57 2021 +0000 Skip tokens that don't exist commit 9384ace Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:48:52 2021 +0000 Debug commit 82871dd Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:47:49 2021 +0000 Debug commit 5d1d19e Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:44:50 2021 +0000 Remove debugging commit dc4cf73 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:41:33 2021 +0000 Debug commit a4d2bf3 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:33:29 2021 +0000 Force activation checkpointing to be disabled commit 901aa73 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:29:27 2021 +0000 Debugging commit c9706d1 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 09:19:41 2021 +0000 Debugging commit a4835b1 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 08:55:17 2021 +0000 Debugging commit 747b9a9 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 08:46:48 2021 +0000 Debugging commit 36137c1 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 08:28:54 2021 +0000 Debugging commit f58ea20 Author: Josh Levy-Kramer <[email protected]> Date: Tue Mar 9 08:22:35 2021 +0000 Debugging commit 4ff6300 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 18:47:04 2021 +0000 Change port based on rank commit be4b715 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 18:45:13 2021 +0000 Change port based on rank commit 63a1be0 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 18:43:52 2021 +0000 Change port based on rank commit 1b8f6b9 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 18:16:58 2021 +0000 pycharm debugger commit 1110bee Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 18:05:47 2021 +0000 pycharm debugger commit 90bb5d4 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 17:51:46 2021 +0000 Test: try manhole commit 6fbcc1f Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 17:50:08 2021 +0000 Test: try manhole commit a272a59 Author: Josh Levy-Kramer <[email protected]> Date: Mon Mar 8 17:46:50 2021 +0000 Test: try manhole

…ext fn

…rallel)

sdtblck · 2021-04-08T16:01:54Z

Ok, this is now working with pipeline parallel size > 1.

The merge conflicts will be quite intense (since rotary pos emb means the number of args passed between pipe stages can differ). I'll sort these soon and we can merge this.

…del_sampling

sdtblck · 2021-04-08T23:12:20Z

Ok so sampling is now working with all pipeline parallel sizes. It's the most awful, hacky code ever, you need to update your deeperspeed branch to the latest commit to get it to work. This should be ready to be merged now imo.

Bring all dependencies up to their latest stable versions.

changes have been made

joshlk added 15 commits March 5, 2021 15:14

Use ConfigMonster for text_generation

6e02cd1

Make text gen executable

5010a7c

Move wandb args

793963a

False values should not exist

ff2a257

Give helpful error

54a16fd

Turn off wandb for text gen

a5d8c37

Load model using deepspeed

561abd5

Save config to checkpoint directory

899bb6b

Pretty print json

8f52fc1

Pretty print json

d9c9d16

Use deepy.py launcher

7aa2bd5

Make wandb_group a default config in deepy.py

737befe

Make wandb_group a default config in deepy.py

8224bbd

Make wandb_group a default config in deepy.py

34b1873

Generate text

4630b17

StellaAthena linked an issue Mar 5, 2021 that may be closed by this pull request

Ensure Sampling works correctly #152

Closed

joshlk added 10 commits March 6, 2021 18:06

Update to pytorch 1.8 and deeperspeed

1ff9f9b

Test

207acf1

Test

750ff49

Test

5374240

test genfile

7639fa9

Text gen to file

ce2fb1c

Use if switch to determine text gen type

1125475

Merge branch 'main' into model_sampling

21f97e9

# Conflicts: # Dockerfile # requirements.txt

Add vim

8a7c961

joshlk force-pushed the model_sampling branch from 43be6ce to 851903b Compare March 9, 2021 10:22

joshlk added 3 commits March 9, 2021 10:27

Tidy config

eb76fbd

Rework generate_samples_input_from_file so more like unconditional

3cffe01

Output to file

eb075ef

sdtblck and others added 7 commits March 25, 2021 13:37

make no_load_optim arg work with deepspeed

e6ef968

fix args stuff

fdbc2ee

get pp model sampling working

02a99d7

add warning if ppsize > 1, make self.inference toggleable

0f92fc1

add documentation to text_generation_utils.py & add generate from t…

901d79e

…ext fn

fix all .forward() methods to use only tensors (necessary for pipe pa…

62a8934

…rallel)

get inference working with pipe_parallel_size > 1

5f22d5d

Merge branch 'main' of https://github.com/EleutherAI/gpt-neox into mo…

7a46aec

…del_sampling

sdtblck and others added 5 commits April 8, 2021 23:14

update requirements

4879da3

Delete requirements.txt

aaa1dc3

Dependency updates

3a7ba11

Bring all dependencies up to their latest stable versions.

Merge branch 'main' into model_sampling

fffff66

fixes

2700569

ShivanshuPurohit self-requested a review April 19, 2021 21:21

ShivanshuPurohit approved these changes Apr 19, 2021

View reviewed changes

ShivanshuPurohit previously approved these changes Apr 19, 2021

View reviewed changes

fix import error

a649a66

sdtblck dismissed ShivanshuPurohit’s stale review via a649a66 April 19, 2021 21:32

Merge branch 'main' into model_sampling

27defac

sdtblck previously approved these changes Apr 22, 2021

View reviewed changes

ShivanshuPurohit previously approved these changes Apr 22, 2021

View reviewed changes

Remove duplicate Embedding Class

1f861e6

sdtblck dismissed stale reviews from ShivanshuPurohit and themself via 1f861e6 April 22, 2021 12:11

sdtblck approved these changes Apr 22, 2021

View reviewed changes

sdtblck merged commit e1f7fcb into main Apr 22, 2021

sdtblck deleted the model_sampling branch April 22, 2021 12:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model sampling/text generation #160

Model sampling/text generation #160

joshlk commented Mar 5, 2021 •

edited

Loading

sdtblck commented Apr 8, 2021

sdtblck commented Apr 8, 2021

Model sampling/text generation #160

Model sampling/text generation #160

Conversation

joshlk commented Mar 5, 2021 • edited Loading

sdtblck commented Apr 8, 2021

sdtblck commented Apr 8, 2021

joshlk commented Mar 5, 2021 •

edited

Loading