llm-decouple

Decoupling understanding from generation for large language models

# Loading NeoX (commit before FlashAttention v2: https://github.com/EleutherAI/gpt-neox/tree/70af6e84e1c0ffc2fdca89fb77b35a2ccbfceba9)

git clone git@github.com:EleutherAI/gpt-neox.git
cd gpt-neox
git checkout 70af6e8

# Preparing the environment

conda create -n neoxv4 python=3.8
conda activate neoxv4
conda install cudatoolkit=11.7 -c conda-forge
conda install -c conda-forge cudatoolkit-dev
export CUDA_HOME=PATH_TO_MINICONDA/miniconda3/envs/neoxv4
export LD_LIBRARY_PATH=PATH_TO_MINICONDA/lib:$LD_LIBRARY_PATH
pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html
conda install -c conda-forge mpi4py mpich
# run the following from inside the gpt-neox checkout from the previous step
pip install -r requirements/requirements.txt
pip install -r requirements/requirements-wandb.txt
pip install -r requirements/requirements-tensorboard.txt
python ./megatron/fused_kernels/setup.py install
pip install -r requirements/requirements-flashattention.txt
pip install triton
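The two `export` lines above hard-code a miniconda path. A minimal sketch of how to set them so that `CUDA_HOME` and `LD_LIBRARY_PATH` both point at the same conda env (the `$HOME/miniconda3` prefix below is an assumption; substitute your own install location):

```shell
# Hypothetical prefix; replace with your actual miniconda install path.
CONDA_PREFIX="$HOME/miniconda3/envs/neoxv4"
# cudatoolkit-dev installs nvcc and headers under the env prefix itself.
export CUDA_HOME="$CONDA_PREFIX"
# Point the loader at the env's own lib directory, where cudatoolkit's .so files live.
export LD_LIBRARY_PATH="$CONDA_PREFIX/lib:$LD_LIBRARY_PATH"
echo "$CUDA_HOME"
```

Pointing `LD_LIBRARY_PATH` at the env's `lib` (rather than the base miniconda `lib`) keeps the runtime libraries consistent with the toolkit the fused-kernels build compiles against.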

The OLMo environment uses transformers 1.17 (compatible with CUDA 11.6) and peft.

# Preparing the data

# This creates the JSONL files
bash new_preprocess.sh
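For reference, dolma-style document files are JSON Lines: one JSON object per line. The exact fields `new_preprocess.sh` emits are not shown here; the field names below (`id`, `text`, `source`) are an assumption based on dolma's document format:

```shell
# Write a one-line example document file (hypothetical content) and
# verify the line parses as valid JSON.
printf '%s\n' '{"id": "doc-0", "text": "example document", "source": "demo"}' > file.jsonl
python -m json.tool < file.jsonl > /dev/null && echo "valid JSONL"
```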

# Preparing data for dolma

Original files should go inside `dataset/documents`, tagged attributes go inside `dataset/attributes`, and the final output is written to `dataset/prepared`.

gzip file.jsonl        # compress the documents before tagging
bash dolma_tag.sh
bash dolma_mix.sh
gzip -d file.jsonl.gz  # decompress again after mixing
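Putting the pieces together, a sketch of the working layout the tagging and mixing scripts above assume (directory names are from this README; the example document contents are hypothetical):

```shell
# Recreate the expected working layout.
mkdir -p dataset/documents dataset/attributes dataset/prepared
# A gzipped documents file, as produced by the gzip step above.
printf '%s\n' '{"id": "doc-0", "text": "example"}' > dataset/documents/file.jsonl
gzip -f dataset/documents/file.jsonl
# Lists the three directories: attributes, documents, prepared.
ls dataset
```

`dolma_tag.sh` reads from `dataset/documents`, writes tagger output to `dataset/attributes`, and `dolma_mix.sh` combines both into `dataset/prepared`.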
