Merge pull request EleutherAI#154 from l-ma/l-ma-deduped-link
Correct link to huggingface
haileyschoelkopf committed Mar 18, 2024
2 parents 1b4ffc3 + 8a0efac commit 1441afc
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -201,7 +201,7 @@ Next, we must download the appropriate dataset. We provide preshuffled versions
- To download the deduped version, use
```py
from huggingface_hub import hf_hub_download
-hf_hub_download(repo_id="EleutherAI/pile-standard-pythia-preshuffled", repo_type="dataset", cache_dir="path/to/local/folder")
+hf_hub_download(repo_id="EleutherAI/pile-deduped-pythia-preshuffled", repo_type="dataset", cache_dir="path/to/local/folder")
```
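
The change above swaps one repository id for another, so the choice of variant can be sketched as a small lookup. This is a minimal sketch, not the README's own method: the helper names `pile_repo_id` and `download_preshuffled` are hypothetical, the two repo ids are copied from the diff, and `snapshot_download` is swapped in on the assumption that the whole dataset repository is wanted (`hf_hub_download` fetches a single file and also expects a `filename` argument).

```python
# Repo ids as they appear in the diff; the variant keys are illustrative.
PILE_REPOS = {
    "standard": "EleutherAI/pile-standard-pythia-preshuffled",
    "deduped": "EleutherAI/pile-deduped-pythia-preshuffled",
}


def pile_repo_id(variant: str) -> str:
    """Return the dataset repo id for a variant (hypothetical helper)."""
    try:
        return PILE_REPOS[variant]
    except KeyError:
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {sorted(PILE_REPOS)}"
        ) from None


def download_preshuffled(variant: str, cache_dir: str) -> str:
    """Download every file of the chosen dataset repo and return the local path.

    Assumption: a full-repo download is wanted, so snapshot_download is used
    instead of the single-file hf_hub_download shown in the README.
    """
    # Imported lazily so the lookup helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=pile_repo_id(variant),
        repo_type="dataset",
        cache_dir=cache_dir,
    )
```

Calling `download_preshuffled("deduped", "path/to/local/folder")` would then fetch the deduped shards. Keeping the ids in one table makes a mix-up like the one this commit fixes easier to spot.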

You can now merge the files by using the script `utils/unshard_mmap.py` :