Skip to content

Issues: EleutherAI/pythia

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Batch Viewer : Why Sequence Length 2049?
#123 by prakharg24 was closed Nov 2, 2023 updated Jun 19, 2024
Reshape error in batch viewer
#158 by activatedgeek was closed May 11, 2024 updated May 11, 2024
release of checkpoints of different steps
#101 by TobiasLee was closed May 26, 2023 updated Mar 1, 2024
Missing / undownloadable checkpoints on huggingface
#141 by mirandrom was closed Jan 30, 2024 updated Feb 1, 2024
Mismatch about the evaluation results
#118 by yuzc19 was closed Oct 30, 2023 updated Jan 20, 2024
Details about "EleutherAI/pythia-160m-seed*" models
#142 by IanMagnusson was closed Dec 12, 2023 updated Jan 4, 2024
Pytia or GPT-neox?
#138 by borgr was closed Dec 12, 2023 updated Dec 12, 2023
.
#140 by ParthaKrPaul was closed Dec 2, 2023 updated Dec 2, 2023
Replicating the Training Data Order
#136 by prakharg24 was closed Nov 23, 2023 updated Nov 23, 2023
Deduplicated Pile dataset with Domain Attribution
#137 by michaelduan8 was closed Nov 21, 2023 updated Nov 21, 2023
the loss of pythia training
#97 by Wangpeiyi9979 was closed May 1, 2023 updated Nov 10, 2023
The value of weight decay
#132 by yehuitang was closed Nov 9, 2023 updated Nov 9, 2023
Error when running unshard_memmap.py
#114 by ShaneeyS was closed Nov 4, 2023 updated Nov 4, 2023
Model Initialization Question
#129 by yanlai00 was closed Nov 4, 2023 updated Nov 4, 2023
The performance about pythia and LLaMA model architecture
#122 by peiyingxin was closed Oct 30, 2023 updated Oct 30, 2023
Weights tying
#117 by link-er was closed Oct 8, 2023 updated Oct 8, 2023
Convert the huggingface checkpoint to GPT-Neox checkpoint
#116 by ZhiYuanZeng was closed Aug 13, 2023 updated Aug 13, 2023
Difference between LFS and HuggingFace datasets?
#112 by eric-mitchell was closed Jul 21, 2023 updated Jul 21, 2023
What tool do you use for your data preprocessing/binarization?
#69 by ajesujoba was closed Apr 18, 2023 updated Jul 3, 2023
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.