-
Notifications
You must be signed in to change notification settings - Fork 155
Issues: EleutherAI/pythia
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Batch Viewer : Why Sequence Length 2049?
#123
by prakharg24
was closed Nov 2, 2023
updated Jun 19, 2024
release of checkpoints of different steps
#101
by TobiasLee
was closed May 26, 2023
updated Mar 1, 2024
Is there existing code to resume training from specific checkpoint?
#150
by javirandor
was closed Mar 1, 2024
updated Mar 1, 2024
Missing / undownloadable checkpoints on huggingface
#141
by mirandrom
was closed Jan 30, 2024
updated Feb 1, 2024
Would it be possible to share training loss curves on the original Pythia models?
#145
by itsnamgyu
was closed Jan 22, 2024
updated Jan 22, 2024
Details about "EleutherAI/pythia-160m-seed*" models
#142
by IanMagnusson
was closed Dec 12, 2023
updated Jan 4, 2024
Deduplicated Pile dataset with Domain Attribution
#137
by michaelduan8
was closed Nov 21, 2023
updated Nov 21, 2023
The performance about pythia and LLaMA model architecture
#122
by peiyingxin
was closed Oct 30, 2023
updated Oct 30, 2023
Convert the huggingface checkpoint to GPT-Neox checkpoint
#116
by ZhiYuanZeng
was closed Aug 13, 2023
updated Aug 13, 2023
Clarification of Pythia tokenizer(s) at different sizes, steps and data preprocessing?
#115
by RylanSchaeffer
was closed Aug 3, 2023
updated Aug 3, 2023
Difference between LFS and HuggingFace datasets?
#112
by eric-mitchell
was closed Jul 21, 2023
updated Jul 21, 2023
Is there a template poilerplate for the prompt used in C.1 gender bias intervention?
#106
by ruyuan-zuo
was closed Jul 21, 2023
updated Jul 21, 2023
Is there an access to the deduplicated version of the data with meta info?
#92
by Jason3900
was closed Jul 21, 2023
updated Jul 21, 2023
Can I provide custom data and continue training Pythia on this new data?
#113
by GeorgiAngelov
was closed Jul 21, 2023
updated Jul 21, 2023
What tool do you use for your data preprocessing/binarization?
#69
by ajesujoba
was closed Apr 18, 2023
updated Jul 3, 2023
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.