-
Notifications
You must be signed in to change notification settings - Fork 156
Issues: EleutherAI/pythia
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Will memorization experimental codes be released?
#98
by chujiezheng
was closed May 22, 2023
updated May 22, 2023
Revamp experiment organization and migrate code when necessary
documentation
Improvements or additions to documentation
#99
by StellaAthena
was closed Jun 4, 2023
updated Jun 4, 2023
3 tasks
pythia-12b checkpoints missing on HuggingFace for step4000 and step32000
#107
by byungdoh
was closed Jun 4, 2023
updated Jun 4, 2023
Possible error in Pythia-12B-deduped step 32000
#108
by smahdavi4
was closed Jun 15, 2023
updated Jun 15, 2023
Multiple training runs of same model with different random seed for weight initialisation
#110
by KarolisRam
was closed Jun 22, 2023
updated Jun 22, 2023
What tool do you use for your data preprocessing/binarization?
#69
by ajesujoba
was closed Apr 18, 2023
updated Jul 3, 2023
Can I provide custom data and continue training Pythia on this new data?
#113
by GeorgiAngelov
was closed Jul 21, 2023
updated Jul 21, 2023
Is there an access to the deduplicated version of the data with meta info?
#92
by Jason3900
was closed Jul 21, 2023
updated Jul 21, 2023
Is there a template poilerplate for the prompt used in C.1 gender bias intervention?
#106
by ruyuan-zuo
was closed Jul 21, 2023
updated Jul 21, 2023
Difference between LFS and HuggingFace datasets?
#112
by eric-mitchell
was closed Jul 21, 2023
updated Jul 21, 2023
Clarification of Pythia tokenizer(s) at different sizes, steps and data preprocessing?
#115
by RylanSchaeffer
was closed Aug 3, 2023
updated Aug 3, 2023
Convert the huggingface checkpoint to GPT-Neox checkpoint
#116
by ZhiYuanZeng
was closed Aug 13, 2023
updated Aug 13, 2023
The performance about pythia and LLaMA model architecture
#122
by peiyingxin
was closed Oct 30, 2023
updated Oct 30, 2023
Deduplicated Pile dataset with Domain Attribution
#137
by michaelduan8
was closed Nov 21, 2023
updated Nov 21, 2023
Details about "EleutherAI/pythia-160m-seed*" models
#142
by IanMagnusson
was closed Dec 12, 2023
updated Jan 4, 2024
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.