-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Story_cloze fails #14
Comments
Probably due to these lines. Does someone know why they are here? if set_name != "story-cloze":
raw_set = load_dataset(*getLoadName(set_name))
else:
raw_set = load_dataset(*getLoadName(set_name), data_dir="./datasets/rawdata") |
We are cleaning up the whole generation process right now.. I'll have a look at this too |
Story Cloze requires manual download after requesting the data from this form: |
The files are super lightweight (<1MB). I'm considering adding them to the repo rather than asking users to download them... Or at least have a simple .sh scirpt which downloads them from somewhere @norabelrose what do you think about it? |
@FabienRoger I'm not sure the people who made the dataset would be happy with us posting it in this repo? Presumably they ask people to fill out the form for a reason. I'd prefer to just remove the special case for |
I think we should still keep the special case code around, at least for replications. But I'll add an appropriate error message that tells you which form you should fill (I filled it and I received the data automatically). You can't just remove the special case and still use story-cloze because huggingface raises an error if you try to load it
|
This dataset is not recognized by huggingface
The text was updated successfully, but these errors were encountered: