You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have corpus of ~10GB of adult stories, in English, in plain text, taken primarily from asstr.org and literotica.
I think it would be interesting to incorporate these into the training set as well.
The text was updated successfully, but these errors were encountered:
One of your datasources is directly named and excluded there, and the other one, probably follows the same rationale. Their reasons for excluding these were much different from the reasons for which I would have excluded them were it my choice (my rationale is x in, x out -> where x = {copyright infringement, nsfw content}), but they had a more scientific rationale you can read there.
I have corpus of ~10GB of adult stories, in English, in plain text, taken primarily from asstr.org and literotica.
I think it would be interesting to incorporate these into the training set as well.
The text was updated successfully, but these errors were encountered: