- Reddit crawler
- subreddits
-
BrainStorming
- r/AskEngineers
- r/financialindependence
- r/Entrepreneur
- r/smallbusiness
- r/lifehacks
- r/productivity
- r/GetMotivated
- r/GetStudying
- r/Cooking
-
Creative Writing
- r/fantasywriters
- r/WritingPrompts
- r/ShortStories
- r/Jokes
-
modification of dataset
- is text in QA pair better than pure text when considering creativity
-
- Base LLM
- LLMs with SFT and RLHF
- Fine-tuned with creative dataset
- repetition penalty
- https://aclanthology.org/2023.acl-long.34.pdf
- NS-FH https://aclanthology.org/2023.acl-demo.6.pdf
- Contrastive decoding ? https://openreview.net/pdf?id=V88BafmH9Pj
- Samplaing temperature
- semantic diverse beam search
- AI Feedbacks
- Random words in prompts
- Model ensembling
- ranking
- voting
- NLTK topic normalisation
- lexcial diversity
- Diverse-N Grams
- output distribution (entropy?) or Kurtosis/Skewness?
- semantic diversity
- embedding model for semantic similarity comparison (SimCSE/SensentenceBERT?)
- https://aclanthology.org/W19-2311.pdf
- https://aclanthology.org/2023.emnlp-main.31.pdf
- https://openreview.net/pdf?id=SJeYe0NtvH
- https://ceur-ws.org/Vol-3359/paper7.pdf
- https://arxiv.org/pdf/2305.08493.pdf
- https://arxiv.org/pdf/2311.01937.pdf
- https://arxiv.org/pdf/2312.02439.pdf
- https://ojs.aaai.org/index.php/AIIDE/article/view/27539/27312
- https://arxiv.org/pdf/2311.09682.pdf