This is a final project from NLP class at Waseda. This project shows the effect of DPO on a mini language model. This project is mostly from minChatGPT.
O-suke12/MiniFastGPT
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.