Guyu (谷雨)

A pre-training and fine-tuning framework for text generation.

Guyu is the backbone code for "An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue Generation" (https://arxiv.org/abs/2003.04195).

Pre-training:

./prepare_data.sh
./train.sh
./inference.sh
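
inference.sh decodes text from the trained language model. As a rough illustration of the kind of decoding step involved, the snippet below is a generic top-k sampling sketch in PyTorch, not Guyu's actual inference code; the vocabulary size and the value of k are placeholders.

import torch
import torch.nn.functional as F

def top_k_sample(logits, k=40, temperature=1.0):
    # logits: 1-D tensor of next-token scores produced by the language model
    logits = logits / temperature
    topk_vals, topk_idx = torch.topk(logits, k)       # keep the k highest-scoring tokens
    probs = F.softmax(topk_vals, dim=-1)              # renormalize over those k tokens
    choice = torch.multinomial(probs, num_samples=1)  # sample one of them
    return topk_idx[choice].item()

# usage with random scores standing in for real model output
next_token_id = top_k_sample(torch.randn(30000), k=40)
print(next_token_id)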

Fine-tuning:

Example: chat-bot

cd chat_bot
./fine_tune.sh
./inference.sh
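
Fine-tuning on dialogue data generally requires flattening each dialogue into a single training sequence first. The sketch below illustrates that preprocessing step; the file names and the separator/end tokens are assumptions for illustration, not Guyu's documented data format.

SEP = "<sep>"   # hypothetical turn-separator token
EOS = "<eos>"   # hypothetical end-of-dialogue token

def dialogue_to_sample(turns):
    # turns: list of utterance strings, e.g. ["hi", "hello there", "how are you?"]
    return SEP.join(turns) + EOS

with open("dialogues.txt", encoding="utf-8") as fin, \
     open("chat_bot_train.txt", "w", encoding="utf-8") as fout:
    for line in fin:
        turns = line.strip().split("\t")   # assumes one tab-separated dialogue per line
        if turns:
            fout.write(dialogue_to_sample(turns) + "\n")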

Web API:

./deploy.sh
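
Once deployed, the model can be queried over HTTP. A minimal client sketch follows; the host, port, route, and JSON fields are assumptions, not the service's documented interface (check deploy.sh for the actual settings).

import requests

resp = requests.post(
    "http://localhost:8080/generate",   # hypothetical host, port, and route
    json={"text": "今天天气不错"},        # hypothetical request field
    timeout=30,
)
resp.raise_for_status()
print(resp.json())                      # print whatever the service returns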

Pre-trained models

  • 12-layer, 768-hidden, 12-heads, Chinese (News + zhwiki, 200G) and English (Gigawords + Bookscorpus + enwiki, 60G)

  • 24-layer, 768-hidden, 12-heads, Chinese (News + zhwiki, 200G) and English (Gigawords + Bookscorpus + enwiki, 60G)

  • Download: https://github.com/lipiji/Guyu/tree/master/model (a quick way to inspect a downloaded checkpoint is sketched below)
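
Assuming the released checkpoints are standard PyTorch files (the framework is written in Python/PyTorch), a downloaded model can be inspected as follows; the file name below is hypothetical, not the actual release name.

import torch

ckpt = torch.load("model/guyu_12layer.ckpt", map_location="cpu")  # hypothetical file name
if isinstance(ckpt, dict):
    print(list(ckpt.keys())[:10])   # peek at the first few entries of the checkpoint
else:
    print(type(ckpt))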

References:

  • An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue Generation. https://arxiv.org/abs/2003.04195