Skip to content
/ MATINF Public

The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization".

Notifications You must be signed in to change notification settings

WHUIR/MATINF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MATINF - Multitask Chinese NLP Dataset

The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization".

Citation

If you use the dataset or code in your research, please kindly cite our work:

@inproceedings{xu-etal-2020-matinf,
    title = "{MATINF}: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization",
    author = "Xu, Canwen  and
      Pei, Jiaxin  and
      Wu, Hongtao  and
      Liu, Yiyu  and
      Li, Chenliang",
    booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.acl-main.330",
    pages = "3586--3596",
}

Dataset

You can get MATINF dataset by signing the agreement on Google Form to request the access. You will get the download link and the zip password after filling the form. ALL USE MUST BE NON-COMMERCIAL!!

Code

Please manually change the stage variable in main() to toggle from different training phases.

Then run:

python run.py

Code credit: Hongtao Wu

About

The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization".

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages