We have constructed two datasets for cross-lingual summarization: ZH2ENSUM and EN2ZHSUM. You can download here.
We would appreciate your citation if you find this is beneficial.
@inproceedings{zhu-etal-2019-ncls,
title = "{NCLS}: Neural Cross-Lingual Summarization",
author = "Zhu, Junnan and Wang, Qian and Wang, Yining and Zhou, Yu and Zhang, Jiajun and Wang, Shaonan and Zong, Chengqing",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
month = nov,
year = "2019",
address = "Hong Kong, China",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D19-1302",
doi = "10.18653/v1/D19-1302",
pages = "3045--3055",
}
If you have any question, please feel free to contact us by sending an email to {junnan.zhu, yzhou, jjzhang, cqzong}@nlpr.ia.ac.cn.
This project is licensed under the BSD License - see LICENSE.md for details.
The copyright of this dataset belongs to the authors, and the dataset is only used for research purposes. Display, reproduction, transmission, distribution or publication of this dataset is prohibited.