Taro Watanabe
I am a professor at Nara Institute of Science and Technology (NAIST) and lead a natural language processing laboratory. I'm mainly working for machine translation and other areas, such as machine learning and natural language processing (curriculum vitae). I received B.E. from Kyoto University, M.E. from Kyoto University, M.S. from CMU, and Ph. D. from Kyoto University. I was previously affiliated with ATR, NTT, NICT and Google. You can reach me via taro at is.naist.jp.
Book
Machine Translation (in Japanese)
Talks, Lectures, Tutorials
Lecture at ALAGIN 2015 (in Japanese) [introduction, decoding, optimization, deep learning]
Short History of SMT. Talk at the 20th anniversary for the Association of NLP (in Japanese) [slide].
Lecture at ALAGIN 2014 (in Japanese) [introduction1, introduction2, decoding, scfg, optimization1, optimization2].
Statistical Approach for MT. Talk at U. Tokyo (in Japanese) [slide].
SMT12. talk at HIT-MSRA summer school 2012 [slide].
Structures in Statistical Machine Translation. A tutorial talk at NAIST [slide].
Cutting Edge in Statistical Machine Translation. A tutorial talk at NLP 2012 (in Japanese) [slide].
A set of slides for HIT-MSRA Summer School 2011 [introduction, alignment-model, phrase-model, tree-model].
Foundations of Statistical Machine Translation: Past, Present and Future. A tutorial talk at INTERSPEECH 2010 [slide].
Statistical Machine Translation Tutorial. A tutorial talk at NLP 2004 (in Japanese) [slide].
Activities
editorial board (action editor): Computational Linguistics (2013-2015), Machine Translation (2016-2021), TASL (2018-2021), TALLIP (2017-), ARR (2021-), TACL (2022-)
program (co-)chair: IJCNLP2017
(senior) area (co-)chair: ACL2008, ACL2011, EMNLP2013, ACL2015, COLING2016, NAACL2018, EMNLP2018, ACL2019, ACL2020, AACL2020, EMNLP2021, ACL2022, EMNLP2022 and many
committee: IPSJ-Kansai (2012-2013, in Japanese)
Software
trance: a transition-based neural network constituent parser
cicada: a hypergraph-based machine translation toolkit which supports {string,tree}-to-{string, tree} model
expgram: yet-another ngram toolkit with succinct storage
pialign: phrasal ITG aligner for phrase table induction
lader: latent derivation reorder for pre-reordering of MT input
a head-driven transition-based dependency parser
Supervising, Collaborating
Students: Many.
Interns: Katsuhiko Hayashi (2008, 2010-2012, NAIST), Graham Neubig (2010-2012, Kyoto U.), Isamu Fujiwara (2011, Tottori U.), Daniel Flannery (2011, Kyoto U.), Lemao Liu (2012-2013, Harbin Institute of Tech.), Hidetaka Kamigaito (2013, 2014-2015, Tokyo Institute of Tech.), Hitoshi Otsuki (2014-2015, Kyoto Institute of Tech.)
Visiting researchers: Conghui Zhu (2012-2013, Harbin Institute of Tech.)
Colleagues: Chooi-Ling Goh (2009,2011), Akihiro Tamura (2011-2014), Hideya Mino (2013-2015), Youzheng Wu (2014), Shumpei Kubosawa (2014-2015), Lemao Liu (2014-2015)
Publications
2024
Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe. 2024. Artwork Explanation in Large-scale Vision Language Models. In ACL 2024. [paper]
Armin Sarhangzadeh and Taro Watanabe. 2024. Alignment-Based Decoding Policy for Low-Latency and Anticipation-Free Neural Japanese Input Method Editors. In ACL 2024 Findings. [paper]
Huayang Li, Siheng Li, Deng Cai, Longyue Wang, Lemao Liu, Taro Watanabe, Yujiu Yang and Shuming Shi. 2024. TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild. In ACL 2024 Findings. [paper]
Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka and Masao Utiyama. 2024. Centroid-Based Efficient Minimum Bayes Risk Decoding. In ACL 2024 Findings. [paper]
Yusuke Sakai, Hidetaka Kamigaito and Taro Watanabe. 2024. mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans. In ACL 2024 Findings. [paper]
Akari Haga, Saku Sugawara, Akiyo Fukatsu, Miyu Oba, Hiroki Ouchi, Taro Watanabe and Yohei Oseki. 2024. Modeling Overregularization in Children with Small Language Models. In ACL 2024 Findings. [paper]
Yuji Oshima, Hiroyuki Shindo, Hiroki Teranishi, Hiroki Ouchi and Taro Watanabe. 2024. Synthetic Context with LLM for Entity Linking from Scientific Tables. In SDProc 2024. [paper]
Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe. 2024. Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding. In Repl4NLP 2024. [paper]
Huy Hien Vu, Hidetaka Kamigaito and Taro Watanabe. 2024. Context-Aware Machine Translation with Source Coreference Explanation. In Transactions of the Association for Computational Linguistics. [paper]
Miyu Oba, Tatsuki Kuribayashi, Hiroki Ouchi and Taro Watanabe. 2024. Second Language Acquisition of Neural Language Models. In Journal of Natural Language Processing (Japanese). [paper]
Hiroyuki Deguchi, Taro Watanabe, Yusuke Matsui, Masao Utiyama, Hideki Tanaka and Eiichiro Sumita. 2024. Subset Retrieval Nearest Neighbor Machine Translation. In Journal of Natural Language Processing. [paper]
Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe, 2024. Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? In NAACL 2024. [paper]
Yuhi Matogawa, Yusuke Sakai, Taro Watanabe and Chihiro Taguchi. 2024. Japanese Rule-based Grapheme-to-phoneme Conversion System and Multilingual Named Entity Dataset with International Phonetic Alphabet. In SIGMORPHON 2024. [paper]
Justin Vasselli, Arturo Martínez Peguero, Junehwan Sung and Taro Watanabe. 2024. Applying Linguistic Expertise to LLMs for Educational Material Development in Indigenous Languages. In AmericasNLP 2024. [paper]
Hiroyuki Deguchi, Masaaki Nagata and Taro Watanabe. 2024. Detector-Corrector: Edit-Based Automatic Post Editing for Human Post Editing. In EAMT 2024 (to appear).
Eunike Kardinata, Hiroki Ouchi and Taro Watanabe. 2024. Constructing Indonesian-English Travelogue Dataset. In LREC-COLING 2024. [paper]
Frederikus Hudi, Zhi Qu, Hidetaka Kamigaito and Taro Watanabe. 2024. Disentangling Pretrained Representation to Leverage Low-Resource Languages in Multilingual Machine Translation. In LREC-COLING 2024. [paper]
Iqra Ali, Hidetaka Kamigaito and Taro Watanabe. 2024. Monolingual Paraphrase Detection Corpus for Low Resource Pashto Language at Sentence Level. In LREC-COLING 2024. [paper]
Eri Onami, Shuhei Kurita, Taiki Miyanishi and Taro Watanabe. 2024. JDocQA: Japanese Document Question Answering Dataset for Generative Language Models. In LREC-COLING 2024. [paper]
Shohei Higashiyama, Hiroki Ouchi, Hiroki Teranishi, Hiroyuki Otomo, Yusuke Ide, Aitaro Yamamoto, Hiroyuki Shindo, Yuki Matsuda, Shoko Wakamiya, Naoya Inoue, Ikuya Yamada and Taro Watanabe. 2024. Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation. In EACL 2024 Findings. [paper]
Yuto Nishida, Makoto Morishita, Hidetaka Kamigaito and Taro Watanabe. 2024. Generating Diverse Translation with Perturbed kNN-MT. In EACL 2024 Student Research Workshop. [paper]
2023
Hiroyuki Deguchi, Kenji Imamura, Yuto Nishida, Yusuke Sakai, Justin Vasselli and Taro Watanabe. 2023. NAIST-NICT WMT’23 General MT Task Submission. In WMT 2023. [paper]
Lemao Liu, Francisco Casacuberta, George Foster, Guoping Huang, Philipp Koehn, Geza Kovacs, Shuming Shi, Taro Watanabe and Chengqing Zong. 2023. Findings of the Word-Level AutoCompletion Shared Task in WMT 2023. In WMT 2023. [paper]
Yiran Wang, Taro Watanabe, Masao Utiyama and Yuji Matsumoto. 2023. 24-bit Languages. In IJCNLP-AACL 2023. [paper]
Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe. 2023. Model-based Subsampling for Knowledge Graph Completion. In IJCNLP-AACL 2023. [paper]
Huayang Li, Tian Lan, Zihao Fu, Deng Cai, Lemao Liu, Nigel Collier, Taro Watanabe and Yixuan Su. 2023. Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective. In NeurIPS 2023 (to appear).
Hiroyuki Deguchi, Taro Watanabe, Yusuke Matsui, Masao Utiyama, Hideki Tanaka and Eiichiro Sumita. 2023. Subset Retrieval Nearest Neighbor Machine Translation. In ACL 2023. [paper]
Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe. 2023. Table and Image Generation for Investigating Knowledge of Entities in Pretrained Vision and Language Models. In ACL 2023. [paper]
Miyu Oba, Tatsuki Kuribayashi, Hiroki Ouchi and Taro Watanabe. 2023. Second Language Acquisition of Neural Language Models. In ACL 2023 Findings. [paper]
Justin Vasselli, Christopher Vasselli, Adam Nohejl and Taro Watanabe. 2023. NAISTeacher: A Prompt and Rerank Approach to Generating Teacher Utterances in Educational Dialogues. In 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023). [paper]
Justin Vasselli and Taro Watanabe. 2023. A Closer Look at k-Nearest Neighbors Grammatical Error Correction. 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023). [paper]
Yusuke Ide, Masato Mita, Adam Nohejl, Hiroki Ouchi, and Taro Watanabe. 2023. Japanese Lexical Complexity for Non-Native Readers: a New Dataset. In 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023). [paper]
Jungmin Choi, Ukyo Honda, Taro Watanabe and Kentaro Inui. 2023. Explainable Natural Language Inference in the Legal Domain via Text Generation. In Transactions of the Japanese Society for Artificial Intelligence. [paper]
Van-Hien Tran, Hiroki Ouchi, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe. 2023. Enhancing Semantic Correlation between Instances and Relations for Zero-Shot Relation Extraction. In Journal of Natural Language Processing. [paper]
Ukyo Honda, Taro Watanabe and Yuji Matsumoto. 2023. Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning. In WACV 2023. [paper]
2022
Shintaro Harada and Taro Watanabe. 2022. Neural Machine Translation with Synchronous Latent Phrase Structure. In Journal of Natural Language Processing. [paper]
Yuki Yamamoto, Yuji Matsumoto and Taro Watanabe. 2022. Dependency Patterns of Complex Sentences and Semantic Disambiguation for Abstract Meaning Representation Parsing. In Journal of Natural Language Processing. [paper]
Ukyo Honda, Hashimoto Atsushi, Taro Watanabe and Yuji Matsumoto. 2022. Removing Partial Mismatches in Unsupervised Image Captioning. In Transactions of the Japanese Society for Artificial Intelligence (in Japanese). [paper]
Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto, and Taro Watanabe. 2022. Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path. In Journal of Natural Language Processing. [paper]
Francisco Casacuberta, George Foster, Guoping Huang, Philipp Koehn, Geza Kovacs, Lemao Liu, Shuming Shi, Taro Watanabe and Chengqing Zong. 2022. Findings of the Word-Level AutoCompletion Shared Task in WMT 2022. In WMT 2022. [paper]
Hiroyuki Deguchi, Kenji Imamura, Masahiro Kaneko, Yuto Nishida, Yusuke Sakai, Justin Vasselli, Huy Hien Vu and Taro Watanabe. 2022. NAIST-NICT-TIT WMT22 General MT Task Submission. In WMT 2022. [paper]
Akio Hayakawa, Tomoyuki Kajiwara, Hiroki Ouchi and Taro Watanabe. 2022. JADES: New Text Simplification Dataset in Japanese Targeted at Non-Native Speakers. In Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). [paper]
Huayang Li, Deng Cai, Jin Xu and Taro Watanabe. 2022. Residual Learning of Neural Text Generation with n-gram Language Model. In EMNLP 2022 Findings. [paper]
Jungmin Choi, Ukyo Honda, Taro Watanabe, Hiroki Ouchi and Kentaro Inui. 2022. Law retrieval with supervised contrastive learning using the hierarchical structure of law. In PACLIC 36. [paper]
Xincan Feng, Zhi Qu, Yuchang Cheng, Taro Watanabe and Nobuhiro Yugami. 2022. Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space. In TextGraphs-16. [paper]
Zhi Qu and Taro Watanabe. 2022. Adapting to Non-Centered Languages for Zero-shot Multilingual Translation. In COLING 2022. [paper]
Masao Ideuchi, Masatoshi Tsuchiya, Yiran Wang and Masao Utiyama. 2022. NICTmed at the NCTIR-16 Real-MedNLP Task. In NTCIR-16. [paper]
Chihiro Taguchi, Sei Iwata and Taro Watanabe. 2022. Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information. In Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages (EURALI-2022). [paper]
Van-Hien Tran, Hiroki Ouchi, Taro Watanabe and Yuji Matsumoto. 2022. Improving Discriminative Learning for Zero-Shot Relation Extraction. In 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge (SpaNLP). [paper]
Jiannan Xiang, Huayang Li, Defu Lian, Guoping Huang, Taro Watanabe and Lemao Liu. 2022. Visualizing the Relationship Between Encoded Linguistic Information and Task Performance. In ACL 2022 Findings. [paper]
Zuchao Li, Yiran Wang, Masao Utiyama, Eiichiro Sumita, Hai Zhao and Taro Watanabe. 2022. What Works and Doesn’t Work, A Deep Decoder for Neural Machine Translation. In ACL 2022 Findings. [paper]
2021
Yushi Hirose, Shimbo Masashi and Taro Watanabe. 2021. Transductive Data Augmentation with Relational Path Rule Mining for Knowledge Graph Embedding. In IEEE International Conference on Big Knowledge (ICBK). [paper]
Yuya Sawada, Hiroki Teranishi, Yuji Matsumoto and Taro Watanabe. 2021. Coordinate Structure Analysis without Labeled Data for Recognizing Compound Named Entities. In Journal of Natural Language Processing (in Japanese) . [paper]
Van-Hien Tran, Van-Thuy Phi, Akihiko Kato, Hiroyuki Shindo, Taro Watanabe and Yuji Matsumoto. 2021. Improved Decomposition Strategy for Joint Entity and Relation Extraction. In Journal of Natural Language Processing. [paper]
Masao Ideuchi, Yohei Sakamoto, Yoshitaka Oida, Isaac Okada, Shohei Higashiyama, Masao Utiyama, Eiichiro Sumita and Taro Watanabe. 2021. A Selection Support System for Enterprise Resource Planning Package Components using Ensembles of Multiple Models with Round-trip Translation. In Journal of Natural Language Processing. [paper]
Shohei Higashiyama, Masao Utiyama, Taro Watanabe and Eiichiro Sumita. 2021. A Text Editing Approach to Joint Japanese Word Segmentation, POS Tagging, and Lexical Normalization. In Seventh Workshop on Noisy User-generated Text (W-NUT 2021). [paper]
Yuki Yamamoto, Yuji Matsumoto and Taro Watanabe. 2021. Dependency Patterns of Complex Sentences and Semantic Disambiguation for Abstract Meaning Representation Parsing. In *SEM 2021. [paper]
Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe. 2021. Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path. In ACL-IJCNLP 2021. [paper]
Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe. 2021. Structured Refinement for Sequential Labeling. In ACL-IJCNLP 2021 Findings. [paper]
Yushi Hirose, Shimbo Masashi and Taro Watanabe. 2021. Transductive Data Augmentation with Relational Path Rule Induction for Knowledge Graph Embedding. In International Workshop on Knowledge Graph: Heterogeneous Graph Deep Learning and Applications.
Shintaro Harada and Taro Watanabe. 2021. Neural Machine Translation with Synchronous Latent Phrase Structure. In ACL-IJCNLP 2021 Student Research Workshop. [paper]
Sei Iwata, Taro Watanabe and Masaaki Nagata. 2021. Zero Pronouns Identification based on Span prediction. In ACL-IJCNLP 2021 Student Research Workshop. [paper]
Chihiro Taguchi, Yusuke Sakai and Taro Watanabe. 2021. Transliteration for Low-Resource Code-Switching Texts: Building an Automatic Cyrillic-to-Latin Converter for Tatar. In Fifth Workshop on Computational Approaches to Linguistic Code-Switching (CALCS 2021). [paper]
Shohei Higashiyama, Masao Utiyama, Taro Watanabe and Eiichiro Sumita. 2021. User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization. In NAACL-HLT 2021. [paper]
Ukyo Honda, Yoshitaka, Ushiku, Atsushi Hashimoto, Taro Watanabe and Yuji Matsumoto. 2021. Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning. In EACL 2021. [paper]
2020
Hiroki Teranishi, Hiroyuki Shindo, Taro Watanabe and Yuji Matsumoto. 2020. Coordinate Structure Analysis using Local Models and CKY Algorithm. In Journal of Natural Language Processing (In Japanese). [paper]
Shohei Higashiyama, Masao Utiyama, Yuji Matsumoto, Taro Watanabe and Eiichiro Sumita. 2020. Auxiliary Lexicon Word Prediction for Cross-Domain Word Segmentation. In Journal of Natural Language Processing. [paper]
Yuya Sawada, Takashi Wada, Takayoshi Shibahara, Hiroki Teranishi, Shuhei Kondo, Hiroki Shindo, Taro Watanabe and Yuji Matsumoto. 2020. Coordination Boundary Identification without Labeled Data for Compound Terms Disambiguation. In COLING 2020. [paper]
2018
Wei Wang, Taro Watanabe, Macduff Hughes, Tetsuji Nakagawa and Ciprian Chelba. 2018. Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection. In WMT 2018. [paper]
2016
Yusuke Oda, Taku Kudo, Tetsuji Nakagawa and Taro Watanabe. 2016. Phrase-based Machine Translation using Multiple Reordering Candidates. In COLING 2016. [paper]
Graham Neubig and Taro Watanabe. 2016. Optimization for Statistical Machine Translation: A Survey. In Computational Linguistics. [paper]
Taro Watanabe. 2016. Advances in Structured Learning by Neural Networks. In Journal of the Japanese Society for Artificial Intelligence (invited paper, in Japanese) [paper].
Hidetaka Kamigaito, Taro Watanabe, Hiroya Takamura, Manabu Okumura and Eiichiro Sumita. Unsupervised Word Alignment Using Frequency Constraint in Posterior Regularized EM. 2016. In Journal of Natural Language Processing. Vol. 23, No. 4 (in Japanese) [paper].
2015
Xiaolin Wang, Masao Utiyama, Andrew Finch, Taro Watanabe and Eiichiro Sumita. 2015. Leave-one-out Word Alignment without Garbage Collector Effects. In EMNLP 2015. [paper]
Hidetaka Kamigaito, Taro Watanabe, Hiroya Takamura, Manabu Okumura andEiichiro Sumita. 2015. Hierarchical Back-off Modeling of Hiero Grammar based on Non-parametric Bayesian Model. In EMNLP 2015. [paper]
Taro Watanabe and Eiichiro Sumita. 2015. Transition-based Neural Constituent Parsing. In ACL 2015. [paper, software]
Akihiro Tamura, Taro Watanabe and Eiichiro Sumita. 2015. Recurrent Neural Networks for Word Alignment. In Journal of Natural Language Processing. Vol. 22, No. 4 [paper].
2014
Hidetaka Kamigaito, Taro Watanabe, Hiroya Takamura and Manabu Okumura. 2014. Unsupervised Word Alignment Using Frequency Constraint in Posterior Regularized EM. In EMNLP 2014. [paper]
Hideya Mino, Taro Watanabe and Eiichiro Sumita. 2014. Syntax-Augmented Machine Translation using Syntax-Label Clustering. In EMNLP 2014. [paper]
Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura and Manabu Okumura. 2014. Unsupervised Learning of Part-of-Speech in Dependency Trees for Machine Translation. In Information Processing Society of Japan. pp. 1665-1680 (Vol. 55, No. 7). [paper, Specially Selected Paper, Outstanding Paper Award]
Youzheng Wu, Taro Watanabe and Chiori Hori. 2014. Recurrent Neural Network-based Tuple Sequence Model for Machine Translation. In COLING 2014. [paper]
Akihiro Tamura, Taro Watanabe and Eiichiro Sumita. 2014. Recurrent Neural Networks for Word Alignment Model. In ACL 2014. [paper]
Lemao Liu, Tiejun Zhao, Taro Watanabe, Hailong Cao and Conghui Zhu. 2014. Discriminative Training for Log-Linear Based SMT: Global or Local Methods. In ACM Transactions on Asian Language Information Processing (TALIP). Vol. 13, Issue 4 [paper].
2013
Lemao Liu, Tiejun Zhao, Taro Watanabe end Eiichiro Sumita. 2013. Tuning SMT with a Large Number of Features via Online Feature Grouping. In IJCNLP 2013. [paper (note that this is version 2 which corrects bugs in the original paper)]
Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura and Manabu Okumura. 2013. Part-of-Speech Induction in Dependency Trees for Statistical Machine Translation. In ACL 2013. [paper]
Lemao Liu, Taro Watanabe, Eiichiro Sumita and Tiejun Zhao. 2013. Additive Neural Networks for Statistical Machine Translation. In ACL 2013. [paper]
Conghui Zhu, Taro Watanabe, Eiichiro Sumita and Tiejun Zhao. 2013. Hierarchical Phrase Table Combination for Machine Translation. In ACL 2013. [paper]
Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura and Manabu Okumura. 2013. Extracting Translation Pairs from Comparable Corpora through Graph-based Label Propagation. In Journal of Natural Language Processing, Vol. 20 (2013), No. 2, pp. 133-160. [paper]
Graham Neubig, Taro Watanabe, Shinsuke Mori and Tatsuya Kawahara. 2013. Substring-based Machine Translation. in Machine Translation. March 2013. [link, code]
2012
Lemao Liu, Tiejun Zhao, Taro Watanabe, Hailong Cao and Conghui Zhu. 2012. Expected Error Minimization with Ultraconservative Update for SMT. In COLING 2012. [paper]
Akihiro Tamura, Taro Watanabe and Eiichiro Sumita. 2012. Bilingual Lexicon Extraction from Comparable Corpora Using Label Propagation. In EMNLP-CoNLL 2012. [paper]
Graham Neubig, Taro Watanabe and Shinsuke Mori. 2012. Inducing a Discriminative Parser to Optimize Machine Translation Reordering. In EMNLP-CoNLL 2012. [paper, code]
Lemao Liu, Hailong Cao, Taro Watanabe, Tiejun Zhao, Mo Yu and Conghui Zhu. 2012. Locally Training the Log-Linear Model for SMT. In EMNLP-CoNLL 2012. [paper]
Katsuhiko Hayashi, Taro Watanabe, Masayuki Asahara and Yuji Matsumoto. 2012. Head-Driven Transition-based Parsing with Top-Down Prediction. In ACL 2012. [paper]
Graham Neubig, Taro Watanabe, Shinsuke Mori and Tatsuya Kawahara. 2012. Machine Translation without Words through Substring Alignment. In ACL 2012. [paper, code]
Taro Watanabe. 2012. Optimized Online Rank Learning for Machine Translation. In NAACL 2012. [paper, poster, code]
Taro Watanabe. 2012. Field of Statistical Machine Translation. In Journal of the Japanese Society for Artificial Intelligence (invited paper, in Japanese). pp. 288-295. Vol. 27 No. 3 May 2012. [paper]
Chooi-Ling Goh, Taro Watanabe, and Eiichiro Sumita. 2012. Japanese argument reordering based on dependency structure for statistical machine translation. In IEICE Transactions on Information and System, pp. 1668-1675. June 2012. [link]
Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, Tatsuya Kawahara. 2012. Joint Phrase Alignment and Extraction for Statistical Machine Translation. In Journal of Information Processing, pp. 512-523. April 2012. [link, code, Outstanding Paper Award]
2011
Katsuhiko Hayashi, Taro Watanabe, Masayuki Asahara and Yuji Matsumoto. 2011. Third-order Variational Reranking on Packed-Shared Dependency Forests. In Proceedings of EMNLP 2011, Edinburgh, Scotland, UK, July. [paper]
Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori and Tatsuya Kawahara. 2011. An Unsupervised Model for Joint Phrase Alignment and Extraction. In ACL 2011. [paper, code]
Taro Watanabe and Eiichiro Sumita. 2011. Machine Translation System Combination by Confusion Forest. In ACL 2011. [paper, slide, poster, code]
2010
Keiji Yasuda, Taro Watanabe, Masao Utiyama and Eiichiro Sumita. 2010. System Description of NiCT SMT for NTCIR-8. In NTCIR-8. [paper]
Chooi-Ling Goh, Taro Watanabe, Hirofumi Yamamoto, and Eiichiro Sumita. 2010. Constraining a generative word alignment model with discriminative output. In IEICE Transactions on Information and System, pp. 1976-1983. July 2012. [link]
2009
Katsuhiko Hayashi, Taro Watanabe, Hajime Tsukada, and Hideki Isozaki. 2009. Structural Support Vector Machines for Log-linear approach in Statistical Machine Translation. In Proceedings of IWSLT 2009, Tokyo, Japan, pp. 144-151, Dec. 2009. [paper]
Taro Watanabe, Hajime Tsukada, and Hideki Isozaki. 2009. A succinct n-gram language model. In ACL-IJCNLP 2009. pp. 341--344. [paper]
2008
Taro Watanabe, Hajime Tsukada and Hideki Isozaki. 2008. NTT SMT System 2008 at NTCIR-7. In NTCIR-7. [paper]
Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada and Hideki Isozaki. 2008. NTT Statistical Machine Translation System for IWSLT 2008. In IWSLT 2008. [paper]
2007
Taro Watanabe, Jun Suzuki, Katsuhito Sudoh, Hajime Tsukada and Hideki Isozaki. 2007. Larger Feature Set Approach for Machine Translation in IWSLT 2007. In IWSLT 2007. [paper, slide]
Taro Watanabe, Jun Suzuki, Hajime Tsukada and Hideki Isozaki. 2007. Online Large-Margin Training for Statistical Machine Translation. In EMNLP-CoNLL 2007 pp. 764-773. [paper, slide]
Taro Watanabe, Kenji Imamura, Eiichiro Sumita and Hiroshi G. Okuno. 2007. Statistical machine translation using hierarchical phrase alignment. In Systems and Computers in Japan. Vol. 38, Issue 6. pp. 70-79. June. [link]
2006
Taro Watanabe, Jun Suzuki, Hajime Tsukada and Hideki Isozaki. 2006. NTT Statistical Machine Translation for IWSLT 2006. In Proceedings of IWSLT 2006 pp. 95-102. [paper]
Taro Watanabe, Hajime Tsukada and Hideki Isozaki. 2006. Left-to-Right Target Generation for Hierarchical Phrase-based Translation. In Proceedings of COLING-ACL 2006 pp.777-784. [paper]
Taro Watanabe, Hajime Tsukada and Hideki Isozaki. 2006. NTT System Description for the WMT2006 Shared Task. In Proceedings of NAACL 2006 Workshop on Statistical Machine Translation pp.122-125. [paper, slide]
2005
Hajime Tsukada, Taro Watanabe, Jun Suzuki, Hideto Kazawa, Hideki Isozaki. 2005. The NTT Statistical Machine Translation System for IWSLT2005. In Proceedings of IWSLT 2005. [paper]
Young-Sook Hwang, Taro Watanabe and Yutaka Sasaki. 2005. Empirical Study of Utilizing Morph-Syntactic Information in SMT. In Proc. of IJCNLP-05 pp.474-485. [paper]
2004
Eiichiro Sumita, Yasuhiro Akiba, Takao Doi, Andrew Finch, Kenji Imamura, Hideo Okuma, Michael Paul, Mitsuo Shimohata and Taro Watanabe. 2004. EBMT, SMT, Hybrid and More: ATR Spoken Language Translation System. In Proceedings of IWSLT 2004 pp.13-20. [paper]
Andrew Finch, Taro Watanabe, Yasuhiro Akiba, Eiichiro Sumita. 2004. Paraphrasing as Machine Translation. In Journal of Natural Language Processing Vol.11, No.5, pp.87-111. [paper]
Ruiqiang Zhang, Gen-ichiro Kikui, Hirofumi Yamamoto, Frank K. Soong, Taro Watanabe, Eiichiro Sumita and Wai Kit Lo. 2004. Improved spoken language translation using n-best speech recognition hypotheses. In INTERSPEECH 2004.
Ruiqiang Zhang, Genichiro Kikui, Hirofumi Yamamoto, Frank Soong, Taro Watnabe and Wai Kit Lo. 2004. A Unified Approach in Speech-to-Speech Translation: Integrating Features of Speech recognition and Machine Translation. In COLING 2004 pp.1168--1174. [paper]
Richard Zens, Hermann Ney, Taro Watanabe, Eiichiro Sumita. 2004. Reordering Constraints for Phrase-Based Statistical Machine Translation. In COLING 2004 pp.205-211. [paper]
Kenji IMAMURA, Hideo OKUMA, Taro WATANABE, Eiichiro SUMITA. 2004. Example-based Machine Translation Based on Syntactic Transfer with Statistical Models. In COLING 2004, Vol.I, pp.99-105. [paper]
Taro Watanabe. 2004. Example-based Statistical Machine Translation. Ph.D. thesis, Kyoto University.
Taro WATANABE, Kenji IMAMURA, Eiichiro SUMITA, Hiroshi G. OKUNO. 2004. Statistical Machine Translation Using Hierarchical Phrase Alignment. In THE IEICE TRANSACTION ON INFROMATION AND SYSTEMS, PT.2(JAPANESE EDITION) , Vol.J87-D-II, No.4, pp.978-986.
2003
Taro Watanabe, Eiichiro Sumita and Hiroshi G. Okuno. 2003. Decoding Algorithms for Statistical Machine Transaltion Considering Generation Directions. In Information Processing Society of Japan pp. 3202 - 3210 (Vol. 44, No. 12) [paper]
Taro Watanabe and Eiichiro Sumita. 2003. Example-based Decoding for Statistical Machine Translation. In Machine Translation Summit IX. pp. 410-417 New Orleans, Louisiana. [paper, slide]
Taro Watanabe and Eiichiro Sumita. 2003. Statistical Machine Translation by Example-based Decoder. In Forum on Information Technology (FIT2003). Japan
Taro Watanabe, Eiichiro Sumita and Hiroshi G. Okuno. 2003. Chunk-based Statistical Translation. In 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003). pp. 303-310 Sapporo, Japan. [paper, slide]
Eiichiro SUMITA, Yasuhiro AKIBA, Takao DOI, Andrew FINCH, Kenji IMAMURA, Michael PAUL, Mitsuo SHIMOHATA and Taro WATANABE. 2003. A Corpus-Centered Approach to Spoken Language Translation. EACL-2003 pp.171-174. [paper]
2002
Taro Watanabe and Eiichiro Sumita. 2002. Statistical Machine Translation Decoder Based on Phrase. In 7th International Conference on Spoken Language Processing (ICSLP 2002) pp. 1889-1892 Denver, Colorado, USA, September
Taro Watanabe and Eiichiro Sumita. 2002. Bidirectional Decoding for Statistical Machine Translation. In 19th International Conference on Computational Linguistics (COLING 2002) pp. 1079-1085 Taipei, Taiwan, August. [paper]
Yasuhiro Akiba, Taro Watanabe and Eiichiro Sumita. 2002. Using Langauge and Translation Models to Select the Best among Outputs from Multiple MT Systems. In 19th International Conference on Computational Linguistics (COLING 2002) Taipei, Taiwan, August. [paper]
Hideharu Nakajima, Hirofumi Yamamoto and Taro Watanabe. 2002. Language Model Adaptation with Additional Text Generated by Machine Translation. In 19th International Conference on Computational Linguistics (COLING 2002) Taipei, Taiwan, August. [paper]
Andrew FINCH, Taro WATANABE and Eiichiro SUMITA. 2002. Paraphrasing by Statistical Machine Translation. In Forum on Information Technology (FIT 2002) (Vol..2) pp.187-188. Japan
Taro Watanabe, Mitsuo Shimohata and Eiichiro Sumita. 2002. Statistical Machine Translation on Paraphrased Corpora. In Third International Conference on Language Resources and Evaluation (LREC 2002), pp. 1954-1957 Las Palmas, Canary Islands, Spain, May. [paper]
Hideharu Nakajima, Hirofumi Yamamoto, Taro Watanabe. 2002. Language Model Adaptation with Additional Texts Generated by Machine Translation. In the 8th Annual Meeting of NLP pp.283-286 Japan
Taro Watanabe, Kenji Imamura and Eiichiro Sumita. 2002. Statistical Machine Translation Based on Hierarchical Phrase Alignment. In 9th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2002) , pp. 188-198 Keihanna, Japan, March. [paper, slide]
2000
Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation. 2000. Lori Levin, Boris Bartlog, Ariadna Font Llitjos, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe and Monika Woszczyna. In LREC 2002. [paper]