Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Pushp, Pushpankar Kumar; Srivastava, Muktabh Mayank

Computer Science > Computation and Language

arXiv:1712.05972 (cs)

[Submitted on 16 Dec 2017 (v1), last revised 23 Dec 2017 (this version, v2)]

Title:Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Authors:Pushpankar Kumar Pushp, Muktabh Mayank Srivastava

View PDF

Abstract:Zero-shot Learners are models capable of predicting unseen classes. In this work, we propose a Zero-shot Learning approach for text categorization. Our method involves training model on a large corpus of sentences to learn the relationship between a sentence and embedding of sentence's tags. Learning such relationship makes the model generalize to unseen sentences, tags, and even new datasets provided they can be put into same embedding space. The model learns to predict whether a given sentence is related to a tag or not; unlike other classifiers that learn to classify the sentence as one of the possible classes. We propose three different neural networks for the task and report their accuracy on the test set of the dataset used for training them as well as two other standard datasets for which no retraining was done. We show that our models generalize well across new unseen classes in both cases. Although the models do not achieve the accuracy level of the state of the art supervised models, yet it evidently is a step forward towards general intelligence in natural language processing.

Comments:	v2 - fixed a citation error, unchanged from v1 otherwise
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1712.05972 [cs.CL]
	(or arXiv:1712.05972v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1712.05972

Submission history

From: Muktabh Mayank Srivastava [view email]
[v1] Sat, 16 Dec 2017 15:17:07 UTC (251 KB)
[v2] Sat, 23 Dec 2017 20:05:03 UTC (251 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Pushpankar Kumar Pushp
Muktabh Mayank Srivastava

export BibTeX citation

Computer Science > Computation and Language

Title:Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators