Word Representations, Tree Models and Syntactic Functions

Šuster, Simon; van Noord, Gertjan; Titov, Ivan

Computer Science > Computation and Language

arXiv:1508.07709 (cs)

[Submitted on 31 Aug 2015 (v1), last revised 5 Feb 2016 (this version, v2)]

Title:Word Representations, Tree Models and Syntactic Functions

Authors:Simon Šuster, Gertjan van Noord, Ivan Titov

View PDF

Abstract:Word representations induced from models with discrete latent variables (e.g.\ HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both transition and emission components. Such syntactic information can potentially lead to capturing more fine-grain and functional distinctions between words, which, in turn, may be desirable in many NLP applications. We evaluate the word representations on two tasks -- named entity recognition and semantic frame identification. We observe improvements from exploiting syntactic function information in both cases, and the results rivaling those of state-of-the-art representation learning methods. Additionally, we revisit the relationship between sequential and unlabeled-tree models and find that the advantage of the latter is not self-evident.

Comments:	Add github code repository link. Fix equation 4.1
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1508.07709 [cs.CL]
	(or arXiv:1508.07709v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1508.07709

Submission history

From: Simon Šuster [view email]
[v1] Mon, 31 Aug 2015 07:52:50 UTC (72 KB)
[v2] Fri, 5 Feb 2016 13:26:56 UTC (72 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2015-08

Change to browse by:

cs
cs.CL
cs.LG
stat

References & Citations

DBLP - CS Bibliography

listing | bibtex

Simon Suster
Gertjan van Noord
Ivan Titov

export BibTeX citation

Computer Science > Computation and Language

Title:Word Representations, Tree Models and Syntactic Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Word Representations, Tree Models and Syntactic Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators