[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
-
Updated
Apr 28, 2023 - Python
[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
OpenWordnet-PT: an open access wordnet for Portuguese
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
Compass-aligned Distributional Embeddings. Align embeddings from different corpora
Data Sets and Models for Evaluation of Lexical Semantic Change Detection
The implementation for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)
An R-based guide to sampling Google n-gram data, building historical term-feature matrices & investigating lexical semantic change historically.
Data for the DiMSUM shared task at SEMEVAL 2016
Web based semantic visualization tool
Probing task; contextual embeddings -> textual definitions (EMNLP19)
Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
Watasense: an Unsupervised WSD System for Under-Resourced Languages.
A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference
Correlated Occurrence Analogue to Lexical Semantics (COALS)