-
Harbin Institute of Technology
Highlights
- Pro
Stars
Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"
Dual Word Embedding for Robust Unsupervised Bilingual Lexicon Induction (TASLP 2023)
A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.
On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A library for Multilingual Unsupervised or Supervised word Embeddings
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
[ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation
HG2Vec is a language model that learns word embeddings utilizing only dictionaries and thesauri. Our model reaches the state-of-art on multiple word similarity and relatedness benchmarks.
IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces (EMNLP 2022)
yaoyiran / ContrastiveBLI
Forked from cambridgeltl/ContrastiveBLIImproving Word Translation via Two-Stage Contrastive Learning
Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)
Data sets and comparable Wikipedia samples used in our study on near-isomorphism between monolingual word embeddings
A framework to learn cross-lingual word embedding mappings