MTEB: Massive Text Embedding Benchmark
-
Updated
Jun 13, 2024 - Python
MTEB: Massive Text Embedding Benchmark
Crosslingual Generalization through Multitask Finetuning
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
This repo supports various cross-lingual transfer learning & multilingual NLP models.
[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. ⭐ support NLP!
This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), July 9-14, 2023.
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
Cross Lingual Language models for making search engines for Holy Quran and Sahih Hadiths
Winning Solution for the Bangla Complex Named Entity Recognition Challenge - BDOSN NLP Hackathon 2023
MaLA-500: Massive Language Adaptation of Large Language Models
Chaii (Challenge in AI for India) Multilingual QnA - Google Research India
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing
Official repository for the paper "CLAfICLe: Cross-Lingual Adaptation for In-Context Learning". Not Published.
[EMNLP 2023 - Findings] Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention
Solutions of the CMU Multilingual Natural Language Processing Course
A collection of codes in a NMT series of Geultto 8th
Add a description, image, and links to the multilingual-nlp topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-nlp topic, visit your repo's landing page and select "manage topics."