- FAIR (Meta AI)
- San Francisco Bay Area
- elbayadm.github.io
- @melbayad
Stars
A Python library for calculating a large variety of metrics from text
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
The official repo for "LLoCo: Learning Long Contexts Offline"
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
State-of-the-Art Text Embeddings
Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight
TensorDict is a PyTorch-dedicated tensor container.
Code for "Merging Text Transformers from Different Initializations"
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
A library for mechanistic interpretability of GPT-style language models
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Instant voice cloning by MIT and MyShell.
60+ Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A framework for conducting machine learning experiments in python
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Foundational Models for State-of-the-Art Speech and Text Translation
Code and documentation to train Stanford's Alpaca models, and generate the data.
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
LaTeX Template for Mike Morrison's #betterposter
Robust Speech Recognition via Large-Scale Weak Supervision
Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), and "Language-Informed Beam Search Decoding for Multilingual M…
A curated list of resources dedicated to Natural Language Processing (NLP)