![:octocat: :octocat:](https://github.githubassets.com/images/icons/emoji/octocat.png)
-
Capital One AI Foundations
- New York
- https://gentawinata.com
- @gentaiscool
Highlights
- Pro
Block or Report
Block or report gentaiscool
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
-
-
distfuse Public
A library to calculate similarity scores between two collections of text sequences encoded using transformer models for bitext mining, dense retrieval, retrieval-based classification, and retrieval…
-
code-switching-papers Public
A curated list of research papers and resources on code-switching
-
mteb Public
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark
Python Apache License 2.0 UpdatedJun 18, 2024 -
miners Public
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.
-
-
indonesian-nlp Public
A curated list of research papers and resources on Indonesian languages
-
-
matrix_fact Public
Matrix Factorization Library
-
mt-metrics-eval Public
Forked from google-research/mt-metrics-evalTools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
Python Apache License 2.0 UpdatedDec 18, 2023 -
-
-
do-we-need-attention Public
Forked from srush/do-we-need-attentionTeX MIT License UpdatedJun 4, 2023 -
lstm-attention Public
Attention-based bidirectional LSTM for Classification Task (ICASSP)
-
DataLab Public
Forked from ExpressAI/DataLabThe unified platform for data-related resources.
Python Apache License 2.0 UpdatedNov 13, 2022 -
meta-emb Public
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
-
acl-anthology Public
Forked from acl-org/acl-anthologyData and software for building the ACL Anthology.
Python Apache License 2.0 UpdatedOct 9, 2022 -
lm-evaluation-harness Public
Forked from bigscience-workshop/lm-evaluation-harnessA framework for few-shot evaluation of autoregressive language models.
Python MIT License UpdatedOct 7, 2022 -
promptsource Public
Forked from bigscience-workshop/promptsourceToolkit for creating, sharing and using natural language prompts.
Python Apache License 2.0 UpdatedJun 27, 2022 -
few-shot-lm Public
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
-
-
end2end-asr-pytorch Public
End-to-End Automatic Speech Recognition on PyTorch
-
BIG-bench Public
Forked from google/BIG-benchBeyond the Imitation Game collaborative benchmark for enormous language models
Python Apache License 2.0 UpdatedApr 25, 2022 -
-
PromptPapers Public
Forked from thunlp/PromptPapersMust-read papers on prompt-based tuning for pre-trained language models.
UpdatedJan 8, 2022 -
NER-datasets Public
Forked from davidsbatista/NER-datasetsDatasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Python UpdatedNov 30, 2021 -
NL-Augmenter Public
Forked from GEM-benchmark/NL-AugmenterNL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Python MIT License UpdatedOct 30, 2021 -
mesh-transformer-jax Public
Forked from kingoflolz/mesh-transformer-jaxModel parallel transformers in JAX and Haiku
Jupyter Notebook Apache License 2.0 UpdatedJun 14, 2021 -
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedJun 3, 2021