gentaiscool

Writing interesting code...

Genta Indra Winata gentaiscool

Researcher @ Capital One AI Foundations. Natural Language Processing, Speech, Multilingual, Code-switching, Dialogue

236 followers · 122 following

Capital One AI Foundations
New York
https://gentawinata.com
@gentaiscool

Achievements

x3 x2

Stars

383 results for source starred repositories

faridlazuarda / cultural-llm-papers

A curated list of research papers and resources on Cultural LLM.

17 Updated Mar 29, 2024

simran-khanuja / awesome-cultural-nlp

Resources for cultural NLP research

36 5 Updated Aug 13, 2024

faridlazuarda / LinguAlchemy

Jupyter Notebook 2 1 Updated May 31, 2024

anderskm / gputil

A Python module for programmatically getting the GPU status of NVIDIA GPUs using nvidia-smi

Python 1,115 119 Updated Apr 13, 2024

termcolor / termcolor

ANSI color formatting for output in terminal

Python 206 25 Updated Jul 1, 2024

afaji / summerschool-KD-PEFT

Mexican NLP 2024 Summerschool Tutorial on Knowledge Distillation and Parameter Efficient Finetuning

8 Updated Jun 17, 2024

bayesian-optimization / BayesianOptimization

A Python implementation of global optimization with Gaussian processes.

Python 7,759 1,529 Updated Aug 20, 2024

gentaiscool / distfuse

A library to calculate similarity scores between two collections of text sequences encoded using transformer models for bitext mining, dense retrieval, retrieval-based classification, and retrieval…

Python 4 2 Updated Jun 22, 2024

ZurichNLP / nmtscore

A library of translation-based text similarity measures

Python 25 5 Updated Dec 11, 2023

Tiiiger / bert_score

BERT score for text generation

Jupyter Notebook 1,547 210 Updated Jul 30, 2024

embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,762 232 Updated Aug 28, 2024

davidanugraha / proxylm

Implementation of ProxyLM, a scalable and efficient framework for predicting LM performance on NLP tasks using proxy models

Python 5 1 Updated Aug 13, 2024

BatsResearch / LexC-Gen

Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

Python 11 3 Updated Jul 1, 2024

gentaiscool / miners

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.

Python 7 1 Updated Jun 17, 2024

SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

Python 59 55 Updated Aug 27, 2024

SamuelCahyawijaya / in-context-alignment

Jupyter Notebook 3 Updated Jun 24, 2024

dehanalkautsar / IndoToD

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems

1 Updated Jun 10, 2024

IndoNLP / cendol

Indonesian T0 | Instruction-tuning for low-resource and extremely low-resource Austronesian languages

Jupyter Notebook 9 1 Updated Jun 24, 2024

l3cube-pune / code-mixed-nlp

This repository is dedicated to the development of code-mixed language resources.

24 1 Updated Jul 22, 2023

IyanuSh / NollySenti

Nollywood Movie Reviews in 5 Nigerian Languages

Shell 4 Updated May 18, 2024

nlp-uoregon / Okapi

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 88 2 Updated Aug 18, 2023

LAION-AI / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 205 19 Updated Jan 13, 2024

Genius1237 / numpy-gpt2

Python 1 Updated Aug 7, 2023

srush / do-we-need-attention

TeX 159 7 Updated Jul 5, 2023

Nyandwi / machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

Jupyter Notebook 4,588 747 Updated Sep 22, 2023

IndoNLP / nusa-writes

NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Jupyter Notebook 25 2 Updated Feb 26, 2024

kongaskristjan / rubik

Solve a Rubik's Cube with neural networks

Python 5 1 Updated Aug 4, 2021

forestagostinelli / DeepCubeA

Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.

Python 149 51 Updated Jul 25, 2024

Rojak-NLP / LLM-Code-Mixing

Can LLMs generate code-mixed sentences through zero-shot prompting?

11 Updated Apr 18, 2023

meta-llama / llama

Inference code for Llama models

Python 55,283 9,419 Updated Aug 18, 2024
