Skip to content
View gentaiscool's full-sized avatar
:octocat:
Writing interesting code...
:octocat:
Writing interesting code...

Highlights

  • Pro

Organizations

@HLTCHKUST @audioku @indobenchmark

Block or report gentaiscool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
383 results for source starred repositories
Clear filter

A curated list of research papers and resources on Cultural LLM.

17 Updated Mar 29, 2024

Resources for cultural NLP research

36 5 Updated Aug 13, 2024
Jupyter Notebook 2 1 Updated May 31, 2024

A Python module for getting the GPU status from NVIDA GPUs using nvidia-smi programmically in Python

Python 1,115 119 Updated Apr 13, 2024

ANSI color formatting for output in terminal

Python 206 25 Updated Jul 1, 2024

Mexican NLP 2024 Summerschool Tutorial on Knowledge Distillation and Parameter Efficient Finetuning

8 Updated Jun 17, 2024

A Python implementation of global optimization with gaussian processes.

Python 7,759 1,529 Updated Aug 20, 2024

A library to calculate similarity scores between two collections of text sequences encoded using transformer models for bitext mining, dense retrieval, retrieval-based classification, and retrieval…

Python 4 2 Updated Jun 22, 2024

A library of translation-based text similarity measures

Python 25 5 Updated Dec 11, 2023

BERT score for text generation

Jupyter Notebook 1,547 210 Updated Jul 30, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,762 232 Updated Aug 28, 2024

Implementation of ProxyLM, a scalable and efficient LM performance prediction framework on NLP task using proxy models

Python 5 1 Updated Aug 13, 2024

Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

Python 11 3 Updated Jul 1, 2024

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.

Python 7 1 Updated Jun 17, 2024

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

Python 59 55 Updated Aug 27, 2024
Jupyter Notebook 3 Updated Jun 24, 2024

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems

1 Updated Jun 10, 2024

Indonesian T0 | Instruction-tuning for low-resource and extremely low-resource Austronesian languages

Jupyter Notebook 9 1 Updated Jun 24, 2024

This repository is dedicated to development of code-mixed language resources.

24 1 Updated Jul 22, 2023

Nollywood Movie Reviews in 5 Nigerian Languages

Shell 4 Updated May 18, 2024

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 88 2 Updated Aug 18, 2023

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 205 19 Updated Jan 13, 2024
Python 1 Updated Aug 7, 2023

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

Jupyter Notebook 4,588 747 Updated Sep 22, 2023

NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Jupyter Notebook 25 2 Updated Feb 26, 2024

Solve a Rubik's Cube with neural networks

Python 5 1 Updated Aug 4, 2021

Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.

Python 149 51 Updated Jul 25, 2024

Can LLMs generate code-mixed sentences through zero-shot prompting?

11 Updated Apr 18, 2023

Inference code for Llama models

Python 55,283 9,419 Updated Aug 18, 2024
Next