Skip to content
View gentaiscool's full-sized avatar
:octocat:
Writing interesting code...
:octocat:
Writing interesting code...

Highlights

  • Pro

Organizations

@HLTCHKUST @audioku @indobenchmark
Block or Report

Block or report gentaiscool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 2 9 Updated Jul 25, 2024

ANSI color formatting for output in terminal

Python 201 25 Updated Jul 1, 2024

Mexican NLP 2024 Summerschool Tutorial on Knowledge Distillation and Parameter Efficient Finetuning

7 Updated Jun 17, 2024

A Python implementation of global optimization with gaussian processes.

Python 7,674 1,524 Updated Jul 25, 2024

A library to calculate similarity scores between two collections of text sequences encoded using transformer models for bitext mining, dense retrieval, retrieval-based classification, and retrieval…

Python 4 2 Updated Jun 22, 2024

A library of translation-based text similarity measures

Python 24 5 Updated Dec 11, 2023

BERT score for text generation

Jupyter Notebook 1,520 208 Updated Jun 14, 2024

MTEB: Massive Text Embedding Benchmark

Python 1,673 221 Updated Jul 25, 2024

Implementation of ProxyLM, a scalable and efficient LM performance prediction framework on NLP task using proxy models

Python 5 Updated Jun 15, 2024

Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

Python 11 3 Updated Jul 1, 2024

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.

Python 7 1 Updated Jun 17, 2024

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

Python 58 54 Updated Jul 8, 2024
Jupyter Notebook 3 Updated Jun 24, 2024

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems

1 Updated Jun 10, 2024

Indonesian T0 | Instruction-tuning for low-resource and extremely low-resource Austronesian languages

Jupyter Notebook 9 1 Updated Jun 24, 2024

This repository is dedicated to development of code-mixed language resources.

24 1 Updated Jul 22, 2023

Nollywood Movie Reviews in 5 Nigerian Languages

Shell 4 Updated May 18, 2024

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 85 2 Updated Aug 18, 2023

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 203 19 Updated Jan 13, 2024
Python 1 Updated Aug 7, 2023

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

Jupyter Notebook 4,564 739 Updated Sep 22, 2023

NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Jupyter Notebook 25 1 Updated Feb 26, 2024

Solve a Rubik's Cube with neural networks

Python 5 Updated Aug 4, 2021

Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.

Python 145 51 Updated Jul 25, 2024

Can LLMs generate code-mixed sentences through zero-shot prompting?

11 Updated Apr 18, 2023

Inference code for Llama models

Python 54,581 9,359 Updated Jul 25, 2024

Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)

Python 59 6 Updated Jun 12, 2023

GlobalBench: A Benchmark for Global Progress in Language Technology

Python 6 Updated Dec 7, 2023

The unified platform for data-related resources.

Python 130 28 Updated Mar 6, 2023
Next