shyram

Sangil Park shyram

Research and Development Engineer at Samsung Research

19 followers · 60 following

Samsung Research HQ
in/shyram
https://bento.me/shyram

Achievements

Organizations

Block or Report

Block or report shyram

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

🔤 NLP

Natural Language Processing

89 repositories

lovit / textmining-tutorial

(한국어) 텍스트 마이닝을 위한 공부거리들

Jupyter Notebook 204 61 Updated Apr 7, 2020

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 64,900 7,581 Updated Jul 22, 2024

teddysum / korean_ABSA_baseline

Jupyter Notebook 43 11 Updated Jul 5, 2024

styfeng / DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

822 78 Updated Aug 12, 2022

makcedward / nlpaug

Data augmentation for NLP

Jupyter Notebook 4,358 455 Updated Jun 24, 2024

teslacool / SCA

Soft Contextual Data Augmentation

Python 39 9 Updated Jun 21, 2022

graykode / nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook 13,935 3,892 Updated Feb 21, 2024

NoUnique / pymecab-ko

🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3

C++ 13 1 Updated Feb 14, 2023

Gubuzeong / Getting-Started-with-Google-BERT

Jupyter Notebook 13 2 Updated Mar 28, 2022

google-research / bert

TensorFlow code and pre-trained models for BERT

Python 37,527 9,541 Updated Jul 20, 2024

fromSun2Moon / KoreanF2I

한국어 높임말 교정

Python 25 3 Updated Dec 31, 2022

kocohub / korean-hate-speech

Korean HateSpeech Dataset

371 38 Updated Jul 18, 2020

PotatoSpudowski / fastLLaMa

fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.

C 407 26 Updated Jun 2, 2023

EleutherAI / the-pile

Python 1,443 123 Updated Apr 27, 2023

sooftware / Korean-PLM

List of Korean pre-trained language models.

183 16 Updated Aug 31, 2023

soskek / bookcorpus

Crawl BookCorpus

Python 796 111 Updated Jul 14, 2023

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,397 1,946 Updated Apr 24, 2024

google-research / deduplicate-text-datasets

Rust 1,054 105 Updated Jun 6, 2024

databrickslabs / dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,806 1,158 Updated Jun 30, 2023

facebookresearch / stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 242 37 Updated Dec 15, 2023

stanford-cs324 / winter2022

Website

Python 46 11 Updated Jan 24, 2023

openlm-research / open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,287 371 Updated Jul 16, 2023

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,220 758 Updated Jul 10, 2024

princeton-nlp / MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,002 57 Updated Jan 11, 2024

yaodongC / awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

1,044 58 Updated Jan 4, 2024

lcw99 / evolve-instruct

evolve llm training instruction, from english instruction to any language.

Python 105 12 Updated Sep 15, 2023

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,297 3,311 Updated Jul 22, 2024

nlpyang / geval

Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"

Python 208 23 Updated Feb 4, 2024

MicrosoftTranslator / GEMBA

GEMBA — GPT Estimation Metric Based Assessment

Python 84 13 Updated Feb 18, 2024

wxjiao / Is-ChatGPT-A-Good-Translator

A preliminary evaluation of ChatGPT/GPT-4 for machine translation.

Python 235 16 Updated Nov 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sangil Park shyram

Achievements

Achievements

Organizations

Block or report shyram

🔤 NLP

lovit / textmining-tutorial

openai / whisper

teddysum / korean_ABSA_baseline

styfeng / DataAug4NLP

makcedward / nlpaug

teslacool / SCA

graykode / nlp-tutorial

NoUnique / pymecab-ko

Gubuzeong / Getting-Started-with-Google-BERT

google-research / bert

fromSun2Moon / KoreanF2I

kocohub / korean-hate-speech

PotatoSpudowski / fastLLaMa

EleutherAI / the-pile

sooftware / Korean-PLM

soskek / bookcorpus

microsoft / JARVIS

google-research / deduplicate-text-datasets

databrickslabs / dolly

facebookresearch / stopes

stanford-cs324 / winter2022

openlm-research / open_llama

openai / tiktoken

princeton-nlp / MeZO

yaodongC / awesome-instruction-dataset

lcw99 / evolve-instruct

vllm-project / vllm

nlpyang / geval

MicrosoftTranslator / GEMBA

wxjiao / Is-ChatGPT-A-Good-Translator