Skip to content
View hyunwoongko's full-sized avatar
🎯
Large Language Model
🎯
Large Language Model

Organizations

@EleutherAI @jiphyeonjeon @Hugging-Face-Supporter

Block or report hyunwoongko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Recipes to train reward model for RLHF.

Python 633 54 Updated Sep 12, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,292 223 Updated Nov 26, 2023
Python 2 Updated Jun 22, 2024

Official implementation of project Honeybee (CVPR 2024)

Python 415 18 Updated May 10, 2024

A very simple performing matrix multiplication example for CPU / CUDA / METAL using GGML / llama.cpp

9 1 Updated Jul 7, 2024

LLM101n: Let's build a Storyteller

28,243 1,539 Updated Aug 1, 2024

Efficiently computes derivatives of NumPy code.

Python 6,936 905 Updated Sep 12, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 931 52 Updated Jan 30, 2024

Implemented the forward mode of automatic differentiation with the help of dual numbers using Python.

Jupyter Notebook 19 7 Updated Feb 8, 2023

Pure Python autograd library based on NumPy

Python 4 3 Updated Jan 31, 2021
Jupyter Notebook 20 14 Updated Oct 5, 2018

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,017 198 Updated Sep 7, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 557 49 Updated Apr 7, 2024

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

Python 77 17 Updated Dec 14, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 250 9 Updated Sep 13, 2024

A native PyTorch Library for large model training

Python 1,541 141 Updated Sep 13, 2024

The official Meta Llama 3 GitHub site

Python 26,098 2,925 Updated Aug 12, 2024

Regex for Korean curseword filtering

10 Updated Oct 17, 2020

symspellpy를 한글 특성에 맞춰서 수정한 라이브러리. 음소분해를 이용해 더 정확한 오타교정을 해준다.

Python 38 4 Updated Nov 18, 2021

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

287 11 Updated Apr 18, 2024
Python 1 1 Updated Sep 7, 2024

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 456 55 Updated Sep 11, 2024

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

Jupyter Notebook 259 27 Updated Jun 3, 2024
34 2 Updated Feb 27, 2024

Collection of leaked system prompts

1,000 123 Updated Sep 11, 2024

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Python 614 52 Updated Sep 9, 2024

Minimalistic large language model 3D-parallelism training

Python 1,107 105 Updated Sep 13, 2024
Next