Stars
Recipes for training reward models for RLHF.
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Official implementation of project Honeybee (CVPR 2024)
A very simple matrix-multiplication example for CPU / CUDA / METAL using GGML / llama.cpp.
Efficiently computes derivatives of NumPy code.
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
A Python implementation of forward-mode automatic differentiation using dual numbers.
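The idea behind that repo can be sketched in a few lines: a dual number a + b·ε (with ε² = 0) carries a value and its derivative through arithmetic, so seeding the ε coefficient with 1 yields df/dx. A minimal illustrative sketch (not the repo's actual code):

```python
class Dual:
    """Number val + dot*eps where eps**2 == 0; dot carries the derivative."""

    def __init__(self, val, dot=0.0):
        self.val, self.dot = val, dot

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.dot + other.dot)

    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # Product rule: (a + a'eps)(b + b'eps) = ab + (a b' + a' b) eps
        return Dual(self.val * other.val,
                    self.val * other.dot + self.dot * other.val)

    __rmul__ = __mul__


def derivative(f, x):
    # Seed the tangent with 1.0 and read off the eps coefficient.
    return f(Dual(x, 1.0)).dot


# d/dx (x**2 + 3x) at x = 2 is 2*2 + 3 = 7
print(derivative(lambda x: x * x + 3 * x, 2.0))  # 7.0
```

Forward mode propagates derivatives alongside values in a single pass, which is why dual numbers map onto it so directly (unlike reverse mode, which needs a recorded tape).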
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Flash Attention in ~100 lines of CUDA (forward pass only)
Large-scale 4D-parallelism pre-training of Mixture-of-Experts 🤗 transformers *(still a work in progress)*
neuralmagic / nm-vllm
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs.
A native PyTorch Library for large model training
A library that adapts symspellpy to the characteristics of Korean; it uses phoneme decomposition for more accurate typo correction.
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
Scalable data pre-processing and curation toolkit for LLMs
KoLLaVA: Korean Large Language-and-Vision Assistant (feat. LLaVA)
Collection of leaked system prompts
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
Minimalistic large language model 3D-parallelism training