yspkm
  • Seoul National University
  • Republic of Korea

Starred repositories

Open source platform for the machine learning lifecycle

Python 18,168 4,104 Updated Aug 17, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,093 643 Updated Aug 14, 2024

An algebraic word problem dataset, with multiple-choice questions annotated with rationales.

286 41 Updated Nov 2, 2017

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,488 2,210 Updated Jul 29, 2024

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,026 93 Updated Mar 10, 2024

A curated list for Efficient Large Language Models

Python 1,044 74 Updated Aug 13, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,495 1,488 Updated Aug 17, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

905 77 Updated Aug 8, 2024

👩‍💻👨‍💻 AI engineer technical interview study group (⭐️ 1k+)

1,776 438 Updated May 4, 2023

Inference code for LLaMA models in JAX

Python 106 6 Updated May 21, 2024

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 2,344 251 Updated Aug 13, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,517 779 Updated Aug 15, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,755 982 Updated Aug 18, 2024

Pipeline Parallelism for PyTorch

Python 686 84 Updated Aug 7, 2024

PyTorch extensions for high performance and large scale training.

Python 3,115 273 Updated Jun 18, 2024

Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch

Python 500 40 Updated Jul 2, 2024

A modern model graph visualizer and debugger

JavaScript 960 68 Updated Aug 16, 2024

Distributed Evolutionary Algorithms in Python

Python 5,713 1,116 Updated Aug 10, 2024

Code Transformer neural network components piece by piece

Jupyter Notebook 270 141 Updated May 1, 2023

Google Research

Jupyter Notebook 33,638 7,811 Updated Aug 16, 2024

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Python 324 50 Updated Jul 14, 2024

Curation note of NLP datasets

91 6 Updated Dec 6, 2022

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,894 813 Updated Jul 1, 2024

Abseil Common Libraries (Python)

Python 2,261 246 Updated Aug 17, 2024

Datasets, Transforms and Models specific to Computer Vision

Python 15,879 6,895 Updated Aug 18, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,102 1,278 Updated Aug 17, 2024

Ongoing research training transformer models at scale

Python 9,698 2,184 Updated Aug 16, 2024

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,287 343 Updated Aug 8, 2024

JAX Synergistic Memory Inspector

Python 160 3 Updated Jul 16, 2024

Everything you want to know about Google Cloud TPU

Python 481 27 Updated Jul 16, 2024