Skip to content
View xvyaward's full-sized avatar
  • POSTECH (Pohang University of Science and Technology
  • Pohang, Korea
  • 12:02 (UTC +09:00)
  • X @changhunlee_

Highlights

  • Pro

Block or report xvyaward

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,905 4,117 Updated Oct 7, 2024

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

Python 51 5 Updated Mar 7, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,802 254 Updated May 3, 2024

๐Ÿ“–A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,598 176 Updated Oct 6, 2024

Awesome LLM compression research papers and tools.

1,096 66 Updated Oct 4, 2024

๐Ÿ’ฏ Curated coding interview preparation materials for busy software engineers

TypeScript 117,729 14,631 Updated Oct 6, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficieโ€ฆ

C++ 8,341 936 Updated Oct 1, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,170 296 Updated Aug 30, 2024

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.

Python 434 25 Updated Jun 18, 2024

Refine high-quality datasets and visual AI models

Python 8,721 551 Updated Oct 7, 2024

Transformer related optimization, including BERT, GPT

C++ 5 5 Updated Jun 2, 2023

A Gradio web UI for Large Language Models.

Python 39,895 5,228 Updated Oct 5, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,379 469 Updated Sep 28, 2024

<โšก๏ธ> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

Python 15,343 1,849 Updated Aug 14, 2024

Source code for Twitter's Recommendation Algorithm

Scala 62,131 12,152 Updated Jul 10, 2024

๐Ÿ“š ์‹ ์ž… ๊ฐœ๋ฐœ์ž๋กœ์„œ ์„ฑ์žฅ์„ ์œ„ํ•œ ์ „๊ณต ์ง€์‹์„ ์ •๋ฆฌํ•ฉ๋‹ˆ๋‹ค ๐Ÿ˜Š

1,220 123 Updated Dec 12, 2022

์ฝ”๋”ฉ ํ…Œ์ŠคํŠธ ๊ด€๋ จ ๊ธฐ์ถœ๋ฌธํ•ญ์„ ํ’€์–ด๋ณด๊ณ  ์†Œ์Šค์ฝ”๋“œ ๋ฐ ์„ค๋ช…์„ ์—…๋กœ๋“œํ•ฉ๋‹ˆ๋‹ค.

C++ 1,133 166 Updated Jul 14, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (Vโ€ฆ

Python 31,703 4,710 Updated Oct 2, 2024

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) ๆˆ‘ไธ้—ดๆ–ญๆ›ดๆ–ฐ็š„ๆœบๅ™จๅญฆไน ๏ผŒๆฆ‚็Ž‡ๆจกๅž‹ๅ’Œๆทฑๅบฆๅญฆไน ็š„่ฎฒไน‰(2000+้กต)ๅ’Œ่ง†้ข‘้“พๆŽฅ

Jupyter Notebook 8,393 1,715 Updated Sep 29, 2024

Awesome Knowledge Distillation

3,430 493 Updated Aug 26, 2024

Fast and accurate object detection with end-to-end GPU optimization

Python 885 271 Updated Sep 29, 2021

Color palettes which are also distinguishable when printed in grayscale

Jupyter Notebook 3 Updated May 25, 2020

<๋จธ์‹ ๋Ÿฌ๋‹ ๊ต๊ณผ์„œ with ํŒŒ์ด์ฌ, ์‚ฌ์ดํ‚ท๋Ÿฐ, ํ…์„œํ”Œ๋กœ>์˜ ์ฝ”๋“œ ์ €์žฅ์†Œ

Jupyter Notebook 66 74 Updated Apr 5, 2022

๋”ฅ๋Ÿฌ๋‹์— ๋ชฉ๋งˆ๋ฅธ ์‚ฌ๋žŒ๋“ค์„ ์œ„ํ•œ PyTorch

8 4 Updated Dec 3, 2019

Collection of recent methods on (deep) neural network compression and acceleration.

923 133 Updated Sep 6, 2024

Pipeline FFT Implementation in Verilog HDL

Verilog 73 18 Updated Apr 14, 2019

C++ based maze solver, accelerated with Nvidia CUDA

Cuda 1 Updated Jan 12, 2020

Command-line program to download videos from YouTube.com and other video sites

Python 131,702 9,970 Updated Aug 17, 2024
Next