  • Nanjing University


Code for studying the super weight in LLM

Jupyter Notebook 10 1 Updated Nov 11, 2024

For releasing code related to compression methods for transformers, accompanying our publications

Python 371 37 Updated Oct 11, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project, thanks~

Vue 6,356 443 Updated Nov 13, 2024

Applied AI experiments and examples for PyTorch

Python 164 14 Updated Oct 31, 2024

A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods.

Python 166 9 Updated Aug 9, 2024

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

1,883 206 Updated Nov 1, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 730 56 Updated Oct 8, 2024

Code for the NeurIPS 2024 paper: QuaRot, end-to-end 4-bit inference of large language models.

Python 284 21 Updated Jul 22, 2024

An attempt to build an all-capable translator; it is not all-capable, and may in fact be capable of nothing.

Python 9 Updated Apr 16, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,653 427 Updated Nov 16, 2024

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,834 171 Updated May 25, 2024

🚁 Insurance industry corpus and chatbot

Python 1,020 345 Updated Jul 12, 2024

huggingface mirror download

Python 551 56 Updated Nov 16, 2024

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

Python 11,418 1,250 Updated Nov 7, 2024

An Open-source Toolkit for LLM Development

Python 2,719 176 Updated May 24, 2024

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 20,085 2,242 Updated Nov 5, 2024

A framework for few-shot evaluation of language models.

Python 6,982 1,867 Updated Nov 16, 2024

Awesome LLM compression research papers and tools.

1,192 77 Updated Nov 12, 2024

PyHessian is a PyTorch library for second-order analysis and training of neural networks

Jupyter Notebook 686 118 Updated Apr 16, 2024

Reorder-based post-training quantization for large language model

Python 182 11 Updated May 17, 2023

Torchreid: Deep learning person re-identification in PyTorch.

Python 4,320 1,146 Updated Jul 22, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,255 146 Updated Jul 12, 2024

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 429 70 Updated Nov 20, 2023

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,453 1,068 Updated Nov 1, 2024

This list has grown very large and now differs from the original idea. It collects premium software in various categories.

JavaScript 77,050 6,268 Updated Nov 16, 2024

[ECCV 2022] R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis

Python 191 23 Updated Aug 15, 2023

ALSO: Automotive Lidar Self-supervision by Occupancy estimation

Python 169 19 Updated Jul 24, 2023

Plenoxels: Radiance Fields without Neural Networks

Python 2,829 360 Updated Jun 29, 2023

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

Python 67 13 Updated Nov 4, 2021

Flops counter for convolutional networks in pytorch framework

Python 2,820 307 Updated Sep 27, 2024