Repositories starred by digbangbang

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,140 49 Updated Aug 5, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

895 76 Updated Aug 8, 2024

CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization

Python 9 Updated Aug 3, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 41,645 7,583 Updated Aug 8, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,698 172 Updated Aug 2, 2024

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Python 1,019 89 Updated Dec 12, 2023

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 594 31 Updated Aug 5, 2024

Official repository for CoMM Dataset

Python 10 Updated Jul 31, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,055 731 Updated Aug 10, 2024

[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.

Python 36 3 Updated Jul 28, 2024

a collection of AWESOME things about Optimal Transport in Deep Learning

154 12 Updated May 11, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 24,532 3,531 Updated Aug 10, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,249 4,000 Updated Aug 10, 2024

The official Meta Llama 3 GitHub site

Python 25,370 2,809 Updated Aug 8, 2024

Inference code for CodeLlama models

Python 15,750 1,843 Updated Jul 23, 2024

Inference code for Llama models

Python 54,968 9,399 Updated Jul 25, 2024

Codebase for Inference-Time Policy Adapters

Python 18 1 Updated Nov 3, 2023

A simple and effective LLM pruning approach.

Python 589 69 Updated Aug 9, 2024

PyTorch implementations of Generative Adversarial Networks.

Python 16,117 4,042 Updated Jun 18, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 767 83 Updated Aug 8, 2024

Supercharge Your Model Training

Python 5,096 410 Updated Aug 9, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 511 37 Updated Mar 4, 2024

Code associated with Tuning Language Models by Proxy (Liu et al., 2024)

Python 78 10 Updated Mar 30, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,100 988 Updated Aug 5, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 84,279 6,463 Updated Aug 10, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,064 1,273 Updated Aug 6, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 130,388 25,907 Updated Aug 10, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,168 818 Updated Aug 10, 2024

[ICLR2022] Efficient Split-Mix federated learning for in-situ model customization during both training and testing time

Python 39 9 Updated Apr 12, 2023

[ICLR 2021] HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients

Python 139 33 Updated Feb 27, 2023