Skip to content
View abzb1's full-sized avatar
Block or Report

Block or report abzb1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagation operation to get super vision language performances. (Under Review)

Python 76 1 Updated Jun 23, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,101 61 Updated Jul 18, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks

Python 713 83 Updated Jul 18, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 215 8 Updated Jul 17, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 8,832 890 Updated Jul 18, 2024

Open-source AI cookbook

Jupyter Notebook 1,507 203 Updated Jul 18, 2024

Tools for understanding how transformer predictions are built layer-by-layer

Python 391 39 Updated Jun 2, 2024

A framework for few-shot evaluation of language models.

Python 5,880 1,568 Updated Jul 18, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,820 549 Updated Jul 18, 2024

Bayesian low-rank adaptation for large language models

Python 18 7 Updated May 4, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,228 399 Updated Jul 18, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 129,456 25,693 Updated Jul 18, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,318 346 Updated Jul 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,155 3,288 Updated Jul 18, 2024

Pytorch library for model calibration metrics and visualizations as well as recalibration methods. In progress!

Python 63 10 Updated May 3, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,627 296 Updated Jul 18, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,692 1,023 Updated Jun 27, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,088 1,445 Updated Jul 18, 2024