Starred repositories

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

208 7 Updated Nov 12, 2024

Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials. (Transfer learning)

Python 13,458 3,810 Updated Nov 1, 2024

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

670 52 Updated Nov 4, 2024
Python 46 Updated May 16, 2023

[Large models] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training run on a personal GPU!

Python 2,644 322 Updated Nov 10, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 4,089 436 Updated Nov 12, 2024

An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.

TeX 263 8 Updated Nov 7, 2024

A PyTorch Library for Multi-Task Learning

Python 2,036 188 Updated Oct 18, 2024

LLM-Merging: Building LLMs Efficiently through Merging

Jupyter Notebook 174 39 Updated Sep 24, 2024

[NeurIPS 2023] GitHub repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"

Python 58 9 Updated Nov 26, 2023

Tools for merging pretrained large language models.

Python 4,798 437 Updated Nov 5, 2024

Assistant tools for attention visualization in deep learning

Jupyter Notebook 1,002 80 Updated Jun 9, 2022

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

445 13 Updated Oct 10, 2024

Quick-start tutorial for the Transformers library

Python 1,116 140 Updated Sep 20, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,516 154 Updated Oct 10, 2024

Aliyun Drive command-line client, with JavaScript plugin support and sync backup functionality.

Go 4,201 355 Updated Nov 10, 2024

The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"

Python 178 6 Updated Mar 25, 2024

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Python 93 2 Updated Jan 15, 2024

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

1,078 59 Updated Jan 4, 2024

"Dive into LLMs" hands-on programming tutorial series

3,712 321 Updated Sep 20, 2024

A collection of AWESOME things about mixture-of-experts

967 74 Updated Jul 31, 2024

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications

611 34 Updated Oct 29, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,592 804 Updated Nov 10, 2024

GOOD: A Graph Out-of-Distribution Benchmark [NeurIPS 2022 Datasets and Benchmarks]

Python 186 19 Updated Nov 9, 2024

Course materials compiled for the Harbin Institute of Technology (HIT) School of Computer Science

Python 101 22 Updated Apr 9, 2023

Course guide for computer science graduate students at Harbin Institute of Technology (main campus) | HIT CS Postgraduate Guide

HTML 230 21 Updated Dec 11, 2023

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 833 42 Updated Nov 12, 2024

CVPR and NeurIPS poster examples and templates. May we have in-person poster sessions soon!

1,497 139 Updated May 9, 2023

🐧 Linux tutorial. Main contents: Linux commands, Linux system operations, software operations, and a curated selection of commonly used shell scripts

Shell 5,323 908 Updated Mar 12, 2024

In this repository, you can find my solutions to some exercises from the book "Understanding Machine Learning: From Theory to Algorithms" by Shai Shalev-Shwartz and Shai Ben-David.

13 2 Updated Feb 23, 2022