Skip to content
View gi2wzh's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report gi2wzh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,801 109 Updated Jul 29, 2024

PromptBERT: Improving BERT Sentence Embeddings with Prompts

Python 330 32 Updated Nov 22, 2023

When do we not need larger vision models?

Python 328 9 Updated Aug 19, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 11 3 Updated Dec 4, 2023

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,979 790 Updated Aug 7, 2024

The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024

Python 17 Updated Jul 30, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,274 150 Updated Aug 23, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 953 63 Updated Oct 6, 2024

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL 736 150 Updated Mar 15, 2023

✨✨Latest Advances on Multimodal Large Language Models

12,212 779 Updated Oct 16, 2024

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Python 194 30 Updated Mar 14, 2023

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,947 408 Updated May 29, 2024

Starter code for working with the YouTube-8M dataset.

Python 2,313 848 Updated Oct 25, 2021

A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free-text description

24 2 Updated Jan 15, 2022

Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

Jupyter Notebook 112 12 Updated Oct 27, 2023
Python 107 19 Updated Jun 27, 2021

Mamba SSM architecture

Python 12,874 1,092 Updated Oct 13, 2024

Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.

Jupyter Notebook 344 36 Updated May 13, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

3,540 304 Updated Sep 20, 2024

[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation

Python 150 15 Updated May 10, 2023
Python 14 Updated Apr 29, 2024

Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]

Python 107 8 Updated Apr 18, 2024

The suite of modeling video with Mamba

Python 223 21 Updated May 14, 2024

Official inference library for Mistral models

Jupyter Notebook 9,630 848 Updated Sep 20, 2024

Train transformer language models with reinforcement learning.

Python 9,752 1,229 Updated Oct 16, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,120 1,587 Updated Oct 15, 2024

Inference code for Llama models

Python 56,029 9,523 Updated Aug 18, 2024

The official Meta Llama 3 GitHub site

Python 26,676 3,018 Updated Aug 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,727 5,799 Updated Aug 19, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,457 1,432 Updated Sep 19, 2024
Next