Starred repositories

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

417 9 Updated Jul 20, 2024
Python 11 3 Updated Jun 16, 2024

Official repository for "Unveiling and Mitigating Bias in Audio Visual Segmentation" in ACM MM 2024

1 Updated Jul 21, 2024
Python 1 Updated Feb 24, 2024
Python 1 Updated Mar 20, 2024
Python 11 Updated Jul 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,278 3,310 Updated Jul 21, 2024

[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation

Python 48 2 Updated Jan 20, 2024

[2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line

Python 21 4 Updated Mar 6, 2023

Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"

Python 27 3 Updated Jul 11, 2024

The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024

Python 11 Updated Jul 18, 2024

The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024

Python 6 1 Updated Jul 17, 2024

An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)

Python 17 2 Updated Sep 28, 2023

Spatial Sparse Convolution Library

Python 1,798 360 Updated Jul 8, 2024

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Python 223 20 Updated Jun 30, 2024

Explore the Limits of Omni-modal Pretraining at Scale

Python 72 3 Updated Jun 28, 2024

A repository of papers, code, and datasets on domain generalization-based fault diagnosis and prognosis (continuously updated).

139 20 Updated Jul 2, 2024

A collection of problems encountered during PhD study, with related resources.

29 1 Updated May 20, 2024

METER for Online Anomaly Detection

Python 10 3 Updated Jul 12, 2024

Faster Whisper transcription with CTranslate2

Python 10,401 873 Updated Jul 21, 2024

An API interface project for CosyVoice.

Python 18 4 Updated Jul 18, 2024
4 Updated Jun 23, 2024
Python 14 4 Updated Oct 2, 2023

Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)

Python 19 1 Updated Sep 1, 2023

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

Python 63 16 Updated Apr 28, 2024

Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024

Python 32 3 Updated Mar 17, 2024

Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"

Python 12 1 Updated Mar 27, 2024

Collection of awesome parameter-efficient fine-tuning resources.

409 10 Updated Jul 11, 2024

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Python 26 Updated Jun 25, 2024

A comprehensive collection of awesome research and other items about video domain adaptation

92 6 Updated Mar 22, 2024