Skip to content
View phellonchen's full-sized avatar
Block or Report

Block or report phellonchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Python 41 2 Updated May 13, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,639 1,855 Updated Jun 27, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,097 736 Updated Jul 10, 2024

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Python 296 16 Updated Aug 10, 2023

Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"

Python 37 5 Updated Dec 23, 2022

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Python 65 6 Updated Jul 12, 2023

[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

23 3 Updated May 18, 2023

Recent Advances in Visual Dialog

29 1 Updated Aug 19, 2022

A benchmark for the task of translation suggestion

Mask 59 25 Updated Jun 23, 2022

Recent Advances in Vision and Language Pre-training (VLP)

283 15 Updated Jun 6, 2023

End-to-end Speech Translation

Python 35 6 Updated Apr 12, 2021

Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)

Python 30 5 Updated Aug 5, 2021

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 9,907 1,522 Updated Jul 12, 2024

DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog

Python 24 5 Updated Mar 8, 2022

Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"

Python 65 14 Updated Mar 24, 2023

A implementation of SeqGAN in PyTorch, following the implementation in tensorflow.

Python 258 93 Updated Feb 20, 2019