Update the latest text-related papers from top conferences

19 3 Updated Aug 25, 2024

This project aims to share the technical principles of large models and hands-on experience with them.

HTML 9,461 926 Updated Sep 22, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

880 34 Updated Oct 9, 2024

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 135 9 Updated Jun 13, 2024

The repo for In-context Autoencoder

Jupyter Notebook 83 6 Updated May 11, 2024

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 273 20 Updated Sep 9, 2024

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Python 11 1 Updated Jun 6, 2024

The common sites and tools.

Python 6 Updated Nov 22, 2022

The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024

Python 11 1 Updated Sep 12, 2024

The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024

Python 23 1 Updated Jul 27, 2024

Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. IEEE Transactions on Image Processing (2018)

Python 208 76 Updated Apr 29, 2019

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Python 157 9 Updated Mar 17, 2023

The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"

Python 167 9 Updated Apr 10, 2024

Longformer: The Long-Document Transformer

Python 2,037 273 Updated Feb 8, 2023

[CVPR 2023] Egocentric Audio-Visual Object Localization

Python 23 Updated Jan 6, 2024

[TCSVT 2024] Official implementation of the paper: Benchmarking Micro-action Recognition: Dataset, Methods, and Applications

Jupyter Notebook 14 1 Updated Aug 21, 2024

A curated list of awesome temporal action segmentation resources.

151 13 Updated Apr 4, 2024

[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"

Python 99 10 Updated Aug 3, 2023
Python 12 Updated Mar 17, 2024

Code for our IJCAI 2024 paper "DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition"

3 Updated May 3, 2024

Unified Audio-Visual Perception for Multi-Task Video Localization

Python 17 Updated Apr 19, 2024

Official repository for the paper PLLaVA

Python 573 38 Updated Jul 28, 2024
Python 55 6 Updated Nov 17, 2023
Python 13 2 Updated Nov 13, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,696 2,042 Updated Jul 24, 2024

Official PyTorch implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024

Python 40 1 Updated Sep 11, 2024
Python 10 2 Updated Jun 21, 2024

A curated list of audio-visual learning methods and datasets.

223 17 Updated Sep 11, 2024

This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.

Python 102 7 Updated Aug 10, 2024