xingling0

1 follower · 1 following

Stars

TongkunGuan / Text-Related-Papers

Update the latest text-related papers from top conferences

19 3 Updated Aug 25, 2024

liguodongiot / llm-action

This project aims to share the technical principles and hands-on experience related to large language models.

HTML 9,461 926 Updated Sep 22, 2024

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

880 34 Updated Oct 9, 2024

princeton-nlp / CEPE

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 135 9 Updated Jun 13, 2024

getao / icae

The repo for In-context Autoencoder

Jupyter Notebook 83 6 Updated May 11, 2024

princeton-nlp / AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 273 20 Updated Sep 9, 2024

showlab / VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Python 11 1 Updated Jun 6, 2024

cs-hao / Learning-tools

The common sites and tools.

Python 6 Updated Nov 22, 2022

GeWu-Lab / Stepping-Stones

The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024

Python 11 1 Updated Sep 12, 2024

GeWu-Lab / Ref-AVS

The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024

Python 23 1 Updated Jul 27, 2024

marcellacornia / sam

Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. IEEE Transactions on Image Processing (2018)

Python 208 76 Updated Apr 29, 2019

ViTAE-Transformer / ViTAE-VSA

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Python 157 9 Updated Mar 17, 2023

ViTAE-Transformer / QFormer

The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"

Python 167 9 Updated Apr 10, 2024

allenai / longformer

Longformer: The Long-Document Transformer

Python 2,037 273 Updated Feb 8, 2023

WikiChao / Ego-AV-Loc

[CVPR 2023] Egocentric Audio-Visual Object Localization

Python 23 Updated Jan 6, 2024

VUT-HFUT / Micro-Action

[TCSVT 2024] Official implementation of the paper: Benchmarking Micro-action Recognition: Dataset, Methods, and Applications

Jupyter Notebook 14 1 Updated Aug 21, 2024

nus-cvml / awesome-temporal-action-segmentation

A curated list of awesome temporal action segmentation resources.

151 13 Updated Apr 4, 2024

sauradip / STALE

[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"

Python 99 10 Updated Aug 3, 2023

Z2HENG / DeTAL

Python 12 Updated Mar 17, 2024

quhongyu / DTS-TPT

Code for our IJCAI 2024 paper "DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition"

3 Updated May 3, 2024

ttgeng233 / UniAV

Unified Audio-Visual Perception for Multi-Task Video Localization

Python 17 Updated Apr 19, 2024

magic-research / PLLaVA

Official repository for the paper PLLaVA

Python 573 38 Updated Jul 28, 2024

FreeformRobotics / EAEFNet

Python 55 6 Updated Nov 17, 2023

Jamie725 / Multimodal-Object-Detection-via-Probabilistic-Ensembling

Python 141 20 Updated Mar 3, 2024

fyyCS / LSLD

Python 13 2 Updated Nov 13, 2023

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,696 2,042 Updated Jul 24, 2024

benedettaliberatori / T3AL

Official Pytorch implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024

Python 40 1 Updated Sep 11, 2024

enyac-group / T-VSL

Python 10 2 Updated Jun 21, 2024

GeWu-Lab / awesome-audiovisual-learning

A curated list of audio-visual learning methods and datasets.

223 17 Updated Sep 11, 2024

yinjunbo / IS-Fusion

This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.

Python 102 7 Updated Aug 10, 2024