Skip to content
View lioy123's full-sized avatar

Block or report lioy123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

This is the official implementation of the ICMR24 paper "UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos"

Jupyter Notebook 1 Updated Sep 2, 2024

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

Python 40 4 Updated Aug 31, 2024

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,518 192 Updated Aug 12, 2020

DSNet: A Flexible Detect-to-Summarize Network for Video Summarization

Python 207 50 Updated Sep 16, 2021

Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''

Python 34 4 Updated Jul 30, 2021

Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.

Python 31 2 Updated May 19, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,332 6,388 Updated Oct 3, 2024

Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021

Python 21 10 Updated Apr 5, 2022

Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"

Python 15 1 Updated Aug 2, 2022

A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"

Python 115 33 Updated Jun 20, 2023

The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)

Python 70 10 Updated Apr 24, 2023

A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.

Python 20 4 Updated Jan 13, 2022

Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)

Python 41 18 Updated Mar 21, 2024