Skip to content
View Jambo-12's full-sized avatar

Block or report Jambo-12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Explainable Video Action Reasoning via Prior Knowledge and State Transitions

Jupyter Notebook 21 1 Updated Jun 20, 2024

Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021

Jupyter Notebook 187 34 Updated Aug 22, 2022

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 208 11 Updated Jun 13, 2024

Code for the ECCV'22 paper "Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos".

Python 25 4 Updated Feb 5, 2024

A video database bridging human actions and human-object relationships

Python 127 17 Updated Jun 30, 2020

[NeurIPS2023] Neural-Logic Human-Object Interaction Detection

Python 9 2 Updated Aug 24, 2024

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 545 57 Updated Jun 7, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,784 272 Updated Aug 1, 2024

Tips for Writing a Research Paper using LaTeX

TeX 2,453 317 Updated May 4, 2023

A simple code for plotting figure, colorbar, and cropping with python

Python 351 44 Updated Apr 13, 2022

Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"

Python 18 2 Updated Apr 16, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 49,220 4,777 Updated Sep 19, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,418 392 Updated Sep 8, 2024

https://layer6ai-labs.github.io/xpool/

Python 111 9 Updated Jul 1, 2023

Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]

Python 50 28 Updated Jun 2, 2023

Knowledgebase embedding with your company data

Python 115 67 Updated Feb 15, 2024

Home Action Genome: Cooperative Contrastive Action Understanding

Python 19 3 Updated Nov 8, 2021

[IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition

Python 17 2 Updated Dec 19, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,743 954 Updated Aug 23, 2024

Official repository of paper "Subobject-level Image Tokenization"

Python 60 5 Updated Apr 25, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 9,475 926 Updated Sep 22, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,369 71 Updated Oct 9, 2024

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 292 19 Updated Jul 19, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,900 372 Updated Aug 7, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,343 85 Updated Sep 23, 2024
Python 18 Updated Jan 29, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,473 153 Updated Aug 30, 2024

Top free VPN (ClashX & V2Ray proxy) with subscription links. [免费VPN、免费梯子、免费科学上网、免费订阅链接、免费节点、精选、ClashX & V2Ray 教程]

Python 3,702 319 Updated Jul 25, 2024
Next