-
University of Central Florida
- Orlando, FL
- https://akash2907.github.io/
Highlights
- Pro
Block or Report
Block or report AKASH2907
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]
[ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction
Multi-modal Prompting for Open-vocabulary Video Visual Relationship Detection(AAAI2024)
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
A curated list of awesome self-supervised learning methods in videos
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
This repository contains the source code for the paper First Order Motion Model for Image Animation
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
State-of-the-art 2D and 3D Face Analysis Project
S3D Text-Video model trained on HowTo100M using MIL-NCE
Simple code for generating a color-coded latex table from raw data
Code for the paper "Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection"
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio
Hiera: A fast, powerful, and simple hierarchical vision transformer.