  • [CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

    Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Jun 17, 2024
  • [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

    Python BSD 3-Clause "New" or "Revised" License Updated Jun 15, 2024
  • ViLMA Public

    Forked from ilkerkesen/ViLMA

    ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)

    Python MIT License Updated Jun 12, 2024
  • TestOfTime Public

    Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time

    Python 45 3 MIT License Updated Jun 11, 2024
  • [ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

    Python MIT License Updated Jun 10, 2024
  • LAVIS Public

    Forked from salesforce/LAVIS

    LAVIS - A One-stop Library for Language-Vision Intelligence

    Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Jun 10, 2024
  • Video Foundation Models & Data for Multimodal Understanding

    Python Apache License 2.0 Updated Jun 9, 2024
  • VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Python Apache License 2.0 Updated Jun 7, 2024
  • 【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

    Python MIT License Updated Jun 6, 2024
  • VTimeLLM Public

    Forked from huangb23/VTimeLLM

    [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

    Python Other Updated Jun 5, 2024
  • TempCompass Public

    Forked from llyx97/TempCompass

    [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

    Python 1 Updated May 25, 2024
  • Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)

    Jupyter Notebook MIT License Updated May 22, 2024
  • FCN-f0 Public

    Forked from ardaillon/FCN-f0

    Fully-Convolutional Network for Pitch Estimation of Speech Signals

    Python 1 MIT License Updated Feb 19, 2024
  • sam-pt Public

    Forked from SysCV/sam-pt

    SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

    Python 1 Apache License 2.0 Updated Jan 24, 2024
  • FastSAM Public

    Forked from CASIA-IVA-Lab/FastSAM

    Fast Segment Anything

    Python 1 GNU Affero General Public License v3.0 Updated Jan 22, 2024
  • A bunch of useful scripts to add when setting up a new machine

    Shell 1 Updated Jan 18, 2024
  • digan Public

    Forked from sihyun-yu/digan

    Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022).

    Python Updated Jan 14, 2024
  • Clone of the WACV2023 paper. Adaptation on pouring water.

    Python MIT License Updated Jan 9, 2024
  • Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

    Python 1 Other Updated Jan 3, 2024
  • Clone of the Sound2Scene repo. Need to train on pouring water images.

    Python 1 Updated Jan 2, 2024
  • PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

    Jupyter Notebook 1 Apache License 2.0 Updated Dec 31, 2023
  • VITATECS Public

    Forked from lscpku/VITATECS

    Python 1 Updated Nov 30, 2023
  • bpiyush Public

    My personal introductory repository

    Updated Nov 17, 2023
  • We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.

    Jupyter Notebook 1 MIT License Updated Oct 31, 2023
  • A portfolio page

    JavaScript MIT License Updated Oct 2, 2023
  • VideoMAE-ssl Public

    Forked from MCG-NJU/VideoMAE

    [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

    Python Other Updated Jul 25, 2023
  • Tests for codec artefacts in stored audio samples.

    Python MIT License Updated Mar 27, 2023
  • NLP-CS671A Public

    Course files for CS671A - Natural Language Processing

    Python 1 Updated Feb 15, 2023
  • Tutorial on scraping YouTube videos for research purposes.

    Jupyter Notebook MIT License Updated Dec 11, 2022
  • Rotation equivariance meets local feature matching

    Jupyter Notebook 18 MIT License Updated Oct 20, 2022