bpiyush

Piyush Bagad bpiyush

1st year DPhil, VGG, Oxford. Past: MSc in AI from UvA | Research @ Wadhwani AI | B.S. in Mathematics @ IIT Kanpur

22 followers · 5 following

University of Oxford
Oxford
20:07 (UTC -12:00)
bpiyush.github.io
@bagad_piyush

Achievements

Highlights

TimeChat Public
Forked from RenShuhuai-Andy/TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Jun 17, 2024
Video-LLaMA Public
Forked from DAMO-NLP-SG/Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python BSD 3-Clause "New" or "Revised" License Updated Jun 15, 2024
ViLMA Public
Forked from ilkerkesen/ViLMA

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)

Python MIT License Updated Jun 12, 2024
TestOfTime Public

Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time

Python 45 3 MIT License Updated Jun 11, 2024
unmasked_teacher Public
Forked from OpenGVLab/unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Python MIT License Updated Jun 10, 2024
LAVIS Public
Forked from salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Jun 10, 2024
InternVideo Public
Forked from OpenGVLab/InternVideo

Video Foundation Models & Data for Multimodal Understanding

Python Apache License 2.0 Updated Jun 9, 2024
VideoLLaMA2 Public
Forked from DAMO-NLP-SG/VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python Apache License 2.0 Updated Jun 7, 2024
LanguageBind Public
Forked from PKU-YuanGroup/LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python MIT License Updated Jun 6, 2024
VTimeLLM Public
Forked from huangb23/VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python Other Updated Jun 5, 2024
TempCompass Public
Forked from llyx97/TempCompass

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 1 Updated May 25, 2024
ddsp-pytorch Public
Forked from sweetcocoa/ddsp-pytorch

Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)

Jupyter Notebook MIT License Updated May 22, 2024
FCN-f0 Public
Forked from ardaillon/FCN-f0

Fully-Convolutional Network for Pitch Estimation of Speech Signals

Python 1 MIT License Updated Feb 19, 2024
sam-pt Public
Forked from SysCV/sam-pt

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Python 1 Apache License 2.0 Updated Jan 24, 2024
FastSAM Public
Forked from CASIA-IVA-Lab/FastSAM

Fast Segment Anything

Python 1 GNU Affero General Public License v3.0 Updated Jan 22, 2024
new-machine-setup-scripts Public

Bunch of scripts useful to add when starting on a new machine

Shell 1 Updated Jan 18, 2024
digan Public
Forked from sihyun-yu/digan

Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022).

Python Updated Jan 14, 2024
PhysParamInference Public
Forked from florianHofherr/PhysParamInference

Clone of the WACV2023 paper. Adaptation on pouring water.

Python MIT License Updated Jan 9, 2024
sound-guided-semantic-image-manipulation Public
Forked from kuai-lab/sound-guided-semantic-image-manipulation

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

Python 1 Other Updated Jan 3, 2024
Sound2Scene Public
Forked from postech-ami/Sound2Scene

Clone of the Sound2Scene repo. Need to train on pouring water images.

Python 1 Updated Jan 2, 2024
dino-local Public
Forked from facebookresearch/dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Jupyter Notebook 1 Apache License 2.0 Updated Dec 31, 2023
VITATECS Public
Forked from lscpku/VITATECS

Python 1 Updated Nov 30, 2023
bpiyush Public

My personal introductory repository

Updated Nov 17, 2023
transparent-liquid-segmentation Public
Forked from gauthamnarayan/transparent-liquid-segmentation

We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.

Jupyter Notebook 1 MIT License Updated Oct 31, 2023
bpiyush.github.io Public

A portfolio page

JavaScript MIT License Updated Oct 2, 2023
VideoMAE-ssl Public
Forked from MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python Other Updated Jul 25, 2023
audio_codec_tests Public

Tests for codec artefacts in stored audio samples.

Python MIT License Updated Mar 27, 2023
NLP-CS671A Public

Course files for CS671A - Natural Language Processing

Python 1 Updated Feb 15, 2023
YouTube-scrapper-tutorial Public

Tutorial to scrape YouTube video for research purposes.

Jupyter Notebook MIT License Updated Dec 11, 2022
rotation-equivariant-lfm Public

Rotation equivariance meets local feature matching

Jupyter Notebook 18 MIT License Updated Oct 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Piyush Bagad bpiyush

Achievements

Achievements

Highlights

Block or report bpiyush

TimeChat Public

Video-LLaMA Public

ViLMA Public

TestOfTime Public

unmasked_teacher Public

LAVIS Public

InternVideo Public

VideoLLaMA2 Public

LanguageBind Public

VTimeLLM Public

TempCompass Public

ddsp-pytorch Public

FCN-f0 Public

sam-pt Public

FastSAM Public

new-machine-setup-scripts Public

digan Public

PhysParamInference Public

sound-guided-semantic-image-manipulation Public

Sound2Scene Public

dino-local Public

VITATECS Public

bpiyush Public

transparent-liquid-segmentation Public

bpiyush.github.io Public

VideoMAE-ssl Public

audio_codec_tests Public

NLP-CS671A Public

YouTube-scrapper-tutorial Public

rotation-equivariant-lfm Public