SY-Xuan

Shiyu Xuan SY-Xuan

Try to make our world a better place!

41 followers · 6 following

Achievements

Highlights

Stars

GXNU-ZhongLab / ODTrack

The official implementation for the paper [ODTrack: Online Dense Temporal Token Learning for Visual Tracking].

Python 102 9 Updated Oct 7, 2024

facebookresearch / grounded-video-description

Video Grounding and Captioning

Python 323 72 Updated Oct 12, 2021

prs-eth / Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,374 134 Updated Sep 17, 2024

xuboyue1999 / RGBT-Tracking

This repository contains the necessary tools for RGBT tracking, including datasets（GTOT, RGBT234, LasHeR）, evaluation tools, visualization tools, and results of existing works.

MATLAB 16 3 Updated Jun 18, 2024

HengLan / VastTrack

[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking

Python 48 2 Updated Sep 28, 2024

ip7z / 7zip

7-Zip

C++ 801 78 Updated Aug 12, 2024

hywang2002 / MV-VTON

MV-VTON: Multi-View Virtual Try-On with Diffusion Models

Python 92 6 Updated Jul 8, 2024

wangxiao5791509 / VisEvent_SOT_Benchmark

[IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.

Python 124 9 Updated Sep 28, 2024

BUGPLEASEOUT / LasHeR

A Large-scale High-diversity Benchmark for RGBT Tracking

Shell 47 3 Updated Jul 9, 2022

xiaozai / DeT

Dataset and Code for the paper "DepthTrack: Unveiling the Power of RGBD Tracking" (ICCV2021), and "Depth-only Object Tracking" (BMVC2021)

Python 81 11 Updated May 5, 2022

jiawen-zhu / ViPT

[CVPR23] Visual Prompt Multi-Modal Tracking

Python 259 18 Updated Jul 28, 2023

lizhou-cs / JointNLT

The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.

Python 58 4 Updated Jun 3, 2023

Fantasy-Studio / Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Python 1,109 98 Updated Nov 28, 2023

SiatMMLab / Awesome-Diffusion-Model-Based-Image-Editing-Methods

Diffusion Model-Based Image Editing: A Survey (arXiv)

474 32 Updated Nov 7, 2024

Xuchen-Li / Awesome-Visual-Object-Tracking

A visual object tracking paper list, articles related to visual object tracking have been documented.

24 Updated Nov 6, 2024

mihirp1998 / Diffusion-TTA

Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

Python 56 4 Updated Mar 24, 2024

baaivision / DIVA

Diffusion Feedback Helps CLIP See Better

Python 214 11 Updated Aug 24, 2024

altndrr / vic

Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification

Python 102 3 Updated Feb 2, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,283 981 Updated Nov 6, 2024

apple / ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 614 37 Updated Oct 14, 2024

wusize / F-LMM

Code Release of F-LMM: Grounding Frozen Large Multimodal Models

Python 42 Updated Aug 5, 2024

zamling / PSALM

[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

Python 190 9 Updated Sep 3, 2024

baaivision / EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 228 3 Updated Oct 2, 2024

wl-zhao / VPD

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.

Jupyter Notebook 511 31 Updated Dec 21, 2023

Tsingularity / dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 614 33 Updated May 14, 2024

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python 8,861 560 Updated Nov 12, 2024

OpenGVLab / VisionLLM

VisionLLM Series

Python 905 26 Updated Oct 18, 2024

microsoft / TypeChat

TypeChat is a library that makes it easy to build natural language interfaces using types.

TypeScript 8,230 391 Updated Sep 21, 2024

983632847 / Awesome-Multimodal-Object-Tracking

A personal investigative project to track the latest progress in the field of multi-modal object tracking.

Python 105 11 Updated Nov 11, 2024

phiphiphi31 / DiffusionTrack

Python 6 Updated Jul 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shiyu Xuan SY-Xuan

Achievements

Achievements

Highlights

Block or report SY-Xuan

Stars

GXNU-ZhongLab / ODTrack

facebookresearch / grounded-video-description

prs-eth / Marigold

xuboyue1999 / RGBT-Tracking

HengLan / VastTrack

ip7z / 7zip

hywang2002 / MV-VTON

wangxiao5791509 / VisEvent_SOT_Benchmark

BUGPLEASEOUT / LasHeR

xiaozai / DeT

jiawen-zhu / ViPT

lizhou-cs / JointNLT

Fantasy-Studio / Paint-by-Example

SiatMMLab / Awesome-Diffusion-Model-Based-Image-Editing-Methods

Xuchen-Li / Awesome-Visual-Object-Tracking

mihirp1998 / Diffusion-TTA

baaivision / DIVA

altndrr / vic

mlfoundations / open_clip

apple / ml-mobileclip

wusize / F-LMM

zamling / PSALM

baaivision / EVE

wl-zhao / VPD

Tsingularity / dift

voxel51 / fiftyone

OpenGVLab / VisionLLM

microsoft / TypeChat

983632847 / Awesome-Multimodal-Object-Tracking

phiphiphi31 / DiffusionTrack