Skip to content
View Andy-Cheng's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report Andy-Cheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,678 191 Updated May 21, 2024
Python 3 Updated Dec 13, 2023

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 184 13 Updated May 31, 2024

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,134 33 Updated Jul 8, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,043 57 Updated Jul 9, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks

Python 660 77 Updated Jul 8, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,473 89 Updated Jul 6, 2024

Long Context Transfer from Language to Vision

Python 186 11 Updated Jul 3, 2024

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Python 56 4 Updated Jul 4, 2024

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

Python 85 Updated Jul 2, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 2,522 177 Updated Jul 3, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,238 114 Updated Jul 9, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,068 734 Updated Jul 5, 2024

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,318 123 Updated May 9, 2023

Allows to use your GoPro camera as a webcam on linux

Shell 500 63 Updated Jan 10, 2024

Monocular, One-stage, Regression of Multiple 3D People and their 3D positions & trajectories in camera & global coordinates. ROMP[ICCV21], BEV[CVPR22], TRACE[CVPR2023]

Python 1,306 227 Updated Oct 26, 2023

Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro

Python 133 7 Updated May 17, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 128,767 25,541 Updated Jul 9, 2024

Code Repository for Liquid Time-Constant Networks (LTCs)

Python 1,354 257 Updated Jun 3, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,216 261 Updated Jul 9, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,768 3,962 Updated Jul 9, 2024

[ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"

Python 302 32 Updated Feb 13, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 29,230 4,320 Updated Jul 8, 2024

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,145 339 Updated May 31, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 14,957 1,425 Updated Jul 9, 2024

Fast Diffusion Models with Transformers

Python 613 83 Updated Oct 7, 2023

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Spectral Recovery Challenge) and a toolbox for spectral reconstruction

Python 405 57 Updated Jun 17, 2024

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Python 3,088 328 Updated Jul 8, 2024

[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System

Python 466 16 Updated May 28, 2024
Next