Zhejiang University, Harbin Institute of Technology
Shanghai
Stars
This is a resource list for low-light image enhancement
A repository to keep track of Deep Learning based methods for visual odometry (pull requests are always welcome)
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition
MambaOut: Do We Really Need Mamba for Vision?
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
The official code for NF-Atlas: Multi-Volume Neural Feature Fields for Large Scale LiDAR Mapping
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
[IJCV 2024] InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions
This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
A library for machine learning research on motion capture data
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of f…
Scripts for numerical evaluations for the GENEA Gesture Generation Challenge
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
Robust Speech Recognition via Large-Scale Weak Supervision
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
Code repository for the paper "Tracking People by Predicting 3D Appearance, Location & Pose". (CVPR 2022 Oral)
Official repo of "MotionLLM: Multimodal Motion-Language Learning with Large Language Models"
This repository contains scripts to build the Youtube Gesture Dataset.
Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Task Planner for Heterogeneous Multi-robot Teams with Battery Constraints