🎯
Focusing
  • Shenzhen

Organizations

@WHU-UAV

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Python 8,040 712 Updated Dec 10, 2023

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,599 187 Updated Dec 5, 2023

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,572 96 Updated Jul 6, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,220 758 Updated Jul 10, 2024
Python 67 4 Updated Jul 8, 2024

Vision utilities for web interaction agents 👀

Jupyter Notebook 1,296 66 Updated Jul 18, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

Python 575 29 Updated Jun 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,947 3,981 Updated Jul 22, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,047 39 Updated Jul 14, 2024

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Python 281 6 Updated Jun 21, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Jupyter Notebook 4,646 303 Updated Jun 28, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,156 946 Updated Jun 17, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 404 19 Updated Jul 18, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,262 138 Updated Jul 17, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,248 870 Updated Jun 17, 2024

Code for the paper "Query-Key Normalization for Transformers"

Jupyter Notebook 33 2 Updated Mar 6, 2021

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,040 751 Updated Jun 28, 2024

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,171 780 Updated Nov 21, 2023

A curated list of foundation models for vision and language tasks

694 30 Updated Jun 25, 2024

A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

Python 9,958 1,210 Updated Jun 10, 2024

Official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

159 8 Updated Oct 6, 2023

An attempt at Chinese text generation with Transformer-XL (can write novels and classical poetry).

Python 704 244 Updated Apr 7, 2022

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,614 237 Updated Jun 4, 2024

Stable Diffusion with Core ML on Apple Silicon

Python 16,504 890 Updated Jul 19, 2024

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 1,943 154 Updated Jun 25, 2024

Swap face between two photos.

Python 705 223 Updated Jun 13, 2023

A list of awesome papers about optical flow and related work.

398 28 Updated May 27, 2024

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 360 23 Updated Jun 5, 2024

Removes watermarks from videos using a watermark-subtraction method; fast but imperfect.

Python 311 77 Updated Sep 4, 2018

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 656 53 Updated Jul 4, 2024