Language: Python
Sort by: Most stars
Starred repositories
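Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.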
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
High-Resolution Image Synthesis with Latent Diffusion Models
LlamaIndex is a data framework for your LLM applications
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
ModelScope: bring the notion of Model-as-a-Service to life.
A unified framework for 3D content generation.
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Open-source tool for automatically generating short videos
Transformer: PyTorch Implementation of "Attention Is All You Need"
Code for ALBEF: a new vision-language pre-training method
Unofficial implementation of Palette: Image-to-Image Diffusion Models in PyTorch
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Data-efficient and weakly supervised computational pathology on whole slide images - Nature Biomedical Engineering
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Large-scale text-video dataset. 10 million captioned short videos.
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR2022)
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
General video interaction platform based on LLMs, including Video ChatGPT