Stars
We write your reusable computer vision tools. 💜
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
A comprehensive collection of IQA papers
This repo contains a from-scratch PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis). I have added support for custom datasets, testing, experiment track…
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A PyTorch implementation of SRGAN based on CVPR 2017 paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"
Fast and memory-efficient exact attention
Latte: Latent Diffusion Transformer for Video Generation.
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Contrastive Predictive Coding for Image Recognition
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Implementation of Medfusion - A latent diffusion model for medical image synthesis.
3D ResNets for Action Recognition (CVPR 2018)
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Deconvolution and Checkerboard Artifacts
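Llama Chinese community: Llama3 online demo and fine-tuned models are now available, with the latest Llama3 learning resources aggregated in real time. All code has been updated for Llama3, aiming to build the best Chinese Llama large model; fully open source and commercially usable.
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)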
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
PyTorch implementation of SimSiam https://arxiv.org/abs/2011.10566
PyTorch implementation of "Taming Transformers for High-Resolution Image Synthesis" (VQGAN)
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
Taming Transformers for High-Resolution Image Synthesis