Starred repositories
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Reference models and tools for Cloud TPUs.
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI
Efficient neural feature detector and descriptor
Code and models of paper " ECO: Efficient Convolutional Network for Online Video Understanding", ECCV 2018
Automatic Large-Scale Data Acquisition via Crowdsourcing for Crosswalk Classification: A Deep Learning Approach (C&G, 2017)