Skip to content
View DHUAVY's full-sized avatar
Block or Report

Block or report DHUAVY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Jupyter Notebook 9,784 1,241 Updated May 21, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 18,930 2,879 Updated Jul 9, 2024

CLIPort: What and Where Pathways for Robotic Manipulation

Jupyter Notebook 436 81 Updated Nov 2, 2023

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 298 45 Updated Jun 18, 2024

Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images

Python 13 2 Updated Jul 15, 2024

Latent Text-to-Image Diffusion

Jupyter Notebook 3,757 449 Updated May 18, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Python 37,671 4,861 Updated Jun 16, 2024

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

Python 231 18 Updated Aug 3, 2023

Text world based on Minecraft rules.

Python 11 Updated May 13, 2024

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 2,516 393 Updated Apr 17, 2024

This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "

Python 60 4 Updated Jun 30, 2024

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Java 322 14 Updated Apr 8, 2024
Python 4 Updated Jul 18, 2024

Data-efficient and weakly supervised computational pathology on whole slide images - Nature Biomedical Engineering

Python 974 328 Updated Jul 14, 2024

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,334 145 Updated Jul 4, 2024

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Python 1,446 192 Updated Jul 7, 2023

Let us control diffusion models!

Python 29,130 2,632 Updated Feb 25, 2024

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 3,478 416 Updated Jul 10, 2024

[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Python 152 6 Updated Jul 18, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 33,873 4,021 Updated Jul 10, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,526 677 Updated Jul 18, 2024

LlamaIndex is a data framework for your LLM applications

Python 33,736 4,742 Updated Jul 19, 2024

CVPR 2024 论文和开源项目合集

17,260 2,548 Updated Jul 4, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,487 599 Updated May 20, 2024

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 79,181 6,038 Updated Jul 19, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,460 324 Updated Jun 16, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,201 2,451 Updated Jul 15, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1 Updated Jan 8, 2024

A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)

Python 20 6 Updated May 27, 2020
Python 16 5 Updated Sep 24, 2021
Next