DHUAVY

BITTIO DHUAVY

北京航空航天大学-软件学院

5 followers · 27 following

Block or Report

Block or report DHUAVY

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

google-research / vision_transformer

Jupyter Notebook 9,784 1,241 Updated May 21, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 18,930 2,879 Updated Jul 9, 2024

cliport / cliport

CLIPort: What and Where Pathways for Robotic Manipulation

Jupyter Notebook 436 81 Updated Nov 2, 2023

alfworld / alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 298 45 Updated Jun 18, 2024

R-J96 / stainFuser

Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images

Python 13 2 Updated Jul 15, 2024

runwayml / stable-diffusion

Latent Text-to-Image Diffusion

Jupyter Notebook 3,757 449 Updated May 18, 2023

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 37,671 4,861 Updated Jun 16, 2024

CraftJarvis / MC-Planner

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

Python 231 18 Updated Aug 3, 2023

CraftJarvis / MC-TextWorld

Text world based on Minecraft rules.

Python 11 Updated May 13, 2024

hyunwoongko / transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 2,516 393 Updated Apr 17, 2024

Zhoues / MineDreamer

This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "

Python 60 4 Updated Jun 30, 2024

CraftJarvis / JARVIS-1

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Java 322 14 Updated Apr 8, 2024

whiteyunjie / ROAM

Python 4 Updated Jul 18, 2024

mahmoodlab / CLAM

Data-efficient and weakly supervised computational pathology on whole slide images - Nature Biomedical Engineering

Python 974 328 Updated Jul 14, 2024

GaParmar / img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,334 145 Updated Jul 4, 2024

Janspiry / Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Python 1,446 192 Updated Jul 7, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 29,130 2,632 Updated Feb 25, 2024

fudan-generative-vision / champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 3,478 416 Updated Jul 10, 2024

huanngzh / Parts2Whole

[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Python 152 6 Updated Jul 18, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 33,873 4,021 Updated Jul 10, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,526 677 Updated Jul 18, 2024

run-llama / llama_index

LlamaIndex is a data framework for your LLM applications

Python 33,736 4,742 Updated Jul 19, 2024

amusi / CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

17,260 2,548 Updated Jul 4, 2024

salesforce / BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,487 599 Updated May 20, 2024

ollama / ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 79,181 6,038 Updated Jul 19, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,460 324 Updated Jun 16, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,201 2,451 Updated Jul 15, 2024

DHUAVY / BIALBEF

Forked from salesforce/ALBEF

Code for ALBEF: a new vision-language pre-training method

Python 1 Updated Jan 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly