Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Open-Sora: Democratizing Efficient Video Production for All
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models…
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
Rembg is a tool to remove image backgrounds
EfficientViT is a new family of vision models for efficient high-resolution vision.
Foundational Models for State-of-the-Art Speech and Text Translation
State-of-the-art 2D and 3D Face Analysis Project
Official PyTorch implementation of StyleGAN3
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Taming Transformers for High-Resolution Image Synthesis
CoTracker is a model for tracking any point (pixel) on a video.
Generative Models by Stability AI
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)
A state-of-the-art open visual language model | Multimodal pre-trained model
An Open-source Toolkit for LLM Development
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters