Skip to content
View hysts's full-sized avatar

Block or report hysts

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers).

Python 215 20 Updated Nov 6, 2024
Python 1,067 79 Updated Nov 7, 2024

Training-free Regional Prompting for Diffusion Transformers 🔥

Python 239 10 Updated Nov 5, 2024

InstantIR: Blind Image Restoration with Instant Generative Reference 🔥

Python 145 5 Updated Nov 7, 2024
Python 120 5 Updated Nov 4, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,111 61 Updated Nov 7, 2024

Official PyTorch implementation of "Framer: Interactive Frame Interpolation".

259 10 Updated Nov 7, 2024
Jupyter Notebook 501 23 Updated Nov 1, 2024

Official implementation of “LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images”

Python 45 Updated Nov 6, 2024

This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"

Python 88 3 Updated Nov 3, 2024

GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models

Python 124 5 Updated Oct 25, 2024

3D Reconstruction for all

Rust 772 17 Updated Nov 7, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 4,361 323 Updated Nov 5, 2024

DepthSplat: Connecting Gaussian Splatting and Depth

Python 521 19 Updated Nov 3, 2024

The code for the Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness paper

Jupyter Notebook 16 1 Updated Oct 14, 2024

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 541 37 Updated Oct 31, 2024

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 412 28 Updated Oct 31, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 2,000 126 Updated Nov 7, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 7,424 826 Updated Nov 7, 2024

A custom Gradio component that toggles between on and off states.

Svelte 8 Updated Oct 19, 2024

The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"

Python 572 35 Updated Oct 31, 2024

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,554 496 Updated Nov 6, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 6,725 783 Updated Nov 7, 2024

Official implementation of 'Motion Inversion For Video Customization'

Python 122 8 Updated Oct 22, 2024
Python 7 2 Updated Oct 11, 2024
Python 222 26 Updated Oct 11, 2024

Depth Any Video with Scalable Synthetic Data

Python 384 25 Updated Oct 25, 2024

[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation

Python 256 5 Updated Oct 17, 2024

Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Python 115 9 Updated Oct 13, 2024

Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)

Python 168 12 Updated Oct 23, 2024
Next