Skip to content
View koukyo1994's full-sized avatar
🌴
On vacation
🌴
On vacation

Highlights

  • Pro
Block or Report

Block or report koukyo1994

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

LLaVA-JP is a Japanese VLM trained by LLaVA method

Python 46 9 Updated Jul 3, 2024

world modeling challenge for humanoid robots

Python 131 7 Updated Jul 11, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 338 10 Updated Jul 10, 2024

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Python 47 Updated May 23, 2024

commaVQ is a dataset of compressed driving video

Jupyter Notebook 275 45 Updated Jul 8, 2024

A Generalizable World Model for Autonomous Driving

Python 409 20 Updated Jun 17, 2024
Python 37 6 Updated Apr 24, 2024

Code to Blur Human Faces and Vehicle License Plates in Video and Images using a SoTA Object Detection model YOLOv8

Python 22 2 Updated Aug 13, 2023

CVPR 2024 论文和开源项目合集

17,301 2,548 Updated Jul 4, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 498 29 Updated Jul 23, 2024

Stable Video Diffusion Training Code and Extensions.

Python 483 45 Updated Jul 23, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,039 134 Updated Jun 25, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,570 158 Updated Jul 12, 2024

Collect some World Models for Autonomous Driving papers.

321 5 Updated Jul 20, 2024

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python 45 2 Updated Jul 22, 2024

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Python 1,202 125 Updated May 3, 2024

Image Generation using VQVAE and GPT Models

Jupyter Notebook 8 2 Updated May 17, 2023

Open-Sora: Democratizing Efficient Video Production for All

Python 20,847 1,974 Updated Jul 16, 2024

A curated list of foundation models for vision and language tasks

695 31 Updated Jun 25, 2024

[IEEE T-PAMI] All you need for End-to-end Autonomous Driving

1,785 176 Updated May 6, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 28,664 9,316 Updated Jul 22, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 41,836 4,996 Updated Jul 23, 2024

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,050 504 Updated May 16, 2024

日本語LLMまとめ - Overview of Japanese LLMs

TypeScript 885 25 Updated Jul 21, 2024

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Python 2,935 444 Updated Nov 7, 2022

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Python 11,044 1,202 Updated Jul 23, 2024

A Supervised and Semi-Supervised Object Detection Library for YOLO Series

Python 805 147 Updated Mar 28, 2023

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Python 331 28 Updated Jul 14, 2024

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,538 133 Updated Jul 29, 2023

SAM with text prompt

Jupyter Notebook 1,401 148 Updated Jul 20, 2024
Next