Skip to content
View huizhang0110's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report huizhang0110

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Distributed vector search for AI-native applications

Go 2,039 327 Updated Oct 8, 2024

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,838 160 Updated Jul 17, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 569 30 Updated Sep 13, 2024

A curated list of foundation models for vision and language tasks

793 35 Updated Oct 5, 2024

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Python 158 10 Updated Oct 3, 2024

DOM to Semantic-Markdown for use with LLMs

TypeScript 651 14 Updated Oct 6, 2024

Explore the Limits of Omni-modal Pretraining at Scale

Python 81 4 Updated Sep 2, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 720 56 Updated Oct 2, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 11,058 767 Updated Oct 9, 2024

Kolors Team

Python 3,682 245 Updated Sep 4, 2024

Streamlit file browser

Python 103 18 Updated Apr 23, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,812 2,114 Updated Aug 9, 2024

A Node for ComfyUI that does what you ask it to do

Python 488 33 Updated Jul 7, 2024

Your image is almost there!

Python 7,248 417 Updated Jul 26, 2024

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 9,642 926 Updated Sep 26, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 631 31 Updated Aug 13, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,527 530 Updated Jul 25, 2024

A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the performance of the open-source model Qwen-VL-7B-Chat.

Python 14 1 Updated Feb 5, 2024
Python 45 1 Updated Sep 5, 2024

Generative Models by Stability AI

Python 24,299 2,705 Updated Sep 4, 2024

👾 Open source implementation of the ChatGPT Code Interpreter

Python 3,762 401 Updated May 14, 2024

Images to inference with no labeling (use foundation models to train supervised models).

Python 1,896 150 Updated Sep 19, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,670 506 Updated Jul 18, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,614 4,520 Updated Oct 6, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,544 127 Updated Aug 4, 2024

[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Python 426 26 Updated Aug 10, 2024

understanding model mistakes with human annotations

Jupyter Notebook 105 6 Updated Feb 22, 2023

Recent LLM-based CV and related works. Welcome to comment/contribute!

831 35 Updated Jun 5, 2024
Next