Skip to content
View hsiangyuzhao's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report hsiangyuzhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,058 308 Updated Jan 22, 2024

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 643 46 Updated Apr 9, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,514 700 Updated Jul 4, 2024

Awesome speech/audio LLMs, representation learning, and codec models

500 24 Updated May 29, 2024

哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。

C# 19,907 2,203 Updated Feb 8, 2024
Python 1,119 60 Updated Jul 7, 2024

CVPR 2024 论文和开源项目合集

17,135 2,543 Updated Jul 4, 2024

Connected components on discrete and continuous multilabel 3D & 2D images. Handles 26, 18, and 6 connected variants; periodic boundaries (4, 8, & 6)

C++ 343 41 Updated Jul 1, 2024
Python 5,364 1,638 Updated Jul 8, 2024

本人的科研经验

4,822 299 Updated Jun 1, 2024

[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.

Python 521 58 Updated May 31, 2024

[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes plus nine classes)

Python 179 12 Updated Jul 7, 2024

A repository for research on medium sized language models.

Python 344 45 Updated Jul 4, 2024

A Gradio web UI for Large Language Models.

Python 38,251 5,072 Updated Jul 8, 2024

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 77,543 5,856 Updated Jul 9, 2024

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 6,732 1,038 Updated Mar 15, 2024

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 7,706 2,526 Updated Jun 28, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,774 3,962 Updated Jul 9, 2024

Tool for robust segmentation of >100 important anatomical structures in CT and MR images

Python 1,295 215 Updated Jul 3, 2024

Official implementation of SAM-Med2D

Jupyter Notebook 795 76 Updated Jun 18, 2024

Curated papers on Large Language Models in Healthcare and Medical domain

158 14 Updated Jul 8, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,009 565 Updated Jul 6, 2024

PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.

Python 153 11 Updated Mar 21, 2024

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,296 148 Updated Jun 9, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 23,985 4,937 Updated Jul 9, 2024

atss的Pytorch实现,支持多卡分布式训练

Python 16 4 Updated Jan 3, 2021

End-to-End Object Detection with Transformers

Python 13,105 2,375 Updated Mar 12, 2024
Python 955 125 Updated Oct 3, 2022
Next