-
Shanghai Jiao Tong University
- Shanghai
-
18:45
(UTC +08:00) - hsiangyuzhao.github.io
Highlights
- Pro
Block or Report
Block or report hsiangyuzhao
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
✨✨Latest Advances on Multimodal Large Language Models
Awesome speech/audio LLMs, representation learning, and codec models
哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。
Connected components on discrete and continuous multilabel 3D & 2D images. Handles 26, 18, and 6 connected variants; periodic boundaries (4, 8, & 6)
[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.
[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes plus nine classes)
A repository for research on medium sized language models.
A Gradio web UI for Large Language Models.
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
Official implementation of SAM-Med2D
Curated papers on Large Language Models in Healthcare and Medical domain
Hackable and optimized Transformers building blocks, supporting a composable construction.
PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
End-to-End Object Detection with Transformers