-
Tongji University
- Shanghai
- https://mic.tongji.edu.cn/main.htm
Starred repositories
image scene graph generation benchmark
Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22
A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
A Python wrapper for the Visual Genome API
Hands-on medical knowledge graph construction: crawls Baidu Baike for data, stores structured triples in MongoDB, and builds and visualizes the knowledge graph with Neo4j; Medical Knowledge Graph; Crawler; neo4j
A Chinese medical ChatGPT based on LLaMA, trained on a large-scale pretraining corpus and a multi-turn dialogue dataset.
Medical NLP: competitions, datasets, large models, and papers
A tutorial and implementation of a disease-centered medical knowledge graph and a QA system built on it. Constructs a medium-scale, disease-centered medical knowledge graph and provides automatic question answering and analysis services on top of it.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)
Code for Navigating Connected Memories with a Task-oriented Dialog System
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
The official GitHub repo of Think-on-Graph. If you are interested in our work or would like to join our research team in Shenzhen, please feel free to contact us by email ([email protected])
Implementation of our IJCAI 2022 oral paper, "ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning".
Pre-trained Multimodal Large Language Model Enhances Dermatological Diagnosis using SkinGPT-4
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
[Paper][Preprint 2024] MyGO: Discrete Modality Information as Fine-Grained Tokens for Multi-modal Knowledge Graph Completion
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
[TIP 2023] The code of "Plug-and-Play Regulators for Image-Text Matching"