Skip to content
View Scarecrow0's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ShanghaiTech University @SHTUPLUS
  • Shanghai China

Highlights

  • Pro
Block or Report

Block or report Scarecrow0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.

Python 149 1 Updated Mar 16, 2024

CVPR2023 : VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud

Python 58 12 Updated Jul 9, 2024
Python 550 27 Updated Feb 15, 2024

4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)

Python 83 1 Updated May 17, 2024
2 Updated May 20, 2024

A curated list of foundation models for vision and language tasks

720 32 Updated Aug 14, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,155 264 Updated Aug 14, 2024

Multimodal Models in Real World

Jupyter Notebook 353 17 Updated Jul 12, 2024

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

8 Updated Jun 27, 2024

Code release of Video2Game

JavaScript 291 21 Updated Apr 25, 2024

Official implementation of 'Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields'

Python 115 5 Updated Feb 8, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,130 279 Updated May 4, 2024

A curated list of papers and open-source resources focused on 3D AIGC.

254 15 Updated Jul 26, 2024

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,144 315 Updated Jan 22, 2024

[CVPR 2024] SceneWiz3D: Towards Text-guided 3D Scene Composition

91 3 Updated May 4, 2024

A curated list of awesome AIGC 3D papers

487 18 Updated Aug 8, 2024
Python 57 6 Updated Mar 29, 2019

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Python 496 36 Updated Jan 27, 2024
Python 259 42 Updated Aug 14, 2024

Official implementation for CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Python 40 4 Updated Nov 7, 2023

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,659 241 Updated Jun 4, 2024

The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>

316 29 Updated Apr 25, 2024
17 Updated Oct 22, 2023

Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph Generation, ICCV2023

Python 6 1 Updated Feb 27, 2024

[ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation

Python 28 2 Updated Jan 25, 2024

This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World" (Accepted by ICCV 2023)

Python 40 5 Updated Mar 12, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

Python 585 29 Updated Jun 17, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,068 432 Updated Aug 14, 2024

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

875 46 Updated Apr 5, 2024

李跳跳APK包备份

2,078 579 Updated Aug 31, 2023
Next