Block or Report
Block or report KatameRonin
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
Loading Information from a Database into Unity
Repository for the paper "Predicting Popularity of Images Over 30 Days", ICIP 2020 Image Popularity Prediction Challenge, 3rd Place
Repository for the paper "Efficient Detection of Lesions During Endoscopy", ICPR International Workshops and Challenges 2021
Visualizing the Spotify Soundscape through Spotify's top 50 songs of 2023
Collection of AWESOME vision-language models for vision tasks
[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
Awesome-LLM: a curated list of Large Language Model
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).
Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22
awesome grounding: A curated list of research papers in visual grounding
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Code Release for the paper Segmentation Grounded Scene Graph Generation
[ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
IPython Parallel: Interactive Parallel Computing in Python
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch