- UNC Chapel Hill
- Chapel Hill, NC
- https://j-min.io
- @jmin__cho
Stars
LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.
Official implementation of SEED-LLaMA (ICLR 2024).
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
OCR, layout analysis, reading order, line detection in 90+ languages
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gra…
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Collection of notebook guides created by the Brev.dev team!
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Open reproduction of MUSE for fast text2image generation.
A Jupyter widget for annotating images with bounding boxes
An unofficial Python package that returns responses from Google Bard using a cookie value.