Skip to content
/ IG-LLM Public

Code for "Re-Thinking Inverse Graphics With Large Language Models"

Notifications You must be signed in to change notification settings

kulits/IG-LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Re-Thinking Inverse Graphics With Large Language Models

Peter Kulits*, Haiwen Feng*, Weiyang Liu, Victoria Abrevaya, Michael J. Black

[Project Page]

Data and code coming soon.

Summary

We present the Inverse-Graphics Large Language Model (IG-LLM) framework, a general approach to solving inverse-graphics problems. We instruction-tune an LLM to decode a visual (CLIP) embedding into graphics code that can be used to reproduce the observed scene using a standard graphics engine. Leveraging the broad reasoning abilities of LLMs, we demonstrate that our framework exhibits natural generalization across a variety of distribution shifts without the use of special inductive biases.

image

About

Code for "Re-Thinking Inverse Graphics With Large Language Models"

Resources

Stars

Watchers

Forks