Skip to content

Latest commit

 

History

History
 
 

keypointnet

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

KeypointNet

This is an implementation of the keypoint network proposed in "Discovery of Latent 3D Keypoints via End-to-end Geometric Reasoning [pdf]". Given a single 2D image of a known class, this network can predict a set of 3D keypoints that are consistent across viewing angles of the same object and across object instances. These keypoints and their detectors are discovered and learned automatically without keypoint location supervision [demo].

Datasets:

ShapeNet's rendering for Cars, Planes, Chairs.

Each set contains:

  1. tfrecords
  2. train.txt, a list of tfrecords used for training.
  3. dev.txt, a list of tfrecords used for validation.
  4. test.txt, a list of tfrecords used for testing.
  5. projection.txt, storing the global 4x4 camera projection matrix.
  6. job.txt, storing ShapeNet's object IDs in each tfrecord.

Training:

Run main.py --model_dir=MODEL_DIR --dset=DSET

where MODEL_DIR is a folder for storing model checkpoints: (see tf.estimator), and DSET should point to the folder containing tfrecords (download above).

Inference:

Run main.py --model_dir=MODEL_DIR --input=INPUT --predict

where MODEL_DIR is the model checkpoint folder, and INPUT is a folder containing png or jpeg test images. We trained the network using the total batch size of 256 (8 x 32 replicas). You may have to tune the learning rate if your batch size is different.

Code credit:

Supasorn Suwajanakorn

Contact:

[email protected], [snavely,tompson,mnorouzi]@google.com

(This is not an officially supported Google product)