Skip to content

Sort images in a directory based on similarity features and dedup

License

Notifications You must be signed in to change notification settings

dashesy/imagesorter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

imagesorter

Sort images in a folder based on similarity features. The default featurizer is 'mobilenet+resnet18' hybrid, which performs the best combining OD and Classification features.

Usage

python sort.py /path/to/input/directory/ /path/to/output/directory/

To use different featurizer

python sort.py /path/to/input/directory /path/to/output/directory/ --featurizer mobilenet

Decrease the dedup threshold to deduplicate more

python sort.py /path/to/input/directory /path/to/output/directory/ --dedup 0.99

TODO

  1. Paralelize and cache to disk to handle large number of images. (Currently fine up to few hundreds)
  2. Add option to minimize the cost function (max sum path) for maximum similarity sort. Can start from current sort as a good initial condition.

About

Sort images in a directory based on similarity features and dedup

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages