GitHub - adi2809/SimpleScanner: assignment one of robotics technology course at indian institute of technology delhi for the fall semester 2021-22

SimpleScanner

this is the submission to the assignment one of JRL302-Robotics-Technology at Indian Institute of Technology Delhi for the semester one 2021-2022

project dependencies : python, open-cv, numpy, argparse.

brief description of programme files :

input_output.py - written for implementing the reading of images, text files from specified path.
compute_homography.py - written for implementing the computation of perspective transform from source coordinates to destination coordinates.
post_processing.py - written for implementing histogram equalisation/ whitebalance for the image (added functionality not integrated in main)
main.py - written to run the app using a simple terminal call which will be explained in the instructions below for running the programme.

logic flow behind the implementation :

-the method is based on the idea suggested in this mathematics stack-exchange post ( https://math.stackexchange.com/a/1289595/613549 ).

the basic idea of the implementation lies in how to solve the linear system of equations to find the 8-unknown paramters of the projective matrix namely h11, h12, h13, h21, h22, h23, h31, h32 ( we can take h33 = 1).

instructions for running the programme in the terminal :

python/ python3 main.py --image [path of image file] --coords [path of coordinates file in form of a .txt]

image file can be a .png, .jpg or other formats that can be read by the opencv-python library's imread()
in the coords.txt file the following format is to be followed:
- top_left_x,top_left_y
- top_right_x,top_right_y
- btm_right_x,btm_right_y
- btm_left_x,btm_left_y
files in the test_images folder :
- test_0.png, the image on which we tested the programme.
files in the test_coords folder :
- coords_0.txt the coordinates of the four points for the perspective transform in the order specified above for the test_0.img.

right now we need to manually determine the four points (using GIMP v 2.0) but work can be further improved by :

using a GUI - based drag service for the user (implement using tkinter).
using opencv - based automatic detection of the countour and the largest feasible rectangle.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
test_coords		test_coords
test_imgs		test_imgs
LICENSE		LICENSE
README.md		README.md
compute_homography.py		compute_homography.py
input_output.py		input_output.py
main.py		main.py
post_processing.py		post_processing.py
result.png		result.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimpleScanner

About

Releases

Packages

Languages

License

adi2809/SimpleScanner

Folders and files

Latest commit

History

Repository files navigation

SimpleScanner

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages