The "SAVI project - Where's my coffee mug?" implements an advanced perception system that processes information collected from 3D sensors and conventional cameras. The goal is to extract objects from a generated point cloud and using them to train a neural network classifier. This classifier will then be able to tell what the object is.
This project is the second assignment of SAVI (Advanced Industrial Vision Systems), a curricular unit of the Master's degree in Mechanical Engineering at the University of Aveiro. It aimed to teach the basics of 3D point cloud understanding and processing, as well as the use of classifiers and their integration into a single system. The main objective was to recognize objects identified in the point cloud using the "Washington RGB-D Dataset".
This project uses Open3D for point cloud processing of the dataset, OpenCV for image processing and feature extraction, and PyTorch for training a deep neural network classifier able to recognize the objects.
The following software must be installed before use:
- Open3D
- OpenCV
- PyTorch
- Pickle
- Matplotlib
- gTTS
A network connection is also required.
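One possible way to install the Python dependencies is with pip (package names assumed to match the ones published on PyPI; Pickle already ships with the Python standard library):
pip install open3d opencv-python torch torchvision gtts matplotlib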
For point cloud and image generation, this program uses the Washington RGB-D Dataset.
You can use the following command to download the program:
git clone https://github.com/joaodmatias/SaviProject2.git
To run the program, start by moving to the directory where you cloned the repository. Once there, you can use:
./main.py -h
to get help on the available options, including how to provide a path to run different scenarios. You can then use:
./main.py -p DATASET_PATH
replacing "DATASET_PATH" with the path to the scenario you want to run. If no scenario is chosen, a preset scenario will run.
- Classification of different objects
  - using ICP (see the ICP sketch after this list)
  - using volume
  - using dimensions
  - using shape
- 3D dataset processing (see the scene-processing sketch after this list)
  - finding the table
  - processing the items on the table
  - automation for all types of table
  - handling items on the ground (2 planes of comparison)
- Extracting information from the point cloud, such as:
  - color
  - dimensions
  - volume
  - orientation
- Audio processing (see the gTTS sketch after this list)
- Classifier (see the PyTorch training sketch after this list):
  - training
  - testing
  - implementation
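As a rough illustration of the scene-processing step, the sketch below shows how a table plane can be found and the objects on it clustered with Open3D. The file name, thresholds, and variable names are assumptions for illustration, not the project's actual code.

```python
import numpy as np
import open3d as o3d

# Load one scene from the dataset (hypothetical file name).
scene = o3d.io.read_point_cloud("scene.pcd")

# Find the dominant plane (assumed to be the table) with RANSAC.
plane_model, inliers = scene.segment_plane(distance_threshold=0.02,
                                           ransac_n=3,
                                           num_iterations=1000)
table = scene.select_by_index(inliers)
rest = scene.select_by_index(inliers, invert=True)

# Cluster the remaining points; each cluster is a candidate object.
# (The real pipeline would also crop to the region above the table.)
labels = np.array(rest.cluster_dbscan(eps=0.03, min_points=100))
objects = []
for label in range(labels.max() + 1):
    idx = np.where(labels == label)[0]
    objects.append(rest.select_by_index(idx))

print(f"Found {len(objects)} candidate objects around the table")
```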
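The ICP-based classification mentioned above can be sketched as follows: each extracted object is registered against a set of reference models and the class with the best fitness wins. The reference file names and the distance threshold are assumptions.

```python
import numpy as np
import open3d as o3d

def icp_fitness(source, target, threshold=0.01):
    """Register source onto target and return the ICP fitness score."""
    result = o3d.pipelines.registration.registration_icp(
        source, target, threshold, np.identity(4),
        o3d.pipelines.registration.TransformationEstimationPointToPoint())
    return result.fitness

# Hypothetical reference models, one per known class.
references = {
    "coffee_mug": o3d.io.read_point_cloud("models/coffee_mug.pcd"),
    "bowl": o3d.io.read_point_cloud("models/bowl.pcd"),
}

obj = o3d.io.read_point_cloud("object_0.pcd")  # an extracted object
best = max(references, key=lambda name: icp_fitness(obj, references[name]))
print("ICP match:", best)
```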
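The classifier itself is a PyTorch network trained on the extracted object images. The sketch below shows a minimal training loop for a small CNN; the architecture, the 64x64 input size, and the dummy data are placeholders, not the network actually used in the project.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Tiny CNN over 64x64 RGB crops; the real network may differ.
class ObjectClassifier(nn.Module):
    def __init__(self, num_classes):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.head = nn.Linear(32 * 16 * 16, num_classes)

    def forward(self, x):
        return self.head(self.features(x).flatten(1))

# Dummy tensors stand in for the Washington RGB-D object crops.
images = torch.randn(64, 3, 64, 64)
labels = torch.randint(0, 5, (64,))
loader = DataLoader(TensorDataset(images, labels), batch_size=16, shuffle=True)

model = ObjectClassifier(num_classes=5)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

for epoch in range(2):
    for batch_images, batch_labels in loader:
        optimizer.zero_grad()
        loss = criterion(model(batch_images), batch_labels)
        loss.backward()
        optimizer.step()
    print(f"epoch {epoch}: loss {loss.item():.3f}")
```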
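For the audio feedback, the project lists gTTS as a dependency, which is why a network connection is required. A minimal sketch of announcing a result could look like this; the sentence and file name are made up.

```python
from gtts import gTTS

# gTTS queries Google's text-to-speech service, so it needs internet access.
speech = gTTS(text="The object on the table is a coffee mug", lang="en")
speech.save("announcement.mp3")  # play the file with any audio player
```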
The color information appears in the terminal where the program is run, both as an approximation to the CSS 2.1 list of named colors and as the actual RGB value.
The dimensions appear as a tuple (width, height), in meters.
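A minimal sketch of how such values could be produced from an extracted object with Open3D is shown below. The small CSS 2.1 color table, the file name, and the axis-to-dimension mapping are illustrative assumptions, not the project's exact implementation.

```python
import numpy as np
import open3d as o3d

# A few CSS 2.1 named colors used for the nearest-color approximation.
CSS21_COLORS = {
    "white": (255, 255, 255), "black": (0, 0, 0), "red": (255, 0, 0),
    "lime": (0, 255, 0), "blue": (0, 0, 255), "yellow": (255, 255, 0),
    "gray": (128, 128, 128), "orange": (255, 165, 0),
}

obj = o3d.io.read_point_cloud("object_0.pcd")  # an extracted object

# Average RGB of the object's points (Open3D stores colors in [0, 1]).
rgb = (np.asarray(obj.colors).mean(axis=0) * 255).astype(int)
name = min(CSS21_COLORS,
           key=lambda n: np.linalg.norm(np.array(CSS21_COLORS[n]) - rgb))

# Dimensions from the axis-aligned bounding box, in meters.
# Which axes map to width and height depends on the scene orientation.
extent = obj.get_axis_aligned_bounding_box().get_extent()
width, height = extent[0], extent[2]

print(f"color: {name} {tuple(rgb)}  dimensions: ({width:.3f}, {height:.3f}) m")
```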
Images of the objects extracted from the dataset were used to train the classifier.
- @jotadateta - [email protected] 93366
- @joaodmatias - [email protected] 93098
- @joaodrc - [email protected] 93439