A master repository containing all the SME issued material for the Integrated Robotics and Applied Computer Vision course of PESU IO Slot 14 (2022). Images from the best submissions will be featured on this README.md
-
Day 1 is dedicated purely for set up. Your performance in the course will be evaluated for points from day 2.
-
All submissions from then will be graded on a normal distribution.
-
Options will usually be given for each tasks defining a seperate one for exemplary performance. Completion of the tougher option of tasks each day (if applicable) will be awarded a perfect score.
-
Task submissions where the code for the easier option (if applicable) is not working will be given a zero. This along with the highest score will define the upper and lower bounds of the bell curve. Don't worry about the number on your marks, this is just a temporary metric to determine your final grade.
-
Points scored on the first half of the tasks given will be aggregated as your final score for Assignment 1 (25 points). The same will be done for Assignment 2 with the second half (25 points). This will constitute your in-course evaluation of 50 points.
-
The final 50 points will be awarded based on your performance in the final project presentations, which will be evaluated for relevance to the topic (5), concept clarity (10), pitch appeal and presentation skill (10), technical complexity (10), and novelty (15).
-
The final score will be calculated as the sum of the two assignments and the final project presentation. This is where we'll apply the bell curve to classify the students into three grade classes : C (completed satisfactorily), D (completed with distinction), and F (fail).
-
The fail bar (F) is set at a final score of 50/100. The distinction bar (D) is set at two standard deviations from the mean of the remainder (subject to change at the discretion of PESU IO management). The rest of the students will be awarded a C.
-
Unofficially, there's a surprise in store for anyone scoring above three standard deviations from the mean from the side of the SMEs. Getting to this point truly demonstrates your dedication and passion for the subject and we'd like to make sure your efforts get the recognition they deserve.
-
Don't worry about the exact number of your final score. It's nothing more than a tool to determine which grade bracket you fall into.
- Introduction to Robotics and Computer Vision
- Introduction to Git, creating and cloning a repository
- Introduction to OpenCV
- Introduction to ROS
- Software installations
- ./day1
- OpenCV documentation : https://docs.opencv.org/4.x/
- ROS documentation : https://wiki.ros.org/Documentation
- Git cheat sheet : https://education.github.com/git-cheat-sheet-education.pdf
- For your first task, you'll be required to follow the attached youtube video word for word to dual boot your computers to run Ubuntu 20.04. Be thorough, take backups if you want and do not screw this up because it will cost you more than just points.
Video link : https://www.youtube.com/watch?v=lGR_VNwUfzk&t=807s Torrent download link for the ubuntu-20.04.5-desktop-amd64.iso disk image : https://releases.ubuntu.com/20.04/ubuntu-20.04.5-desktop-amd64.iso.torrent
For once you have your Ubuntu system up and running, I've written a shell executable that will -
Install ROS Initialize your catkin workspace Install and initialize the ROS packages you'll need later in the course Install all of your python dependencies Install VSCode on your systems and update & reboot your computer.
Do not modify this executable. I've tested it more than once and can assure you that it will definitely work on a properly installed Ubuntu 20.04 system.
Connect your system to your home internet and follow these instructions to the dot to execute the script -
- Download and save the file under /home/your username/
- Navigate to /home/your username/ on the terminal
- Execute 'sudo chmod 777 IRACV-IO-setup' on the terminal
- Execute 'ls' on the terminal
- Verify that 'IRACV-IO-setup' appears in green on your terminal. Otherwise, redo step 3.
- Execute './IRACV-IO-setup >> IRACV-IO-setup-output.txt'
- Be on the lookout for user prompts during execution
Mail your 'IRACV-IO-setup-output.txt' (located under /home//) as an attachment to [email protected] and [email protected] by 14:30, November 1st, 2022.
This text file will tell us exactly what installation errors you guys have run into and will determine whether or not we will require an online session to resolve them.
This assignment will take anywhere between 20 to 45 minutes depending on how carefully you're following the instructions.
The ability to read thoroughly and debug system errors is an essential skill in robotics. You will not survive without it. Give every error it's due time for resolution. In case you run into them, post them on the group and hep each other out.
Good luck :)
- For your second task, you'll be required to create a repository on your github account and clone it to your local machine.
You'll then be required to make a commit (any) and push it to your remote repository.
There's no deadline or mandatory requirement for this task. You can do it anytime you want. Show us your work in class for extra points.
Code :
- ./day1/IRACV-IO-setup : The shell script for the first task
- ./day1/opencvFeedLoading.py : A python script that loads a video feed from your webcam and displays it on your screen
- Kernels and their usage in dilation & erosion.
- Grayscaling, gaussian blur & the reasons to use them. [HANDS ON]
- Matrix rotation transform demonstrated using warp perspective.
- Inserting shapes & text (to emphasize the visual attractiveness of computer vision software) [HANDS ON]
- Matrix stretch transform demonstrated by image slicing & resizing
- HSV color space & cylindrical representation demonstrated by color detection & masking.
- Laplacian operator in HSV space demonstrated by Canny edge detection.
- Contour detection
- ROS topics overview, publisher and subscriber
- ./day2
-
For your first task, you'll be required to detect the number of sides on all the polygons given in the picture ./day2/taskResources/shapes.JPG and display them appropriately on the screen.
-
Alternatively, if you think the first task is too simple for you, you can download the .mp4 file at ./day2/taskResources/test.mp4 of a track recorded from the perspective of an actual robot. Use the image processing techniques learned in this class to indicate which direction the robot must turn to at any given point in time. Completing this successfully will guarantee you a perfect score for this task and potentially, all of assignment 1.
Submit either of these in the form of a .mp4 file recording the execution and email it to the SMEs.
Good luck :)
- What is a neuron?
- What is a neural network?
- A conversation on summation in NNs
- A conversation on gradient descent
- A conversation on activation function
- A conversation on forward propagation & backpropagation
- ROS services
- Creating custom messages
- Introduction to TurtleSim
- ./day3
- UE20CS302 - Machine Intelligence course (PESU)
$ cd ~/catkin_ws cd src git clone -b ros-noetic https://github.com/aniskoubaa/ros_essentials_cpp.git $cd .. $catkin_make $source devel/setup.bash new terminal $sudo apt install python-is-python3 To test: python
- Develop a publisher and subscriber to display a message of your choice. Expand the system such that you create at least two publishers and three subscribers and establish topics to pass messages between them.
- For your second task, you'll be required to complete this kaggle notebook to identify handwritten digits using an ANN by adding visualizations for training & validation loss and error.
https://www.kaggle.com/code/rupeshs/mnist-baseline-without-cnn/notebook
Alternatively, you could also write a CNN based image classifier for the following dataset -
https://www.kaggle.com/datasets/ivanfel/honey-bee-pollen
Push your fully executed notebook to your github repository and email the link to the SMEs. Make sure your repository is public.
This task is optional but completion with the previous one will guarantee you a perfect score for this task and potentially, all of assignment 1.
Good luck :)
- TurtleSim control with nodes
- A brief conversation about convolutions in neural networks (as a break from the robotic systems)
- Using teleop and nodes for motion control
- ./day4 -ros_essentials_cpp package -cleaner.py -move.py
- Develop a node to control the turtlesim bot and make it move in a specific shape, such as a rectangle, square, circle etc. The complexity of the design will be proportional to the score obtained for the task.
- What is transfer learning in the context of neural networks?
- A case study of MobileNetV2
- Booting up an AWS EC2 instance [HANDS ON]
- SSH controls
- Building pipelines
- Error Control, Rise time, Peak overshoot
- P, PI, PD and PID controllers
- PID tuning using turtlesim
-
./day5
-
A case study for MobileNet v2 applied to classify fish species : https://github.com/SIH-ClapForKrishna/fish-classifier-tensorflow/blob/main/fish-classifier.ipynb
-
Creating an AWS EC2 instance : https://www.javatpoint.com/aws-ec2-creating-an-instance
-
Connecting to an AWS EC2 instance using SSH : https://www.clickittech.com/aws/connect-ec2-instance-using-ssh/
-
Copying directories to your remote system using scp : https://medium.com/srcecde/copy-file-directory-to-from-ec2-using-scp-secure-copy-685c46636399
-
Connection ...
ssh -i file.pem username@ip-address
- https://www.youtube.com/watch?v=5Y2X7HNmk4s - Controlling a spaceship with PID
- https://robotics.stackexchange.com/questions/9786/how-do-the-pid-parameters-kp-ki-and-kd-affect-the-heading-of-a-differential - PID Controller concepts
- https://github.com/jellevos/simple-ros-pid/blob/master/simple_pid/PID.py - PID implementation -https://modernrobotics.northwestern.edu/nu-gm-book-resource/foundations-of-robot-motion/ An amazing northwestern university explanation -https://www.youtube.com/watch?v=zOByx3Izf5U - An amazing implementation of PID in software by Phil's Lab
-
Develop a P controller to make the turtlesim move in a straight line to a goal.
-
Develop a P controller to make the turtlesim turn.
-
Develop a PID controller to do the same.
-Segway into mediapipe as the deep learning section of computer vision
- Introduction to mediapipe with neural networks & convolution at a high level.
- Emphasis on mediapipe as a collection of ML solutions and not models (hence the light processing load), mention of bounding box classifiers.
- Mediapipe documentation review
- Landmarks
- Hand tracking module [HANDS ON]
- Using PID to control motion
- Deploying a ROS node to control motion in a pre-set manner while integrating PID controllers [HANDS ON]
- Hector quadrotor understanding -Gazebo Setup
- Using RViz to create visualizations in Gazebo [HANDS ON]
- ./day6
- https://www.youtube.com/watch?v=qJ9v1fITeVk - The Construct; Hector Quadrotor installation https://github.com/tu-darmstadt-ros-pkg/hector_quadrotor/tree/catkin - Hector Quadrotor package https://github.com/RAFALAMAO/hector-quadrotor-noetic - Package that is to be installed for hector quadrotor
1.For your first task, write a program to count the number of fingers you hold up against the camera and display the number on the screen.
-
If you wish, you can expand on the previous task to detect whether or not you're holding up rock, paper or scissors, generate a random equivalent from the computer's side and display the winner on the screen. This will be the criterion for a guaranteed perfect score for this task.
-
Writing a node to make the quadrotor move up, down, left, right and rotate
- Running a CV algorithm on a ROS node
- Implementing PID in hector quadrotor
- ./day7
-
Spawn a turtlebot.
-
Spawn a hector quadrotor
-
Demonstrate coke can detection on Gazebo using the quadrotor
-
PID tuning in hector quadrotor
-
Integration of the motion and vision stacks
- N/A. Time to focus on the projects.
- N/A
- Have an execution plan for the project.
- Have a codebase ready for the same.
- Have a specific set of targets to achieve to define the project’s completion.
- Complete 65% of the project with respect to the total number of lines of code.
- N/A. Time to focus on the projects.
- N/A
- Students present their final projects
- All projects must be evaluated by the SME
- Simulate a quadrotor picking up and placing a carboard box from point to point using any environment and package of your choosing. Successful completion of this statement will most definitely get you into the three sigma bracket mentioned in the grading methodology section and guarantees a distinction grade.