Ayush Kumar Shah ayushkumarshah

🏠

Working from home

Research Assistant | Visual parsing of graphical notations | Mathematical formula and Chemical diagram recognition

109 followers · 90 following

Rochester Institute of Technology
Rochester, New York
https://shahayush.com
@ayushkumarshah7

Highlights

Developer Program Member
Pro

ayushkumarshah/README.md

Hi there 👋, I am Ayush Kumar Shah, an AI enthusiast.

🔭 I’m a fifth-year Ph.D. candidate at Rochester Institute of Technology (RIT), conducting research at the Document and Pattern Recognition Lab (DPRL), under the mentorship of Dr. Richard Zanibbi.
💡 My work centers on designing fast, efficient, and interpretable parsers for recognizing complex mathematical and chemical formulas. I explore graphical notations across multiple formats, including PDFs, typeset images, and handwritten strokes. Using graph attention-based techniques, I aim to improve how contextual information is processed while preserving a natural and interpretable graph representation.
🎯 My goal is to deliver high accuracy in formula recognition through models that are not only faster but also easier to interpret than traditional encoder-decoder architectures.
💻 Recently, I developed ChemScraper, a molecule diagram parser that extracts characters and graphics directly from PDF molecule images. By utilizing typesetting instructions and simple graph transformations, it generates both visual and chemical graphs without the need for OCR, GPUs, or vectorization. ChemScraper offers a practical way to create fine-grained, annotated datasets for training visual parsers, and it also includes a visual parser that parses raster molecule images directly.
🌐 Research interests: Pattern recognition, recognition of graphical structures, computer vision, speaker understanding, large language models, multi-modal deep learning, natural language processing.
✍️ I write blog posts about what I learn, mostly related to Python and AI.
🌱 I’m currently learning fundamental concepts and advancements in recognition (parsing) of graphical information from documents.
📃 You can view my CV here: My CV
Personal website: shahayush.com

My latest Blog posts

My latest YouTube Videos

Some additional pinned repositories

Guitar-Chords-recognition (Public)

An application that predicts guitar chords by feeding mel spectrograms of guitar audio into a CNN.

Python · 119 stars · 42 forks
autocar (Public)

A self-driving car that detects lanes, stop signs, and traffic lights and avoids collisions, built using Canny edge detection, the Hough transform, a Haar cascade classifier, and Arduino programming.

Python · 5 stars · 1 fork
AI-Plays-GTA5 (Public)

A bike-riding agent in a virtual environment (GTA5), built using a CNN and used for simulating self-driving vehicles.

Python · 7 stars
Deep-Learning-Nanodegree-Udacity (Public)

This repository contains all the projects I submitted while completing the Deep Learning Nanodegree offered by Udacity.

Jupyter Notebook · 2 stars · 1 fork
Nepali_Plagiarism_Detection (Public)

An application that detects plagiarised Devanagari text files using a self-built, rule-based stemming algorithm and cosine similarity.

Jupyter Notebook · 6 stars · 3 forks
SLR-Parser (Public)

An SLR parser that constructs the canonical collection of LR(0) items and the SLR parsing table, and parses a given input string.

Python · 5 stars · 2 forks