“Working with Aditya has been a great experience. While he was quick to learn the business requirements to come up with better DS/ML solutions, I picked up most of what the nitty gritties of AI from him. At his young age, his fundamental understanding of mathematics and Data Science, his ability to come up alternative approaches and an entirely self-driven work ethic is a joy to watch. I'm sure he will achieve great heights in his career that will make all us at Cutshort proud of him. :)”
About
A skilled AI/ML Software Engineering , I have deep experience in Machine Learning…
Activity
-
I have been working on creating a game using Google Maps new 3D maps for a Hackathon but as soon as we got the maps working me and my friends spent…
I have been working on creating a game using Google Maps new 3D maps for a Hackathon but as soon as we got the maps working me and my friends spent…
Liked by Aditya Malte
-
Nvidia tennis teams just finished very successful season in BAITL (Bay Area Industrial Tennis League). More than 50 Nvidians participated. We took…
Nvidia tennis teams just finished very successful season in BAITL (Bay Area Industrial Tennis League). More than 50 Nvidians participated. We took…
Liked by Aditya Malte
-
I’m happy to share that I’m starting a new position as Summer Analyst at Goldman Sachs!
I’m happy to share that I’m starting a new position as Summer Analyst at Goldman Sachs!
Liked by Aditya Malte
Experience
Education
Licenses & Certifications
Volunteer Experience
Publications
-
Tiny Data Specialists Through Domain-Specific Pre-training On Code-Mixed Data
COLING 2020 (SemEval)
-
Effective Distributed Representation of Code-Mixed Text
16th IEEE India Council International Conference (INDICON)
This research involved getting high quality word representations for noisy code mixed multilingual text. We started right from scraping large amount of data using Twitter, training fastText on it. Finally implementing a very light classifier model that achieved state-of-the-art performance on a benchmark dataset. Furthermore we also explain the predictions as part of Explainable AI.
Other authorsSee publication -
Evolution of Transfer Learning in Natural Language Processing
Arxiv
In this paper we discuss recent advances in detail, pertaining to transfer learning in NLP(BERT, ELMo, XLNet, etc.).
This paper has been written such that even readers with introductory knowledge in NLP would be able to understand it, while still being a profound survey paper for knowledgeable persons.Other authorsSee publication -
Multilingual Cyberabuse Detection using Advanced Transformer Architecture
31st IEEE Asia Pacific (R10) International Conference (TENCON)
We implemented a deep learning model that achieved state-of-the-art performance on Hindi/English Cyberabuse Detection task.
Other authorsSee publication
Courses
-
Algorithms
CSCI 570
-
Algorithms
-
-
Computer Networks
-
-
Computer Organization and Architecture
-
-
Data Mining and Warehousing
-
-
Data Structures
-
-
Database Management Systems
-
-
Databases
CSCI 585
-
Machine Learning
DSCI 552
-
Natural Language Processing
CSCI 544
-
Object Oriented Programming
-
Projects
-
A Deep Learning Approach to Highly Dense Crowd Counting for Disaster Management
(COEP Mindspark Hackathon Winning Project with over 50 stars on GitHub)
Made the first keras implementation of a novel deep learning architecture - CSRnet (Y.Li et.al. CVPR'18) that outputs a crowd density map corresponding to an input image, and hence deduces the crowd count.
This was then deployed on the Android platform using TFlite, along with a Django admin control server. The combined Android-Web-ML package can be used for effective disaster management and prevention in case…(COEP Mindspark Hackathon Winning Project with over 50 stars on GitHub)
Made the first keras implementation of a novel deep learning architecture - CSRnet (Y.Li et.al. CVPR'18) that outputs a crowd density map corresponding to an input image, and hence deduces the crowd count.
This was then deployed on the Android platform using TFlite, along with a Django admin control server. The combined Android-Web-ML package can be used for effective disaster management and prevention in case of mass gatherings (eg. Kumbh Mela).Other creatorsSee project -
Google stock price prediction using LSTM, a time series problem
Implemented a Long Short Term Memory neural network for the price prediction of the next day's Google stocks based on historical data.
Techniques and technologies used-
LSTM Neural Networks, Regularization, Dropout, Tensorflow. -
Effective Distribution of Code-Mixed text
-
We tackled three problems while tackling Multilingual Code-Mixed text:
1) Lack of a large-scale corpus
2) Lack of effective sub-word based distributed representation.
3) Lack of Agile and Explainable models for code-mixed NLP task.
We tackled all these problems. Achieving state-of-the-art performance on a benchmark dataset using a much lighter and explainable architecture compared to previous heavy ensemble NN based solutions.
(presented at the IEEE IndiCon 2019)Other creatorsSee project -
XLNet on Colab TPU
-
XLNet, introduced by Google Brain and CMU researchers is a state-of-the-art(SOTA) deep learning architecture for natural language processing.
However, it could not be trained by developers without access to very high processing power.
I have thus made the first(and only) modified repo and Notebook that successfully added Colab TPU support to allow eager developers to train their model using the freely available Colab TPUs.
Currently collaborating with Zhilin Yang(Google…XLNet, introduced by Google Brain and CMU researchers is a state-of-the-art(SOTA) deep learning architecture for natural language processing.
However, it could not be trained by developers without access to very high processing power.
I have thus made the first(and only) modified repo and Notebook that successfully added Colab TPU support to allow eager developers to train their model using the freely available Colab TPUs.
Currently collaborating with Zhilin Yang(Google Brain/Carnegie Mellon University) to integrate the same with his repository over GitHub. -
Deep Learning - Multilingual Cyberabuse Detection(presented at IEEE Tencon 2019)
-
Made a Multilingual Cyberabuse Detection System that worked on Hindi, English and Hinglish(code-mixed) text.
Achieved state-of-the-art results on the Hindi TRAC dataset while also achieving top 5 results on the English TRAC dataset.
Used smart preprocessing techniques and variations of the advanced Transformer architecture.
Technologies used- Tensorflow, Google CloudOther creatorsSee project -
Microsoft AI Challenge
-
As part of the Microsoft AI challenge, our task was to improve upon the Bing search by incorporating recent advances in NLP.
Our task was to design an advanced information retrieval engine used to answer users' queries based.
We performed transfer learning in NLP using the state of the art architecture BERT(by Google).
Tools and Technology-Tensorflow, Deep Learning, PythonOther creatorsSee project -
Android Assistant for the differently abled
-
Around 50 million people are the victims of hearing/visual/speech impairment.
Thus, we made an Android application that could allow the disabled(hearing, visual, speech or all three) to seamlessly communicate with the people around them.
Our application makes use of Speech Recognition to give an ear to the hearing impaired, TTS that gives a voice to the speech impaired.
In case a person suffers from a combination of hearing/visual impairment, a Morse code vibration corresponding to the…Around 50 million people are the victims of hearing/visual/speech impairment.
Thus, we made an Android application that could allow the disabled(hearing, visual, speech or all three) to seamlessly communicate with the people around them.
Our application makes use of Speech Recognition to give an ear to the hearing impaired, TTS that gives a voice to the speech impaired.
In case a person suffers from a combination of hearing/visual impairment, a Morse code vibration corresponding to the text is generated for quick communication.
In case where a person is speech and visually impaired, handwriting input is used.
Future scope:
1)Using Image Captioning(Xu. et. al.) and TFLite to allow the visually impaired to understand their surroundings.
2)Adding support for regional languages -
Convolutional Neural Networks
-
Designed several convolutional neural networks to cater to different classification datasets like the
CIFER 10, Cat/Dog, Kaggle Distracted driver,MNIST. Some of the techniques and tools used to improve accuracy were Transfer Learning, Dropout, Regularization, different optimizers and Image Augmentation among several others.
Understood the interdependance of various hyperparameters and got great insights into the art of deep learning through hands on experience.
Tools…Designed several convolutional neural networks to cater to different classification datasets like the
CIFER 10, Cat/Dog, Kaggle Distracted driver,MNIST. Some of the techniques and tools used to improve accuracy were Transfer Learning, Dropout, Regularization, different optimizers and Image Augmentation among several others.
Understood the interdependance of various hyperparameters and got great insights into the art of deep learning through hands on experience.
Tools used/experimented-
Tensorflow and Keras
Honors & Awards
-
Best Data Science Insight
University of Southern California
Won the Best Data Science Insight award for our research project on measuring and detection biases in Commonsense knowledge models
-
Top 1%, Microsoft AI Challenge
Microsoft
Made it to the Top 20 among over 2000 competing teams. Our challenge was to improve the Bing Search Engine using AI and Natural Language Processing (NLP)
-
Winner, Mindspark Hackathon 2018
College Of Engineering, Pune
Winner at the prestigious Mindspark Hackathon. The second largest technical festival in Maharashtra.
-
Winner, Software Development (Senior Category)
PICT IEEE branch(Region 10)
Winner at Credenz, one of the largest tech festivals in Pune.
Implemented a deep learning algorithm that was deployed on android using tflite.
Languages
-
English
Native or bilingual proficiency
-
Marathi
Full professional proficiency
-
Hindi
Full professional proficiency
Recommendations received
1 person has recommended Aditya
Join now to viewMore activity by Aditya
-
Capped off my India visit to the #nvidia office in the 3rd largest city and Gateway of the South - #bengaluru (Bangalore). What was a sleepy little…
Capped off my India visit to the #nvidia office in the 3rd largest city and Gateway of the South - #bengaluru (Bangalore). What was a sleepy little…
Liked by Aditya Malte
-
The grind finally paid off. Ten months ago, I started with LeetCode but soon realized that more than LeetCode is needed. I need to stand out and…
The grind finally paid off. Ten months ago, I started with LeetCode but soon realized that more than LeetCode is needed. I need to stand out and…
Liked by Aditya Malte
-
Excited to share NeMo Retriever, a product I've been working on for over a year! Don't miss Jensen Huang presenting it in his GTC keynote:…
Excited to share NeMo Retriever, a product I've been working on for over a year! Don't miss Jensen Huang presenting it in his GTC keynote:…
Liked by Aditya Malte
-
🚀 𝗛𝗼𝘄 𝘁𝗼 𝗔𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁 𝗮𝗻 𝗘𝗻𝘁𝗲𝗿𝗽𝗿𝗶𝘀𝗲 𝗥𝗔𝗚 𝗦𝘆𝘀𝘁𝗲𝗺 🚀 Do you know the most important decision you’ll make when…
🚀 𝗛𝗼𝘄 𝘁𝗼 𝗔𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁 𝗮𝗻 𝗘𝗻𝘁𝗲𝗿𝗽𝗿𝗶𝘀𝗲 𝗥𝗔𝗚 𝗦𝘆𝘀𝘁𝗲𝗺 🚀 Do you know the most important decision you’ll make when…
Liked by Aditya Malte
-
Today we release SteerLM-aligned version of llama2-70B model, llama2-70B-SteerLM-Chat. This model gets 7.54 on MT-Bench. Important aspect of this…
Today we release SteerLM-aligned version of llama2-70B model, llama2-70B-SteerLM-Chat. This model gets 7.54 on MT-Bench. Important aspect of this…
Liked by Aditya Malte
-
Argh, my son just figured out how to steal chocolate from the pantry, and made a brown, gooey mess. Worried about how hard it is to align AI? Right…
Argh, my son just figured out how to steal chocolate from the pantry, and made a brown, gooey mess. Worried about how hard it is to align AI? Right…
Liked by Aditya Malte
-
This view is a 30 second walk from my desk at NVIDIA's Endeavor Building looking towards our new Voyager building. Simply amazing! #NoFilter
This view is a 30 second walk from my desk at NVIDIA's Endeavor Building looking towards our new Voyager building. Simply amazing! #NoFilter
Liked by Aditya Malte
-
Today, our team is happy to share SteerLM, a much simpler alternative to RLHF which allows dynamic model controls during inference (humor, verbosity,…
Today, our team is happy to share SteerLM, a much simpler alternative to RLHF which allows dynamic model controls during inference (humor, verbosity,…
Liked by Aditya Malte
-
I didn't actually convince Jensen, instead I just explained deep learning to him. He instantly formed his own conviction and pivoted NVIDIA to be an…
I didn't actually convince Jensen, instead I just explained deep learning to him. He instantly formed his own conviction and pivoted NVIDIA to be an…
Liked by Aditya Malte
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More