-
AI Multimedia Lab, CAMPUS Research Center, University Politehnica of Bucharest
- Bucharest, Romania
- https://gconstantin.aimultimedialab.ro/
Stars
Video Captioning is an encoder decoder mode based on sequence to sequence learning
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing t…
Video Content Description (VCD) is a schema, API and set of tools to produce semantically rich labels from multi-sensorial data series.
Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.
Making decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet
Video to Text: Natural language description generator for some given video. [Video Captioning]
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
This project aims to build & optimise a book recommendation system based on collaborative filtering and will tackle an example of both memory based & model based approach (using KNNWithMeans & Sing…
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Recent Transformer-based CV and related works.
This repository holds the code to the https://multimediaeval.github.io/ website. The `master` branch contains only the `_site` folder built with Jekyll due to the use of a non-whitelisted plugin. T…
The benchmarking of DNN on TC scans forCovid19 diagnosis
Pcap Converter: convert pcap to text or flows.
MARS: Motion-Augmented RGB Stream for Action Recognition
Gate-Shift Networks for Video Action Recognition - CVPR 2020
Learning to Detect Violent Videos using Convolution LSTM (Keras + tensorflow)
MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch 1.0 / Pytorch 0.4. Out-of-box support for retraining on Open Images dataset. ONNX and Caffe2 support. Experiment Ideas lik…
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …
The second version of the DeepFusion system, featuring full end-to-end learning process and cleaner code
Convolutional neural network model for video classification trained on the Kinetics dataset.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow