This repository is a joint collection of libraries and tools for multimodal content analysis and machine translation from Aalto University, EURECOM, INA and the University of Helsinki. Some of the tools were initiated before the MeMAD project and developed further during it; others are direct results of the project.
The collection consists of the following submodules (a note on cloning them and a usage sketch follow the list):
- PicSOM: https://github.com/aalto-cbir/PicSOM
- DeepCaption: https://github.com/aalto-cbir/DeepCaption
- Visual storytelling: https://github.com/aalto-cbir/visual-storytelling
- Speech recognition training scripts for Finnish: https://github.com/psmit/char-fin-2017
- Speaker-aware ASR training: https://github.com/MeMAD-project/speaker-aware-attention-asr
- SphereDiar: https://github.com/Livefull/SphereDiar
- Multimodal ASR: https://github.com/aalto-speech/avsr
- Spoken language identification: https://github.com/py-lidbox/lidbox
- Audio event classification: https://github.com/MeMAD-project/AudioTagger
- Multi-modal image caption translation: https://github.com/MeMAD-project/image-caption-translation
- Statistical tools for caption dataset analysis: https://github.com/MeMAD-project/statistical-tools
- Face recognition: https://github.com/D2KLab/FaceRec
- Media memorability in MediaEval 2019-20: https://github.com/MeMAD-project/media-memorability
- Video content segmentation: https://github.com/MeMAD-project/content-segmentation
- MeMAD metadata converter: https://github.com/MeMAD-project/rdf-converter
- MeMAD Knowledge Graph API: https://github.com/MeMAD-project/api
- MeMAD Explorer: https://github.com/MeMAD-project/explorer
- MeMAD metadata interchange formats: https://github.com/MeMAD-project/interchange-formats
- inaSpeechSegmenter: https://github.com/ina-foss/inaSpeechSegmenter
- inaFaceGender: https://github.com/ina-foss/inaFaceGender
- Subtitle translation: https://github.com/MeMAD-project/subtitle-translation
- Tools for converting and aligning subtitles: https://github.com/MeMAD-project/subalign
- Speech translation: https://github.com/MeMAD-project/speech-translation
- Discourse-aware machine translation: https://github.com/MeMAD-project/doclevel-translation
- Cross-lingual content retrieval: https://github.com/MeMAD-project/cross-lingual-retrieval
- OPUS-MT: MT servers and pre-trained translation models: https://github.com/MeMAD-project/Opus-MT
- OPUS-MT-train: MT training procedures and pipelines: https://github.com/MeMAD-project/OPUS-MT-train
- OPUS-MT-eval: A collection of MT benchmarks: https://github.com/MeMAD-project/OPUS-MT-eval
- The Tatoeba MT Challenge: Multilingual data sets and benchmarks for machine translation: https://github.com/MeMAD-project/Tatoeba-Challenge
- OPUS-CAT: MT plugins for professional translators: https://github.com/MeMAD-project/OPUS-CAT
- OPUS-translator: Web interface for machine translation: https://github.com/MeMAD-project/OPUS-translator
- Document-level machine translation benchmarks:
  - OpenSubtitles2018: a large collection of aligned movie subtitles
  - TED2020: aligned TED Talk subtitles
  - QED: aligned subtitles of educational videos
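
Because the components above are included as git submodules, a plain `git clone` of this repository leaves their directories empty. Cloning with `git clone --recursive <repository-url>`, or running `git submodule update --init --recursive` after a plain clone, fetches them all.

As a quick way to try one of the components, the pre-trained OPUS-MT translation models are also published via the Hugging Face `transformers` library under `Helsinki-NLP/opus-mt-*` names. The sketch below (not part of this repository; the model name and example sentence are illustrative) loads the English-to-Finnish model; the Opus-MT repository itself documents the native Marian server setup.

```python
# Minimal sketch: translating with a pre-trained OPUS-MT model through the
# Hugging Face transformers port. This is just one convenient entry point,
# not the server setup described in the Opus-MT repository.
from transformers import MarianMTModel, MarianTokenizer

model_name = "Helsinki-NLP/opus-mt-en-fi"  # English -> Finnish
tokenizer = MarianTokenizer.from_pretrained(model_name)
model = MarianMTModel.from_pretrained(model_name)

# Tokenize, translate, and decode a sample sentence
batch = tokenizer(["Subtitles make television archives searchable."],
                  return_tensors="pt", padding=True)
generated = model.generate(**batch)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```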