Trained YoloV3 using Transfer Learning based on inclusive image dataset(self-annotated) created from live captioned video. The model has been used to identify the salient information region coordinates on the screen followed by the region being overlapped or occluded by captions. A research-based methodological approach has been employed to quan…
training
ace
captions
object-detection
transfer-learning
resnet-50
acessibility
yolov3
graphical-elements
onscreen-elements
dhh-viewers
-
Updated
Aug 31, 2021 - Python