Leveraging the BLIP Model for Visual Question Answering: A Comparative Analysis on VQA and DAQUAR Datasets
machine-learning
natural-language-processing
computer-vision
inference
accuracy
image-captioning
bleu-score
blip
visual-question-answering
wups
vqav2
bert-score
daquar
-
Updated
Jun 18, 2024 - Jupyter Notebook