Stars
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
A community-maintained Python framework for creating mathematical animations.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Full retriever for art and metadata in https://wikiart.org/
Image-embodied Knowledge Representation Learning (IJCAI-2017)
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
This is the official repository for Retrieval Augmented Visual Question Answering
Binary VQA test set for testing Visual Question Answering Models
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
OpenKG-ORG / OpenRichpedia
Forked from z1514/OpenRichpedia东南大学多模态知识图谱-OpenRichpedia工程文件
Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.
Several data modalities for KBs (visual, numerical, temporal, etc.)
A PyTorch reimplementation of bottom-up-attention models
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
Efficiently computes derivatives of NumPy code.
A collection of resources on multimodal knowledge graph, including datasets, papers and contests.
PromptKG Family: a Gallery of Prompt Learning & KG-related research works, toolkits, and paper-list.
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Optimized primitives for collective multi-GPU communication
An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'