Project submission for hack it sapiens hackathon.
-
Updated
Mar 30, 2024 - Python
Project submission for hack it sapiens hackathon.
Auto caption images for training in Stable Diffusion
Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.
Convert PDF to Markdown via OpenAI multi-modal text/vision model.
autoPDFtagger is a Python tool designed for efficient home-office organization, focusing on digitizing and organizing both digital and paper-based documents. By automating the tagging of PDF files, including image-rich documents and scans of varying quality, it aims to streamline the organization of digital archives.
Create AWS infrastructure using architecture diagrams and natural language interpreted using the OpenAI GPT model.
A simple matrix bot that supports image generation and chatting using ChatGPT, Langchain
A completely private, locally-operated Ai Assistant/Chatbot/Sub-Agent Framework with realistic Long Term Memory and thought formation using Open Source LLMs. Qdrant is used for the Vector DB.
Add a description, image, and links to the gpt-vision topic page so that developers can more easily learn about it.
To associate your repository with the gpt-vision topic, visit your repo's landing page and select "manage topics."