Starred repositories
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
End-to-end zero-shot entity and relation extraction
Aligning Large Language Models on Information Extraction
Official Electron build of draw.io
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-script based invocation make it difficult to use for application …
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
A Strict JSON Framework for LLM Outputs
Task-based Agentic Framework using StrictJSON as the core
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
sqlite3 in ur indexeddb (hopefully a better backend soon)
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
A simple, easy-to-use PowerShell script to remove pre-installed apps from Windows, disable telemetry, remove Bing from Windows search as well as perform various other changes to declutter and impro…
Code, models, and data for "Personalized Text Generation with Fine-Grained Linguistic Control". EACL 2024, Personalization of Generative AI.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…
A very simple framework for state-of-the-art Natural Language Processing (NLP)
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative