Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
real time face swap and one-click video deepfake with only a single image
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
ZincSearch . A lightweight alternative to elasticsearch that requires minimal resources, written in Go.
A Web UI for Elasticsearch and OpenSearch: Import, browse and edit data with rich filters and query views, create reference search UIs.
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
Viewer for the structure extracted by Grobid on PDF documents
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
DSPy: The framework for programming—not prompting—foundation models
Bash's powerful command line editing in cmd.exe
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Community maintained fork of pdfminer - we fathom PDF
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Port of OpenAI's Whisper model in C/C++
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation