MononAI
- Dhaka
Starred repositories
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
A modular graph-based Retrieval-Augmented Generation (RAG) system
A @ClickHouse fork that supports high-performance vector search and full-text search.
Official implementation of the paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Large World Model -- Modeling Text and Video with Millions Context
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
[ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
Toolkit for creating, sharing and using natural language prompts.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Refine high-quality datasets and visual AI models
A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
The impossibly small web framework for Python and MicroPython.
Emu Series: Generative Multimodal Models from BAAI