Lists (3)
Sort Name ascending (A-Z)
Starred repositories
A list of free LLM inference resources accessible via API.
PyTorch code and models for the DINOv2 self-supervised learning method.
Header-only C++/python library for fast approximate nearest neighbors
A library for efficient similarity search and clustering of dense vectors.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
State-of-the-Art Text Embeddings
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
ImageBind One Embedding Space to Bind Them All
LAVIS - A One-stop Library for Language-Vision Intelligence
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization (BMVC 2024 Oral ✨)
A general fine-tuning kit geared toward diffusion models.
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
Pdf Squirrel offers tools for image-based document analysis, featuring block detection, PDF to image conversion, image normalization, selective blurring, and sentence highlighting. Ideal for develo…
It's not AI that takes away your job, but the people who master the use of AI tools. The most deadly attack is a dimension-reducing strike: destroying you has nothing to do with you - from "The Thr…
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A Bulletproof Way to Generate Structured JSON from Language Models
Track and predict the energy consumption and carbon footprint of training deep learning models.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
My first Multi-Modal RAG pipeline....Dummy version
高颜值的第三方网易云播放器,支持 Windows / macOS / Linux
(NeurIPS 2022) On Embeddings for Numerical Features in Tabular Deep Learning