Highlights
- Pro
Stars
Convert PDF to markdown quickly with high accuracy
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Anthropic's educational courses
An extremely fast Python linter and code formatter, written in Rust.
A feature-rich command-line audio/video downloader
Python bindings for FFmpeg - with complex filtering support
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
real time face swap and one-click video deepfake with only a single image
The fastest way to create an HTML app
SGLang is a fast serving framework for large language models and vision language models.
A modular graph-based Retrieval-Augmented Generation (RAG) system
A free, open source, multi-platform SQLite database manager.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Python packaging and dependency management made easy
KAN (Kolmogorov-Arnold Network)-based Recommendation (CF)
LLM-Merging: Building LLMs Efficiently through Merging
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities or…
Arena-Hard-Auto: An automatic LLM benchmark.