AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
The official gpt4free repository | a collection of various powerful language models
The world's simplest facial recognition API for Python and the command line
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and…
Making large AI models cheaper, faster and more accessible
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
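A chatbot built on large language models that integrates with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It can use GPT3.5/GPT-4o/GPT-o1/Claude/ERNIE Bot (文心一言)/iFlytek Spark (讯飞星火)/Tongyi Qianwen (通义千问)/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service bots on your own knowledge base.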
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Instant voice cloning by MIT and MyShell.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Ships with built-in multilingual libraries.
Open-source code for MiniGPT-4 and MiniGPT-v2
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
State-of-the-art 2D and 3D Face Analysis Project
Code for the paper "Language Models are Unsupervised Multitask Learners"
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
ChatGLM2-6B: An Open Bilingual Chat LLM
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched