Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
🎨 Diagram as Code for prototyping cloud system architectures
We write your reusable computer vision tools. 💜
State-of-the-art 2D and 3D Face Analysis Project
🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
☁️ Build multimodal AI applications with cloud-native stack
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Low-code framework for building custom LLMs, neural networks, and other AI models
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
ModelScope: bring the notion of Model-as-a-Service to life.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Setup and customize deep learning environment in seconds.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Insane(ly slow but wicked good) PNG image optimization
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
The Fulcrum Keyboard is an ergo-mechanical split keyboard with extra thumb functionality. It has 20 keys, two rotary encoders, and two 5-way switches.