Starred repositories
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
A video player for iOS、macOS、tvOS、visionOS , based on AVPlayer and FFmpeg, support the horizontal, vertical screen. support adjust volume, brightness and seek by slide, SwiftUI, support subtitles.
A full iOS/iPadOS app for creating, editing, and storing GIFs
GIF encoder based on libimagequant (pngquant). Squeezes maximum possible quality from the awful GIF format.
Lossy PNG compressor — pngquant command based on libimagequant library
🌈 Convert videos to high-quality GIFs on your Mac
A set of tools to trim, crop and select frames inside a video
Fast and simple OCR library written in Swift
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
An open-source RAG-based tool for chatting with your documents.
MTCNN face detection implementation for TensorFlow, as a PIP package.
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Create Live Graphics in SwiftUI (iOS, tvOS & macOS)
An open-source cross-platform alternative to AirDrop
poseture detection using machine learing and deep learning
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Gamebook Engine is an open source iOS app for creating and playing gamebooks, a type of interactive fiction where the player gets to make decisions that influence the story.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
A sketch extractor for anime/illustration.