Starred repositories
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Link Android and PC easily! 全能手机连接助手!
Official comfyui repository of Hellomeme
微信公众号文章批量下载工具,支持图片、评论下载,支持保存html/mhtml/md/pdf/docx文件
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
An open-source cross-platform alternative to AirDrop
Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH etc. It’s an open-source, secure and ad-free.
m3u8[m3u8-downloader] 视频在线提取工具 流媒体下载 、视频下载 、 m3u8下载 、 B站视频下载 桌面客户端 windows mac
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
支持视频号、小程序、抖音、快手、小红书、直播流、酷狗、QQ音乐等常见网络资源!
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
image to prompt by vikhyatk/moondream1
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
Enable Windows Explorer to display thumbnails for HEIC files
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)
基于无障碍,高级选择器,订阅规则的自定义屏幕点击 Android 应用 | An Android APP with custom screen tapping based on Accessibility, Advanced Selectors, and Subscription Rules
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
Speed up Stable Diffusion with this one simple trick!