- Nottingham, UK
Block or Report
Block or report YaoFANGUK
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
小火箭 shadowrocket 配置文件 模块 脚本 module sgmodule 图文教程 规则 分流 破解 解锁
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
GPT4V-level open-source multi-modal model based on Llama3-8B
Paint by Example: Exemplar-based Image Editing with Diffusion Models
互联网公司技术架构,微信/淘宝/微博/腾讯/阿里/美团点评/百度/OpenAI/Google/Facebook/Amazon/eBay的架构,欢迎PR补充
视频字幕翻译,输入srt文件生成翻译后的srt文件。无需申请第三方API,本地实现字幕翻译。基于深度学习的视频字幕翻译框架。Srt file translation, generate translated srt file from input SRT file. No need to apply third-party API, local implementation of subti…
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Erase specific content from the video that you don't wanna see
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
a state-of-the-art-level open visual language model | 多模态预训练模型
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Interact with your documents using the power of GPT, 100% privately, no data leaks
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
🇨🇳 GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
BibiGPT v1 · one-Click AI Summary for Audio/Video & Chat with Learning Content: Bilibili | YouTube | Tweet丨TikTok丨Dropbox丨Google Drive丨Local files | Websites丨Podcasts | Meetings | Lectures, etc. 音视…