Stars
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contribute.
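FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation.
A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at lower cost, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.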
A collection of awesome video generation studies.
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…
Unofficial implementation of High Fidelity Neural Audio Compression
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
High-Resolution Image Synthesis with Latent Diffusion Models
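Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…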
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Background Matting: The World is Your Green Screen
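Portrait matting dataset containing 34,427 images and their corresponding matting results.
This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the essential theoretical foundations for an algorithm engineer.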