Stars
Robust Speech Recognition via Large-Scale Weak Supervision
A natural language interface for computers
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Real-time face swap and one-click video deepfake with only a single image
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell.
State-of-the-art 2D and 3D Face Analysis Project
Industry-leading face manipulation platform
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
A modular graph-based Retrieval-Augmented Generation (RAG) system
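Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments
Generate high-definition short videos with one click using large AI models.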
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas…
Letta (fka MemGPT) is a framework for creating stateful LLM services.
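Abu quantitative trading system (stocks, options, futures, Bitcoin, machine learning): an open-source quantitative trading and investment framework built on Python
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.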
Large Language Model Text Generation Inference
ImageBind: One Embedding Space to Bind Them All
A voice cloning tool with a web interface that records audio using your own voice or any other sound
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Example models using DeepSpeed