YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 9,152 2,159 Updated Jun 28, 2024

kwai / DouZero

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

Python 4,021 573 Updated Jun 26, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,626 1,889 Updated Apr 15, 2024

HuKai97 / YOLOv5-LPRNet-Licence-Recognition

使用YOLOv5和LPRNet进行车牌检测+识别（CCPD数据集）

Python 371 74 Updated May 31, 2022

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 24,723 3,919 Updated Jun 22, 2024

open-mmlab / mmcv

OpenMMLab Computer Vision Foundation

Python 5,723 1,610 Updated Jun 26, 2024

nndl / nndl.github.io

《神经网络与深度学习》邱锡鹏著 Neural Network and Deep Learning

HTML 17,210 3,570 Updated Oct 7, 2022

Const-me / Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

C++ 7,673 665 Updated Oct 17, 2023

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 32,932 3,295 Updated Jul 3, 2024

chidiwilliams / buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 10,766 813 Updated Jul 2, 2024

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,837 3,209 Updated May 7, 2024

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 135,201 25,819 Updated Jul 3, 2024

anse-app / anse

Supercharged experience for multiple models such as ChatGPT, DALL-E and Stable Diffusion.

TypeScript 1,818 433 Updated Apr 30, 2024

THUDM / CogVideo

Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Python 3,539 379 Updated Jun 14, 2023

IDEA-CCNL / Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

Python 3,953 370 Updated Dec 27, 2023

xiangsx / gpt4free-ts

Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free

TypeScript 7,544 1,319 Updated Apr 9, 2024

SCUTlihaoyu / open-chat-video-editor

Open source short video automatic generation tool

Python 2,579 329 Updated Jun 20, 2023

xszyou / Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

8,503 1,706 Updated Jul 3, 2024

Picsart-AI-Research / Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 3,906 336 Updated May 6, 2023

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,446 670 Updated Jul 3, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 51,461 8,630 Updated May 29, 2024

InternLM / InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Python 5,516 402 Updated Jul 3, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,383 1,055 Updated Jul 1, 2024

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,390 341 Updated Jun 13, 2024

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,117 935 Updated Jun 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XJ-ITIC-CBQ

Block or report XJ-ITIC-CBQ

AI

exadel-inc / CompreFace

ageitgey / face_recognition

ZLMediaKit / ZLMediaKit

648540858 / wvp-GB28181-pro

alibaba / EasyCV

Megvii-BaseDetection / YOLOX