Block or Report
Block or report XJ-ITIC-CBQ
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseAI
Leading free and open-source face recognition system
The world's simplest facial recognition api for Python and the command line
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
WEB VIDEO PLATFORM是一个基于GB28181-2016标准实现的网络视频平台,支持NAT穿透,支持海康、大华、宇视等品牌的IPC、NVR、DVR接入。支持国标级联,支持rtsp/rtmp等视频流转发到国标平台,支持rtsp/rtmp等推流转发到国标平台。
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
使用YOLOv5和LPRNet进行车牌检测+识别(CCPD数据集)
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Port of OpenAI's Whisper model in C/C++
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Stable Diffusion web UI
Supercharged experience for multiple models such as ChatGPT, DALL-E and Stable Diffusion.
Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free
Open source short video automatic generation tool
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
ModelScope: bring the notion of Model-as-a-Service to life.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Official release of InternLM2 7B and 20B base and chat models. 200K context support
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation