Stars
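🇨🇳 The most complete and up-to-date data for China's provinces, cities, districts/counties, and townships/subdistricts, in JSON, CSV, and SQL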
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Generate 3D objects conditioned on text or images
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
GraphRAG using Ollama with Gradio UI and Extra Features
Separate audio stems (vocals, bass, drums) from song, recombine, tempo match, slice/crop audio
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
Distributed LLM inference for mobile, desktop and server.
TensorRT for the YOLO series (YOLOv10, YOLOv9, YOLOv8, YOLOv7, YOLOv6, YOLOX, YOLOv5), with NMS plugin support
An Open-source Framework for Autonomous Language Agents
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Long Context Transfer from Language to Vision
The App Store for your multi-account ecosystem.
A lightweight OCR refactored from PaddleOCR that runs without the PaddlePaddle deep learning training framework, with very fast inference
Low-code development tool based on PaddlePaddle
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
Understand Human Behavior to Align True Needs
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
SEED-Story: Multimodal Long Story Generation with Large Language Model
Neo4j graph construction from unstructured data using LLMs