Skip to content
View zhengwayne's full-sized avatar

Organizations

@storios-team

Block or report zhengwayne

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Streaming ASR and TTS based on FastAPI+ sherpa-onnx

Python 37 5 Updated Sep 30, 2024

MemFree - Hybrid AI Search Engine & AI Page Generator

TypeScript 1,039 164 Updated Nov 16, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,463 166 Updated Nov 7, 2024

An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it wil…

TypeScript 2,938 293 Updated Apr 22, 2024

React app for inspecting, building and debugging with the Realtime API

JavaScript 2,065 751 Updated Oct 7, 2024

Build your own AI friend

C++ 370 93 Updated Nov 15, 2024

esp32 based device, mainly used for voice chat with large language models

C++ 665 156 Updated Mar 24, 2024

API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Python 219 32 Updated Oct 23, 2024

ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow

Python 22 1 Updated Aug 29, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,949 417 Updated Aug 10, 2024

本项目使用esp32、esp32s3接入讯飞星火、豆包、通义千问(智能体应用)、Chatgpt等大模型,实现语音对话聊天功能,支持语音唤醒、连续对话、音乐播放等功能,同时外接了一块显示屏实时显示对话的内容。

C 151 34 Updated Nov 7, 2024

Build real-time multimodal AI applications 🤖🎙️📹

Python 3,984 410 Updated Nov 16, 2024

Create Beautiful Resume use Claude Artifacts. AI 智能简历

TypeScript 51 7 Updated Sep 15, 2024

深度学习经典、新论文逐段精读

27,118 2,449 Updated Aug 8, 2024

Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and …

Python 4,664 781 Updated Nov 5, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,000 514 Updated Nov 16, 2024

《照片修复小小助手》是一款基于微信AI能力的微信小程序,实现了图片选定区域的消除修复功能,纯客户端实现,无服务端。Inpaint_wechat is a WeChat mini-program based on the WeChat AI capabilities, implementing the functionality of inpainting and repairing sele…

JavaScript 376 54 Updated Jan 31, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 12,558 1,284 Updated Nov 16, 2024

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 1,328 104 Updated Nov 15, 2024

记录大模型相关的一些知识和方法

Jupyter Notebook 106 18 Updated Nov 13, 2024

Not only automatic, but also intelligent. An Intelligent data Visualization System, based on LLM.

TypeScript 193 17 Updated Oct 31, 2024

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity an…

Python 1,037 85 Updated Oct 14, 2024

Educational language-learning app for Hokkien, a low-resource language, featuring flashcards, quizzes, and generative AI!

JavaScript 8 2 Updated Nov 16, 2024

ComfyUI nodes to use segment-anything-2

Python 645 41 Updated Oct 3, 2024

real time face swap and one-click video deepfake with only a single image

Python 40,693 5,922 Updated Nov 14, 2024

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 76,041 10,023 Updated Nov 16, 2024

Agent Zero AI framework

Python 4,862 1,077 Updated Nov 16, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 22,937 2,246 Updated Nov 16, 2024
Next