Skip to content
View weedge's full-sized avatar
🍀
coding at home
🍀
coding at home

Block or report weedge

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • GLM-4-Voice Public

    Forked from THUDM/GLM-4-Voice

    GLM-4-Voice | 端到端中英语音对话模型

    Python Apache License 2.0 Updated Nov 9, 2024
  • docling Public

    Forked from DS4SD/docling

    Get your documents ready for gen AI

    Python MIT License Updated Nov 9, 2024
  • doraemon-nb Public

    ipython notebooks do some sample experiments , make some idea

    Jupyter Notebook 7 Updated Nov 9, 2024
  • first base model for full-duplex conversational audio

    Python Apache License 2.0 Updated Nov 8, 2024
  • ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM;麻烦请快点,迫不及待想学!

    Updated Nov 7, 2024
  • mini-omni2 Public

    Forked from gpt-omni/mini-omni2

    Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

    Python MIT License Updated Nov 6, 2024
  • pipecat Public

    Forked from pipecat-ai/pipecat

    Open Source framework for voice and multimodal conversational AI

    Python BSD 2-Clause "Simplified" License Updated Nov 6, 2024
  • A natural language interface for computers

    Python GNU Affero General Public License v3.0 Updated Nov 6, 2024
  • n8n Public

    Forked from n8n-io/n8n

    Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

    TypeScript Other Updated Nov 6, 2024
  • ichigo Public

    Forked from homebrewltd/ichigo

    Llama3.1 learns to Listen ; 复现训练过程!

    Python Apache License 2.0 Updated Nov 5, 2024
  • apipeai Public

    Multimodal Content pipe to Multimodal Content with AI , a big idea

    Updated Oct 29, 2024
  • VITA Public

    Forked from VITA-MLLM/VITA

    ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM。主要是了解下训练过程。

    Python Other Updated Oct 24, 2024
  • swarm Public

    Forked from openai/swarm

    Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.

    Python MIT License Updated Oct 12, 2024
  • Transforming Multi-Sourced Text into Captivating Multi-Lingual Audio Conversations with GenAI

    Python Other Updated Oct 11, 2024
  • Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

    Python Updated Oct 6, 2024
  • Cross-platform, customizable ML solutions for live and streaming media.

    C++ Apache License 2.0 Updated Oct 4, 2024
  • High-resolution models for human tasks.

    Python Other Updated Oct 3, 2024
  • RT-DETR Public

    Forked from lyuwenyu/RT-DETR

    [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

    Python Apache License 2.0 Updated Sep 27, 2024
  • ML-DL-note Public

    keyword and algorithm about ML, DL on text, audio, vision case

    Updated Sep 27, 2024
  • vosk-api Public

    Forked from alphacep/vosk-api

    Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

    Jupyter Notebook Apache License 2.0 Updated Sep 26, 2024
  • moshi Public

    Forked from kyutai-labs/moshi
    Python Apache License 2.0 Updated Sep 18, 2024
  • Qwen-Agent Public

    Forked from QwenLM/Qwen-Agent

    Agent framework and applications built upon Qwen2.x, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

    Python Other Updated Sep 18, 2024
  • Python Apache License 2.0 Updated Sep 13, 2024
  • minimind Public

    Forked from jingyaogong/minimind

    【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!

    Python Apache License 2.0 Updated Sep 12, 2024
  • chat bots run web

    Updated Sep 6, 2024
  • The Autograd Engine

    HTML Updated Sep 6, 2024
  • mini-omni Public

    Forked from gpt-omni/mini-omni

    open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

    Python MIT License Updated Sep 5, 2024
  • Qwen2-VL Public

    Forked from QwenLM/Qwen2-VL

    Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

    Python Updated Sep 5, 2024
  • agents Public

    Forked from livekit/agents

    Build real-time multimodal AI applications 🤖🎙️📹

    Python Apache License 2.0 Updated Aug 30, 2024
  • livekit Public

    Forked from livekit/livekit

    End-to-end stack for WebRTC. SFU media server and SDKs.

    Go Apache License 2.0 Updated Aug 30, 2024