Skip to content
View QIN2DIM's full-sized avatar
:shipit:
Focusing
:shipit:
Focusing

Organizations

@CaptchaAgent
Block or Report

Block or report QIN2DIM

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

73 2 Updated Jul 16, 2024

GraphRAG using Ollama with Gradio UI and Extra Features

Python 279 22 Updated Jul 16, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 398 27 Updated Jul 15, 2024

基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快

Python 307 24 Updated Jun 30, 2024

Low-code development tool based on PaddlePaddle(飞桨低代码开发工具)

Python 4,652 916 Updated Jul 16, 2024

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 892 149 Updated Jul 16, 2024

Virtual network sound card for Microsoft Windows

C++ 1,698 142 Updated May 1, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 360 24 Updated Jul 16, 2024

Supabase for RAG - Build and scale production-ready user facing AI apps

Python 2,534 168 Updated Jul 16, 2024

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Python 7,549 531 Updated Jul 15, 2024

Python package for graph statistics

Python 675 136 Updated Jul 10, 2024

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,028 134 Updated Jul 16, 2024

Build real-time multimodal AI applications 🤖🎙️📹

Python 680 115 Updated Jul 16, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 40,950 7,526 Updated Jul 16, 2024

End-to-end stack for WebRTC. SFU media server and SDKs.

Go 9,032 729 Updated Jul 16, 2024

Open Source framework for voice and multimodal conversational AI

Python 2,368 136 Updated Jul 15, 2024

由搜狗细胞词库生成的谷歌拼音输入法词典 A dict for Google Pinyin Input, exported from Sougou Pinyin Input.

58 30 Updated Jan 20, 2017

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 3,971 318 Updated Jul 16, 2024

Understand Human Behavior to Align True Needs

Python 2,638 200 Updated Jul 16, 2024
Python 126 39 Updated Jun 18, 2024

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 414 59 Updated Apr 30, 2024

Free on-device web app for audio transcribing and rendering subtitles

ReScript 81 4 Updated Jul 15, 2024

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,810 417 Updated Jul 16, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 2,036 177 Updated Jul 15, 2024

Multilingual Voice Understanding Model

Python 1,344 114 Updated Jul 16, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 11,102 893 Updated Jul 16, 2024

The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts"

530 21 Updated Jul 5, 2024
Next