Modular and customizable Material Design UI components for the web

TypeScript 17,138 2,147 Updated Oct 9, 2024

multi-task and multi-track music transcription for everyone

85 2 Updated Sep 12, 2024

Audio Plugin for Audio to MIDI transcription using deep learning.

C++ 1,312 67 Updated Oct 6, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,265 171 Updated Sep 23, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 47,490 6,769 Updated Oct 10, 2024

No-code AI workflow

Vue 809 128 Updated Oct 4, 2024

JD.com flash-sale auto-ordering assistant with a GUI; supports Windows and macOS

Python 4,105 894 Updated Aug 2, 2023

Bring portraits to life!

Python 12,225 1,289 Updated Oct 7, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance

Python 5,695 443 Updated Sep 19, 2024

To be the world's best PyTorch project template.

Python 420 60 Updated Mar 25, 2023

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,458 983 Updated Oct 8, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 254 5 Updated Aug 29, 2024

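Batch-generate all kinds of short videos with one click using AI, automatically batch-remix short videos, and automatically publish them to Douyin, Kuaishou, Xiaohongshu, and WeChat Channels; making money has never been this easy! Supports the local speech models chatTTS, fasterwhisper, and GPTSoVITS, and cloud speech services: Azure, Alibaba Cloud, and Tencent Cloud. Supports direct AI image generation with Stable Diffusion and ComfyUI. Generate short videos with one click using A…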

Python 1,940 355 Updated Oct 8, 2024

Generate high-definition short videos with one click using large AI models (LLMs).

Python 16,411 2,605 Updated Jul 26, 2024

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 197 26 Updated Aug 15, 2024

Understand Human Behavior to Align True Needs

Python 3,334 292 Updated Jul 20, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,372 71 Updated Oct 9, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,756 4,715 Updated Oct 9, 2024

av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Python 14 Updated May 8, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,982 34 Updated Jun 6, 2024

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 238 11 Updated Jan 2, 2024

The official Meta Llama 3 GitHub site

Python 26,568 3,005 Updated Aug 12, 2024

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python 700 51 Updated Mar 25, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,299 442 Updated Sep 6, 2024

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,005 78 Updated Oct 10, 2024

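🚀 One-click deployment! A true AI chatbot! Supports ChatGPT, ERNIE Bot, iFLYTEK Spark, Bing, Bard, ChatGLM, and POE, with multiple accounts, persona tuning, a virtual maid, image rendering, and voice messages | Supports QQ, Telegram, Discord, WeChat, and other platforms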

Python 13,102 1,557 Updated Mar 23, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,629 4,190 Updated Aug 19, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,823 305 Updated Sep 29, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,823 2,118 Updated Aug 9, 2024