![database logo](https://raw.githubusercontent.com/github/explore/13295c57999765ac9ffa3281942a72ab08b79de2/topics/database/database.png)
Block or Report
Block or report rhythm35
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
Machine Learning Toolkit for Kubernetes
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
📝 An awesome Data Science repository to learn and apply for real world problems.
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Large-scale text-video dataset. 10 million captioned short videos.
Open-Sora: Democratizing Efficient Video Production for All
A topic-centric list of HQ open datasets.
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…
Starter code for working with the YouTube-8M dataset.
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
PC wechat robot interface [wechat Hook] / PC微信3.9.10.16/3.9.2.23接口 微信Hook 微信机器人 微信Hook源码 PC微信协议算法
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Projects and e-book for our course, REST APIs with Flask and Python
Supervoice Speaker Separation Network
微信公众号文章批量下载工具,支持图片、评论下载,支持保存html/mhtml/md/pdf/docx文件