Stars
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Pandora: Towards General World Model with Natural Language Actions and Video States
Turns Data and AI algorithms into production-ready web applications in no time.
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
A generative speech model for daily dialogue.
Implementation of "Generating Sequences With Recurrent Neural Networks" https://arxiv.org/abs/1308.0850
中文语音识别; Mandarin Automatic Speech Recognition;
动态语义SLAM 目标检测+VSLAM+光流/多视角几何动态物体检测+octomap地图+目标数据库
Proposed implementation of Alex Graves paper
A modern download manager that supports all platforms. Built with Golang and Flutter.
We write your reusable computer vision tools. 💜
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Open-Sora: Democratizing Efficient Video Production for All
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
🏀 Visualization of NBA games from raw SportVU data logs
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained
scikit-learn: machine learning in Python
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
Code that accompanies my blog post outlining five video classification methods in Keras and TensorFlow