Stars
Fast and accurate automatic speech recognition (ASR) for edge devices
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
On-device Speech Recognition for Apple Silicon
📄 A curated list of awesome .cursorrules files
Optimized implementation for color-icon-matrix barcodes
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
「海外工具网站」已经是我人生主要事业了,很庆幸还来得及,感谢这个伟大的 AI 时代。
Find the best cursor rules for your framework and language
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
real time face swap and one-click video deepfake with only a single image
🔊 Text-Prompted Generative Audio Model
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Desktop app for prototyping and debugging LangGraph applications locally.
Real time interactive streaming digital human
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
大概是2020年最全的免费可商用字体,这里收录的商免字体都能找到明确的授权出处,可以放心使用,持续更新中...
Voice activity detector (VAD) for the browser with a simple API
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…