Stars
The development repo for the Primer on Bézier curves, https://pomax.github.io/bezierinfo
A state-of-the-art open visual language model | multimodal pre-trained model
The official implementation of the paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
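Graduate and undergraduate mathematical modeling: outstanding competition papers, modeling algorithms, LaTeX paper templates, algorithm mind maps, reference books, Matlab tutorials, and PPT slides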
A curated list of awesome mathematics resources
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go.
OCR, layout analysis, reading order, table recognition in 90+ languages
🔊 Text-Prompted Generative Audio Model
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Agent that writes consistent and interesting long stories for any fiction form
🎆Interactive Online Platform that Visualizes Algorithms from Code
A simple C++/OpenGL application to create quick and dirty mathematically accurate animations
Pynamical is a Python package for modeling and visualizing discrete nonlinear dynamical systems, chaos, and fractals.
Intelligent multilingual AI video dubbing and translation tool - Linly-Dubbing: "AI-empowered, language without borders"
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
GUI for a Vocal Remover that uses Deep Neural Networks.
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Real-Time High-Resolution Background Matting
Background Matting: The World is Your Green Screen
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation