![cli logo](https://raw.githubusercontent.com/github/explore/aca0b3b69ca680013b925338b0cc428190aa42dc/topics/cli/cli.png)
-
Alibaba
- China, HangZhou
- puke3615.github.io
Block or Report
Block or report puke3615
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
Python version of the Playwright testing and automation library.
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
ControlNet scheduling and masking nodes with sliding context support
A custom node set for Video Frame Interpolation in ComfyUI.
Custom nodes pack for ComfyUI This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
Fast and Simple Face Swap Extension Node for ComfyUI
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
肖像大师 中文版 comfyui-portrait-master
🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Unofficial implementation of InstantID for ComfyUI
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
ComfyUI's ControlNet Auxiliary Preprocessors
YOLOv10: Real-Time End-to-End Object Detection
gpt-4o for windows, macos and linux
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>