Block or Report
Block or report dunzic
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
GPT4All: Chat with Local LLMs on Any Device
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A generative speech model for daily dialogue.
Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
aider is AI pair programming in your terminal
Open-Sora: Democratizing Efficient Video Production for All
🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
An open-source framework for training large multimodal models.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Turn any glasses into AI-powered smart glasses
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Next generation face swapper and enhancer
Building a quick conversation-based search demo with Lepton AI.