Stars
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Real-time face swap and one-click video deepfake with only a single image
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
cmliu / edgetunnel
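Forked from zizifn/edgetunnel. Modifies the original to display VLESS configuration information converted into subscription content. With this script, you can easily convert VLESS configuration information, via online configuration conversion, for use in tools such as Clash or Singbox.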
[CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Create web-based user interfaces with Python. The nice way.
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
PyTorch implementations of a large number of classical backbone CNNs, data augmentation, torch losses, attention, visualization, and some common algorithms.
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
World modeling challenge for humanoid robots
[SIGGRAPH Asia'24 & TOG] Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
draw.io is a JavaScript, client-side editor for general diagramming.
Collect Lots of Shadowsocks, ShadowsocksR, Trojan, Vmess from Public Sources & Filter Best Nodes By Speed
🛰️✨ Free V2ray Configs, Updating Every 10 minutes.
Real-time and accurate open-vocabulary end-to-end object detection
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
🦜🔗 Build context-aware reasoning applications
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [Based on PyTor…
Start building LLM-empowered multi-agent applications in an easier way.
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
An open source implementation of CLIP.
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.