Highlights
Starred repositories
dusty-nv / IsaacLab
Forked from isaac-sim/IsaacLabUnified framework for robot learning built on NVIDIA Isaac Sim
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).
An unofficial and opinionated project template designed for a quick start with PySide6 and QtQuick
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion
Este repositorio contiene una plantilla para proyectos en PySide6 con Python
Code for the paper: "FusionMamba: Efficient Image Fusion with State Space Model", 2024.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
QualtricsAPI is a lightweight Python Package for the Qualtrics API.
PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
weifengpy / torchft
Forked from pytorch-labs/torchftPyTorch per step fault tolerance (actively under development)
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
Mastering Diverse Domains through World Models
Implementation of DreamerV3 in Pytorch
A suite of image and video neural tokenizers
Official implementation of the paper "Watermark Anything with Localized Messages"
A collection ROS projects utilizing foundation models.
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Template for Python Projects
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow…