SIGMIND's starred repositories

Jupyter Notebook 752 255 Updated May 15, 2024

Real-time face swap and one-click video deepfake with only a single image

Python 38,863 5,604 Updated Oct 20, 2024

A fast, local neural text to speech system

C++ 6,242 457 Updated Aug 7, 2024

C library to manage the GPIO header of the Nvidia Jetson boards

C 75 13 Updated Sep 27, 2024

A C++ library that enables the use of Jetson's GPIOs

C++ 281 103 Updated Jun 4, 2024

A simple co-pilot for Linux to interpret human language queries into useful Linux terminal commands and execute them

Python 4 Updated Jul 5, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,898 151 Updated Sep 25, 2024

Official repository for the paper PLLaVA

Python 575 39 Updated Jul 28, 2024

ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Python 46 3 Updated Sep 12, 2024

This project has implemented the RAG function on Jetson with video formats.

Python 6 3 Updated Jun 13, 2024

Onvif Device Manager for Linux

C 92 21 Updated Oct 17, 2024

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 544 59 Updated Oct 4, 2024

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Python 3,428 759 Updated Dec 23, 2022

QualityScaler - image/video AI upscaler app

Python 2,045 149 Updated Sep 25, 2024

A reference example for integrating NanoOwl with Metropolis Microservices for Jetson

Python 24 3 Updated Jun 14, 2024

Instant voice cloning by MIT and MyShell.

Python 29,291 2,875 Updated Aug 21, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,805 2,176 Updated Aug 12, 2024

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,185 106 Updated Aug 27, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,417 73 Updated Oct 9, 2024

LLM inference in C/C++

C++ 66,549 9,570 Updated Oct 20, 2024

Okkhor-Diffusion: Bangla Handwritten Character Generation using DDPM

Python 5 1 Updated Mar 15, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 53,942 7,252 Updated Oct 17, 2024

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Python 2,284 469 Updated Oct 18, 2024

Let us control diffusion models!

Python 30,138 2,715 Updated Feb 25, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,759 253 Updated Jun 4, 2024

Awesome Large Action Model (LAM): Models that could help get things done.

235 14 Updated Jan 14, 2024

Grok open release

Python 49,493 8,323 Updated Aug 30, 2024

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Linux using TensorRT-LLM

Python 19 5 Updated Mar 1, 2024

Foundational model for human-like, expressive TTS

Python 3,813 655 Updated Jul 30, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,639 252 Updated Aug 9, 2024