visionjo

Joseph P. Robinson visionjo

Ph.D., Northeastern, 2020. Focus: applied machine learning, mostly vision. At Vicarious Surgical's ASDAI group, an AI Engineer working on our surgical robot.

42 followers · 26 following

Achievements

Highlights

Block or Report

Block or report visionjo

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (2)

Sort

Depth

3 repositories

Face

1 repository

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

L1aoXingyu / Deep-Learning-Project-Template

A best practice for deep learning project template architecture.

Python 1,243 292 Updated Apr 12, 2019

JDAI-CV / fast-reid

SOTA Re-identification Methods and Toolbox

Python 3,346 827 Updated Jul 15, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 9,916 811 Updated Jul 20, 2024

theEricMa / DiffSpeaker

This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Python 123 15 Updated Mar 31, 2024

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,250 872 Updated Jun 17, 2024

facebookresearch / audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,627 245 Updated Jun 24, 2024

PantoMatrix / PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Python 878 159 Updated Jul 7, 2024

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,154 911 Updated Apr 1, 2024

HowieMa / CVTHead

[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"

Python 81 7 Updated Feb 16, 2024

pkhungurn / talking-head-anime-2-demo

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

Python 1,118 154 Updated Jun 29, 2022

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 9,907 845 Updated Jul 6, 2024

abetlen / llama-cpp-python

Python bindings for llama.cpp

Python 7,275 866 Updated Jul 24, 2024

davabase / whisper_real_time

Real time transcription with OpenAI Whisper.

Python 2,112 363 Updated Jun 1, 2024

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 1,305 115 Updated Jul 19, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,505 187 Updated Jun 6, 2024

pkhungurn / talking-head-anime-3-demo

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 913 97 Updated Aug 29, 2023

NVIDIA / waveglow

A Flow-based Generative Network for Speech Synthesis

Python 2,250 528 Updated Oct 19, 2023

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 4,988 1,374 Updated Jun 12, 2024

wladradchenko / wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

Python 793 91 Updated Jul 16, 2024

deepalianeja / CharacterLipSync

Real-Time Lip Sync for Live 2D Animation

126 16 Updated Nov 7, 2019

tracend / papagayo

A lip-syncing program written in python

Python 42 7 Updated Jan 10, 2014

pkhungurn / talking-head-anime-demo

Demo for the "Talking Head Anime from a Single Image."

Python 1,987 286 Updated Jun 29, 2022

yzhou359 / MakeItTalk

Forked from adobe-research/MakeItTalk

Jupyter Notebook 946 215 Updated Mar 20, 2024

DanielSWolf / rhubarb-lip-sync

Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other p…

C++ 1,742 209 Updated Jan 11, 2024

morevnaproject-org / papagayo-ng

Papagayo is a lip-syncing program designed to help you line up phonemes (mouth shapes) with the actual recorded sound of actors speaking. Papagayo makes it easy to lip sync animated characters by m…

Python 233 51 Updated Apr 25, 2023

visenger / awesome-mlops

A curated list of references for MLOps

12,309 1,857 Updated Jun 10, 2024

promptfoo / promptfoo

Test your prompts, agents, and RAGs. Redteaming, pentesting, vulnerability scanning for LLMs. Improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and m…

TypeScript 3,680 261 Updated Jul 24, 2024