Skip to content
View Mikerhinos's full-sized avatar
Block or Report

Block or report Mikerhinos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Google Research

Jupyter Notebook 33,410 7,778 Updated Jul 10, 2024

Tool for robust segmentation of >100 important anatomical structures in CT and MR images

Python 1,302 216 Updated Jul 10, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,345 242 Updated Jul 9, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 16,588 1,246 Updated May 23, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,073 899 Updated Apr 1, 2024

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 1,884 81 Updated Jun 9, 2024

Real time speech to text transcription app.

Python 364 69 Updated Jan 14, 2023

ComfyUI related stuff and things

1,129 85 Updated May 18, 2024

We write your reusable computer vision tools. đź’ś

Python 17,181 1,322 Updated Jul 10, 2024

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 3,133 242 Updated Jul 10, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,537 1,018 Updated Jun 26, 2024

This project aims to enhance the working environment on Windows

C 22,784 997 Updated Jul 9, 2024

Wav2Lip UHQ extension for Automatic1111

Python 1,158 158 Updated Jun 14, 2024

Style Prompts for ComfyUI

Python 65 7 Updated May 22, 2024

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 5,010 601 Updated Apr 17, 2024

A natural language interface for computers

Python 50,767 4,430 Updated Jul 10, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,422 747 Updated Feb 11, 2024

Next generation face swapper and enhancer

Python 16,541 2,422 Updated Jul 10, 2024

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

C 857 72 Updated Dec 26, 2023

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 12,590 1,573 Updated Jun 21, 2024

ComfyUI Web Extension for saving views and navigating graphs

JavaScript 30 5 Updated May 23, 2024

One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI

JavaScript 59 12 Updated Aug 10, 2023

Pixel art diffusion

Python 79 7 Updated Apr 22, 2023

StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

C# 4,197 334 Updated Jun 21, 2024

Generative Models by Stability AI

Python 23,284 2,571 Updated Jul 9, 2024

SoftVC VITS Singing Voice Conversion

Python 24,727 4,684 Updated Nov 11, 2023

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Python 41,742 4,426 Updated Jul 10, 2024

Implementation of “DreamDiffusion: Generating High-Quality Images from Brain EEG Signals”

Python 433 49 Updated Jan 30, 2024

Generate 3D objects conditioned on text or images

Python 11,443 910 Updated Jun 22, 2024

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Python 1,273 109 Updated Jun 12, 2024