Skip to content
View KingStorm's full-sized avatar

Block or report KingStorm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

zero-shot voice conversion with in context learning

Python 79 9 Updated Sep 5, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,430 424 Updated Sep 10, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 2,946 311 Updated Sep 10, 2024

Train transformer language models with reinforcement learning.

Python 9,249 1,160 Updated Sep 10, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,007 164 Updated Aug 11, 2024

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,408 230 Updated Jul 8, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 36,945 3,874 Updated Jul 28, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,563 2,071 Updated Aug 9, 2024

VideoSys: An easy and efficient system for video generation

Python 1,611 107 Updated Sep 10, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,231 1,002 Updated Sep 10, 2024

Fast Diffusion Models with Transformers

Python 667 89 Updated Oct 7, 2023

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,977 533 Updated May 31, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 534 35 Updated Jul 23, 2024

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 5,571 1,212 Updated Aug 7, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 94,748 12,025 Updated Sep 2, 2024

Mohanson's Blog

Python 381 108 Updated Sep 7, 2024

Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS

Python 375 27 Updated Nov 26, 2023

Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".

Python 178 22 Updated Mar 9, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 3,942 391 Updated Aug 22, 2024

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 936 97 Updated Apr 19, 2024

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,821 385 Updated Apr 7, 2024

Tensor library for machine learning

C++ 10,819 998 Updated Sep 8, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,740 1,044 Updated Aug 15, 2024

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 6,836 1,835 Updated Jun 1, 2024

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …

JavaScript 5,954 728 Updated Jul 17, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 993 132 Updated Jul 12, 2024

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 8,484 1,125 Updated Apr 2, 2024

Out of time: automated lip sync in the wild

Python 641 143 Updated Jan 23, 2024

MICA - Towards Metrical Reconstruction of Human Faces [ECCV2022]

Python 538 76 Updated Sep 15, 2023

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

Python 1,139 271 Updated Aug 20, 2024
Next