imomin
  • Python scraper based on AI

    Python MIT License Updated May 5, 2024
  • The first open-source Large Action Model generalist Artificial Narrow Intelligence that fully controls human user interfaces using only natural language. PyWinAssistant utilizes Visualizati…

    Python MIT License Updated Apr 27, 2024
  • Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper

    TypeScript Updated Apr 12, 2024
  • Code and dataset for photorealistic Codec Avatars driven from audio

    Python Other Updated Jan 4, 2024
  • OpenVoice Public

    Forked from myshell-ai/OpenVoice

    Instant voice cloning by MyShell

    Python Other Updated Jan 1, 2024
  • wesper-demo Public

    Forked from rkmt/wesper-demo
    Python MIT License Updated Dec 25, 2023
  • ✨ Experience the enchantment of Story Block: an open-source project merging AI text generation and image synthesis to create captivating video narratives. 📚🎥 Watch as your text prompts come to life…

    Python MIT License Updated Nov 8, 2023
  • [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

    Python Apache License 2.0 Updated Sep 10, 2023
  • Search millions of high-quality royalty-free stock photos, images, and videos from popular online media services.

    TypeScript MIT License Updated Aug 24, 2023
  • StableVideo Public

    Forked from rese1f/StableVideo

    [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

    Python Apache License 2.0 Updated Aug 18, 2023
  • bark-TTS Public

    Forked from suno-ai/bark

    🔊 Text-Prompted Generative Audio Model

    Jupyter Notebook MIT License Updated Jul 31, 2023
  • DPE Public

    Forked from OpenTalker/DPE

    [CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

    Python MIT License Updated Jul 21, 2023
  • ShortGPT Public

    Forked from RayVentura/ShortGPT

    AI framework for automating video and short content creation

    Python Other Updated Jul 17, 2023
  • roop Public

    Forked from s0md3v/roop

    one-click deepfake (face swap)

    Python GNU General Public License v3.0 Updated Jul 16, 2023
  • Real Time Foreign Accent Conversion

    Python GNU General Public License v2.0 Updated Jul 10, 2023
  • Official PyTorch implementation of Text2Cinemagraph: Synthesizing Artistic Cinemagraphs from Text

    Python MIT License Updated Jul 10, 2023
  • TypeScript GNU General Public License v3.0 Updated Jul 4, 2023
  • Faster Whisper transcription with CTranslate2

    Python Updated Jun 17, 2023
  • SadTalker Public

    Forked from OpenTalker/SadTalker

    [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

    Python MIT License Updated Jun 14, 2023
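  • This project builds on SadTalkers to implement Wav2lip-style lip synthesis for video. It generates lip shapes driven by speech, taking a video file as input, and applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It also uses DAIN, a deep learning frame interpolation algorithm, to fill in frames of the generated video, smoothing the motion between synthesized lip frames so that the result looks more fluid, realistic, and natural.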

    Python Updated Jun 4, 2023
  • Stable Diffusion web UI

    Python GNU Affero General Public License v3.0 Updated May 28, 2023
  • GeneFace Public

    Forked from yerfor/GeneFace

    GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

    Python MIT License Updated May 7, 2023
  • roomGPT Public

    Forked from Nutlope/roomGPT

    Upload a photo of your room to generate your dream room with AI.

    TypeScript Updated Apr 17, 2023
  • TTS Public

    Forked from coqui-ai/TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Python Mozilla Public License 2.0 Updated Apr 1, 2023
  • storyteller Public

    Forked from jaketae/storyteller

    Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

    Python MIT License Updated Mar 23, 2023
  • Intelligent customer support bot

    Python GNU General Public License v3.0 Updated Feb 5, 2023
  • Port of OpenAI's Whisper model in C/C++

    C MIT License Updated Dec 6, 2022
  • calendso Public

    Forked from calcom/cal.com

    The open-source Calendly alternative.

    TypeScript Other Updated Oct 12, 2022
  • STIT Public

    Forked from rotemtzaban/STIT
    Python MIT License Updated May 17, 2022
  • spleeter Public

    Forked from deezer/spleeter

    Deezer source separation library including pretrained models.

    Python MIT License Updated Sep 3, 2021