Skip to content
View mazizulak's full-sized avatar
🚀
Growth time!
🚀
Growth time!

Highlights

  • Pro

Block or report mazizulak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 45,427 6,386 Updated Sep 14, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 92,283 14,742 Updated Sep 14, 2024

Port of OpenAI's Whisper model in C/C++

C 34,392 3,497 Updated Sep 11, 2024

"Muzu" is a TypeScript npm library for server-side HTTP request handling and routing.

TypeScript 7 Updated May 5, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,217 341 Updated Sep 14, 2024

JavaScript framework for visual programming

TypeScript 9,987 651 Updated Aug 30, 2024

A native PyTorch Library for large model training

Python 1,541 141 Updated Sep 13, 2024

Reverse Engineering: Decompiling Binary Code with Large Language Models

Python 2,924 210 Updated Aug 16, 2024

The official Meta Llama 3 GitHub site

Python 26,097 2,925 Updated Aug 12, 2024

Inference and training library for high-quality TTS models.

Python 4,180 411 Updated Aug 19, 2024

FRP Fork

Go 119 18 Updated Aug 30, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,405 900 Updated Aug 21, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,379 1,197 Updated Aug 21, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,408 4,056 Updated Aug 16, 2024

Converts text to speech in realtime

Python 1,720 153 Updated Aug 27, 2024

You were probably looking for our website... this is it. We moved our website here, so you can see the insides of how we work.

1,547 323 Updated Sep 5, 2024

Instant voice cloning by MIT and MyShell.

Python 28,377 2,776 Updated Aug 21, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,359 1,061 Updated Jun 21, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,887 1,780 Updated Aug 19, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 89,141 6,979 Updated Sep 13, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,754 1,046 Updated Aug 15, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,593 2,092 Updated Jul 18, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,287 2,210 Updated Sep 9, 2024

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,164 76 Updated Sep 14, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,648 3,442 Updated May 18, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,899 754 Updated Sep 11, 2024

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Python 733 126 Updated Jul 2, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,504 7,963 Updated Sep 10, 2024
Next