acoustic-model

Here are 45 public repositories matching this topic...

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Updated Oct 19, 2023

openvpi / DiffSinger

Star

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

midi diffusion svs acoustic-model singing-voice pitch-prediction singing-voice-synthesis rectified-flow melody-frontend diffussion-model

Updated Oct 20, 2024
Python

MontrealCorpusTools / Montreal-Forced-Aligner

Star

Command line utility for forced alignment using Kaldi

python kaldi pronunciation-dictionary forced-alignment grapheme-to-phone acoustic-model

Updated Oct 1, 2024
Python

My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.

python-library speech-analysis praatscript acoustic-model voice-analysis

Updated Aug 31, 2021
Python

Shahabks / myprosody

Star

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

python-library voice-recognition prosody phonemes speech-analysis acoustic-model acoustic-features speech-patterns

Updated Nov 28, 2022
Python

cvqluu / Factorized-TDNN

Star

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

neural-network pytorch speech-recognition neural-networks kaldi speaker-recognition speaker-verification embedding speaker-diarization tdnn acoustic-model acoustic-models x-vector tdnn-f factorized-tdnn

Updated Jan 6, 2020
Python

guanlongzhao / fac-via-ppg

Star

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

speech-synthesis acoustic-model accent-conversion

Updated Jul 6, 2023
Python

aluo-x / Learning_Neural_Acoustic_Fields

Star

Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)

pytorch impulse-response spatial-audio acoustics 3d-audio reverberation acoustic-model acoustic-models neural-fields implicit-functions neural-field spatial-audio-reproduction

Updated Jan 20, 2024
Python

X-LANCE / UniCATS-CTX-txt2vec

Star

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

text-to-speech tts speech-synthesis acoustic-model unicats vq-diffusion ctx-txt2vec

Updated Feb 23, 2024
Python

HumBug-Mosquito / HumBugDB

Star

Acoustic mosquito detection code with Bayesian Neural Networks

audio pytorch feature-extraction keras-tensorflow bayesian-neural-networks acoustic-model acoustic-features

Updated Oct 4, 2021
Jupyter Notebook

jim-schwoebel / sound_event_detection

Star

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

machine-learning acoustic-fingerprinting object-detection event-detection acoustics object-detection-pipelines audioset acoustic-model sound-event-detection acoustic-features object-detection-label common-voice common-voice-tool voice-computing object-detection-accuracy voicebook surveylex neurolex

Updated Feb 20, 2022
Python

slp-rl / salmon

Star

The official code for the SALMon🍣 benchmark

audio-processing acoustic-model speech-language-model

Updated Sep 15, 2024
Python

hcy71o / SC-CNN

Star

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

text-to-speech tts speech-synthesis zero-shot feature-extractor acoustic-model multi-speaker-tts

Updated Nov 1, 2023
Python

sooftware / End-to-End-Speech-Recognition-Models

Sponsor

Star

PyTorch implementation of automatic speech recognition models.

end-to-end pytorch transformer las vad e2e asr acoustic-model voice-activity-detection deepspeech2 listen-attend-and-spell

Updated Jan 10, 2021
Python

ronggong / jingjuSingingPhraseMatching

Star

Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information

score cnn-model phoneme singing-phrase acoustic-model hsmm

Updated Jul 9, 2017
Python

mozilla / deepspeech-playbook

Star

A crash course for training speech recognition models using DeepSpeech.

speech-recognition language-model acoustic-model deepspeech common-voice

Updated May 16, 2021

zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models

Star

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

deep-learning tensorflow lstm rnn automatic-speech-recognition ctc timit acoustic-model

Updated Jan 18, 2018
Python

secretsauceai / precise-wakeword-model-maker

Star

Automated, end-to-end wakeword model maker using the Precise Wakeword Engine

nlp machine-learning hotword-detection acoustic-model wakeword wakeword-activation

Updated Feb 23, 2022
Python

mntabassm / SAEN-LARS

Star

Sequential adaptive elastic net (SAEN) approach, complex-valued LARS solver for weighted Lasso/elastic-net problems, and sparsity (or model) order detection with an application to single-snapshot source localization.

adaptive-learning sparse-regression matlab-toolbox regularized-linear-regression elastic-net sparse-reconstruction lasso-regression source-localization acoustic-model regularization-paths direction-of-arrival sparse-regularization compressed-beamforming complex-valued-data solution-path

Updated Mar 5, 2020
MATLAB

HarikalarKutusu / 3d-voice-chess

Star

A voice driven 3D chess game for learning Voice AI

threejs games chess speech-recognition language-model stt acoustic-model common-voice coqui-ai

Updated Jul 6, 2022
Jupyter Notebook

Improve this page

Add a description, image, and links to the acoustic-model topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the acoustic-model topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

acoustic-model

Here are 45 public repositories matching this topic...

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

openvpi / DiffSinger

MontrealCorpusTools / Montreal-Forced-Aligner

Shahabks / my-voice-analysis

Shahabks / myprosody

cvqluu / Factorized-TDNN

guanlongzhao / fac-via-ppg

aluo-x / Learning_Neural_Acoustic_Fields

X-LANCE / UniCATS-CTX-txt2vec

HumBug-Mosquito / HumBugDB

jim-schwoebel / sound_event_detection

slp-rl / salmon

hcy71o / SC-CNN

sooftware / End-to-End-Speech-Recognition-Models

ronggong / jingjuSingingPhraseMatching

mozilla / deepspeech-playbook

zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models

secretsauceai / precise-wakeword-model-maker

mntabassm / SAEN-LARS

HarikalarKutusu / 3d-voice-chess

Improve this page

Add this topic to your repo