Stars
This package aims at simplifying the download of the AudioSet dataset.
A family of diffusion models for text-to-audio generation.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …
Real-time speech enhancement mobile app using Nested U-Net
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A data augmentations library for audio, image, text, and video.
Pytorch code for "Rethinking CNN Models for Audio Classification"
A guide and set of tools for working with TinyML powered Audio Sensors
Code for YouTube series: Deep Learning for Audio Classification
NNtrainer is Software Framework for Training Neural Network Models on Devices.
A library for soundscape synthesis and augmentation
Python library for rapid prototyping of environmental sound analysis systems
ESPHome is a system to control your ESP8266/ESP32 by simple yet powerful configuration files and control them remotely through Home Automation systems.
ESP32 wake word detection with tensor flow
This repository is a collection of TTS Models in TFLite
This repository contains implementations and illustrative code to accompany DeepMind publications
A VGGish-based DNN trained on the Watkins Marine Mammal Sound Database, with transfer learning from Audioset, to detect multiple marine mammal species.
MotionSense Dataset for Human Activity and Attribute Recognition ( time-series data generated by smartphone's sensors: accelerometer and gyroscope) (PMC Journal) (IoTDI'19)
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
Urban Sound Classification : striving towards a fair comparison
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal…
Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers