- Italy
- @FCariaggi
Stars
Speech, Language, Audio, Music Processing with Large Language Model
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
Convert phoneme codes and lexicon formats for English speech synths
Text normalization scripts from IRISA lab
StableLM: Stability AI Language Models
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
A collection of libraries to optimise AI model performances
Turn repositories into Jupyter-enabled Docker images
📚 Parameterize, execute, and analyze notebooks
Pretrained language model with 100B parameters
Doing dirty (but extremely useful) things with equals.
Bone/Air conducted speech signal enhancement exploiting multi-modal framework
Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).
Unsupervised text tokenizer focused on computational efficiency
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Reinforcement Learning for Anomaly Detection
Python library for Reservoir Computing using Echo State Networks