cassiotbatista

Cassio T Batista cassiotbatista

speechproc @Vivoka

54 followers · 115 following

Achievements

Organizations

Block or Report

Block or report cassiotbatista

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (14)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

emptymalei / awesome-research

🌱 a curated list of tools to help you with your research/life; I built a front end around this repo, please use the link below [This repo is Not Maintained Anymore]

1,984 207 Updated Aug 15, 2023

dynamic-superb / dynamic-superb

The official repository of Dynamic-SUPERB.

Python 142 87 Updated Jul 16, 2024

kensho-technologies / sequence_align

Efficient implementations of Needleman-Wunsch and other sequence alignment algorithms written in Rust with Python bindings via PyO3.

Python 62 3 Updated Jun 30, 2024

artie-inc / artie-bias-corpus

Artie Bias Corpus: an audio corpus + code for detecting demographic bias

Python 20 4 Updated Jul 21, 2020

facebookresearch / MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 726 37 Updated Jul 13, 2024

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,189 206 Updated Jul 9, 2024

magenta / ddsp

DDSP: Differentiable Digital Signal Processing

Python 2,829 331 Updated Jun 17, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 10,233 610 Updated Jul 15, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 689 67 Updated May 22, 2024

apple / ml-ferret

Python 8,216 480 Updated Jan 27, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 2,058 181 Updated Jul 15, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 1,353 115 Updated Jul 16, 2024

OSU-NLP-Group / Mind2Web

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 619 88 Updated Apr 29, 2024

aishwaryanr / awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

6,254 1,282 Updated Jul 15, 2024

huggingface / autotrain-advanced

🤗 AutoTrain Advanced

Python 3,647 441 Updated Jul 9, 2024

microsoft / Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Python 1,365 142 Updated Jul 16, 2024

NVIDIA / audio-flamingo

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 108 2 Updated Jul 13, 2024

yangshun / tech-interview-handbook

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 114,213 14,334 Updated Jul 1, 2024

williamboman / mason.nvim

Portable package manager for Neovim that runs everywhere Neovim runs. Easily install and manage LSP servers, DAP servers, linters, and formatters.

Lua 7,299 261 Updated Jul 11, 2024

Quantco / spox

Pythonic framework for building ONNX graphs

Python 64 4 Updated Jul 8, 2024

BriansIDP / WhisperBiasing

Jupyter Notebook 58 2 Updated Sep 12, 2023

bayartsogt-ya / whisper-multiple-hf-datasets

Whisper fine-tuning event script to use multiple hf datasets

Python 31 7 Updated Dec 20, 2022

HKAB / whisper-finetune-vietnamese

Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM

Jupyter Notebook 34 10 Updated Oct 6, 2023

jumon / whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Python 199 27 Updated May 29, 2023

vasistalodagala / whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python 196 46 Updated May 23, 2023

Vaibhavs10 / fast-whisper-finetuning

Jupyter Notebook 407 33 Updated Jul 10, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 748 119 Updated Jul 3, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,475 183 Updated Jun 6, 2024

SpeechColab / GigaSpeech2

An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement

Python 81 4 Updated Jun 22, 2024

utter-project / mHuBERT-147-scripts

Collection of scripts from mHuBERT-147.

Python 19 2 Updated Jul 2, 2024

Cassio T Batista cassiotbatista

Organizations

Block or report cassiotbatista

Lists (14)

ASR

career

DataPipe

FA

FE

Linux

LLM

OnDev

SD

SER

SPE

SSL

TTS

VAD

Stars