Skip to content
View hbredin's full-sized avatar

Highlights

  • Pro

Organizations

@tvd-dataset @camomile-project @pyannote

Block or report hbredin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Mamba SSM architecture

Python 13,283 1,126 Updated Nov 5, 2024

Official Repository For VoxBlink2

Python 51 4 Updated Aug 13, 2024

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

Svelte 45 7 Updated Nov 6, 2024

Package to play with active learning-like subset selection in pyannote.

Jupyter Notebook 1 Updated Sep 20, 2024

Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024

HTML 2 Updated Jul 16, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,534 208 Updated Aug 1, 2024

Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition

Python 31 4 Updated Jun 14, 2024

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Python 47 3 Updated Jun 19, 2024

plotting on terminal

Python 1,788 85 Updated Sep 24, 2024

Speech-to-text in Obsidian using OpenAI Whisper

TypeScript 227 30 Updated Mar 2, 2024

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 4,222 321 Updated Nov 22, 2024
Python 49 3 Updated Feb 8, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,621 298 Updated Oct 28, 2024

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Python 95 9 Updated Sep 25, 2023

C++ fast hierarchical clustering algorithms

C++ 81 16 Updated Jun 13, 2023

Official implementation of "Separate Anything You Describe"

Python 1,635 118 Updated Oct 25, 2024

Cross-Platform, GPU Accelerated Whisper 🏎️

TypeScript 1,739 76 Updated Feb 27, 2024

Voice Conversion With Just Nearest Neighbors

Python 456 67 Updated Mar 18, 2024

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Shell 45 2 Updated Sep 19, 2024

Track and predict the energy consumption and carbon footprint of training deep learning models.

Python 400 30 Updated Nov 18, 2024

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 71 4 Updated Oct 18, 2023

MeetEval - A meeting transcription evaluation toolkit

Python 78 14 Updated Oct 23, 2024

Repo for the paper "Plug-and-Play Multilingual Few-shot Spoken Words Recognition"

HTML 16 Updated Jul 22, 2023
JavaScript 99 25 Updated Jan 8, 2023

A custom micropython firmware integrating tensorflow lite for microcontrollers and ulab to implement the tensorflow micro examples.

C 183 87 Updated Jun 11, 2024

Behavioral probing of language acquisition models at the lexical and syntactic level

Python 14 1 Updated Jul 17, 2023

Simple Diarization model

Python 42 3 Updated Nov 29, 2023

Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow

Python 181 3 Updated Aug 16, 2024

MicroPython Code to control a 28BYJ-48 stepper motor using ULN2003 IC.

Python 1 Updated May 14, 2023
Next