iver56
🎯
Writing Python code every day
  • Nomono
  • Trondheim, Norway
  • X @iver56

Organizations

@ninjadev


Starred repositories

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

Python 108 13 Updated Mar 24, 2023

Tools to work with IAMF

C++ 17 7 Updated Oct 28, 2024

This is the repository for the speech enhancement model SyncFormer

8 Updated Oct 12, 2024

PyTorch native quantization and sparsity for training and inference

Python 1,515 154 Updated Nov 1, 2024

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

Python 79 8 Updated Oct 5, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,518 227 Updated Oct 5, 2024
Python 41 1 Updated Oct 19, 2024

Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural network models (and their initializations) to make them easier to…

Python 64 4 Updated Aug 7, 2024

o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…

Python 2,758 286 Updated Oct 2, 2024

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 61 3 Updated Sep 20, 2024

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 230 7 Updated Oct 19, 2024

Filament is a real-time physically based rendering engine for Android, iOS, Windows, Linux, macOS, and WebGL2

C++ 17,779 1,887 Updated Oct 31, 2024

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,285 51 Updated Sep 27, 2024

High-Fidelity Neural Phonetic Posteriorgrams

Python 91 6 Updated Sep 19, 2024

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 111 5 Updated Sep 3, 2024

Less than 100 Kilobytes. Works for Android 5.1 and above

C 2,058 133 Updated Oct 6, 2024

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Python 85 9 Updated Oct 25, 2024

Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"

Python 10 1 Updated Sep 19, 2024

Strict separation of config from code.

Python 2,813 194 Updated Jan 20, 2024

[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Python 65 1 Updated Sep 21, 2024

Apollo audio restoration Colab fork

Python 13 2 Updated Sep 27, 2024

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 82 4 Updated Sep 19, 2024

The official code for the SALMon🍣 benchmark

Python 39 Updated Sep 15, 2024

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 222 32 Updated Jul 25, 2024

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 49 3 Updated Oct 29, 2024

TS-BSmamba2: A Two-Stage Band-Split Mamba-2 Network for Music Separation

Python 35 Updated Sep 16, 2024

Some traditional algorithms used for sound source localization (DOA estimation) of speech signals

MATLAB 374 84 Updated Jun 30, 2021

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,501 166 Updated Sep 24, 2024