iver56

🎯

Writing Python code every day

Iver Jordal iver56

🎯

Writing Python code every day

Machine learning, audio/music tech, computer vision, demoscene, web technology, games, startups. I mainly program in Python, Go and C

312 followers · 140 following

Nomono
Trondheim, Norway
@iver56

Achievements

x3 x3 x4

Achievements

x3 x3 x4

Organizations

Starred repositories

Audio-WestlakeU / McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

Python 108 13 Updated Mar 24, 2023

AOMediaCodec / iamf-tools

Tools to work with IAMF

C++ 17 7 Updated Oct 28, 2024

alibabasglab / GatedFormer

This is the repository for the speech enhancement model SyncFormer

8 Updated Oct 12, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 1,515 154 Updated Nov 1, 2024

skirdey / voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

Python 79 8 Updated Oct 5, 2024

nttcslab-sp-admin / mamba-diarization

11 1 Updated Oct 10, 2024

apple / ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,518 227 Updated Oct 5, 2024

xuyaoxun / MuCodec

Python 41 1 Updated Oct 19, 2024

google-deepmind / dks

Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural network models (and their initializations) to make them easier to…

Python 64 4 Updated Aug 7, 2024

Doriandarko / o1-engineer

o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…

Python 2,758 286 Updated Oct 2, 2024

hhguo / SoCodec

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 61 3 Updated Sep 20, 2024

haidog-yaqub / EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 230 7 Updated Oct 19, 2024

google / filament

Filament is a real-time physically based rendering engine for Android, iOS, Windows, Linux, macOS, and WebGL2

C++ 17,779 1,887 Updated Oct 31, 2024

menyifang / MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,285 51 Updated Sep 27, 2024

interactiveaudiolab / ppgs

High-Fidelity Neural Phonetic Posteriorgrams

Python 91 6 Updated Sep 19, 2024

IDRnD / ReDimNet

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 111 5 Updated Sep 3, 2024

VadimBoev / FlappyBird

Less than 100 Kilobytes. Works for Android 5.1 and above

C 2,058 133 Updated Oct 6, 2024

Audio-WestlakeU / FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Python 85 9 Updated Oct 25, 2024

SarthakYadav / axlstm-official

Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"

Python 10 1 Updated Sep 19, 2024

HBNetwork / python-decouple

Strict separation of config from code.

Python 2,813 194 Updated Jan 20, 2024

XiangZ-0 / HiT-SR

[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Python 65 1 Updated Sep 21, 2024

jarredou / Apollo-Colab-Inference

Apollo audio restoration Colab fork

Python 13 2 Updated Sep 27, 2024

jrgillick / laughter-detection

Python 222 48 Updated Jul 25, 2024

Aria-K-Alethia / BigCodec

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 82 4 Updated Sep 19, 2024

slp-rl / salmon

The official code for the SALMon🍣 benchmark

Python 39 Updated Sep 15, 2024

Edresson / VoiceSplit

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 222 32 Updated Jul 25, 2024

WangHelin1997 / SoloAudio

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 49 3 Updated Oct 29, 2024

baijinglin / TS-BSmamba2

TS-BSmamba2: A TWO-STAGE BAND-SPLIT MAMBA-2 NETWORK FOR MUSIC SEPARATION

Python 35 Updated Sep 16, 2024

WenzheLiu-Speech / sound-source-localization-algorithm_DOA_estimation

关于语音信号声源定位DOA估计所用的一些传统算法

MATLAB 374 84 Updated Jun 30, 2021

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,501 166 Updated Sep 24, 2024