Skip to content

abdouaziz/wavautoencoder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Masked WavAutoEncoder

This repository is an implementation of masked wavautoencoder based on Transformer architecture for self supervised speech representation .

Install

$ pip install wav-autoencoder

Usage

Simple example for using the model

import torch
from wav_autoencoder import WavAutoEncoderConfig ,  WavAutoEncoderModel 

>> config = WavAutoEncoderConfig()
>> model = WavAutoEncoderModel(config)
>> wav_signal = torch.randn(2, 1, 16000)
>> outputs = model(wav_signal)
>> print(outputs.shape)

Todo

Citations

@misc{
  title  = {WavAutoencoder: A Self-Supervised Framework for Learning Audio Representations},
  author = {Abdou Aziz DIOP},
  year   = {2022}
}