Table of Content

Demo: https://sony.github.io/DiffRoll/
Paper: https://arxiv.org/abs/2210.05148

Table of Content

Installation
Table of Content
Installation
Training
- Supervised training
- Unsupervised pretraining
  - Step 1: Pretraining on MAESTRO using only piano rolls
  - Step 2
Sampling

Installation

This repo is developed using python==3.8.10, so it is recommended to use python>=3.8.10.

To install all dependencies

pip install -r requirements.txt

Training

Supervised training

python train_spec_roll.py gpus=[0] model.args.kernel_size=9 model.args.spec_dropout=0.1 dataset=MAESTRO dataloader.train.num_workers=4 epochs=2500 download=True

gpus sets which GPU to use. gpus=[k] means device='cuda:k', gpus=2 means DistributedDataParallel (DDP) is used with two GPUs.
model.args.kernel_size sets the kernel size for the ResNet layers in DiffRoll. model.args.kernel_size=9 performs the best according to our experiments.
model.args.spec_dropout sets the dropout rate ($p$ in the paper)
dataset sets the dataset to be trained on. Can be MAESTRO or MAPS.
dataloader.train.num_workers sets the number of workers for train loader.
download should be set to True if you are running the script for the first time to download and setup the dataset automatically. You can set it to False if you already have the dataset downloaded.

The checkpoints and training logs are avaliable at outputs/YYYY-MM-DD/HH-MM-SS/.

To check the progress of training using TensorBoard, you can use the command below

tensorboard --logdir='./outputs'

Unsupervised pretraining

Step 1: Pretraining on MAESTRO using only piano rolls

python train_spec_roll.py gpus=[0] model.args.kernel_size=9 model.args.spec_dropout=1 dataset=MAESTRO dataloader.train.num_workers=4 epochs=2500

model.args.spec_dropout sets the dropout rate ($p$ in the paper). When it is set to 1, it means no spectrograms will be used (all spectrograms dropped to -1)
other arguments are same as Supervised Training.

The pretrained checkpoints are avaliable at outputs/YYYY-MM-DD/HH-MM-SS/ClassifierFreeDiffRoll/version_1/checkpoints.

After this, you can choose one of the options (2A, 2B, or 2C) to continue training below.

Step 2

Choose one of the options below (A, B, or C).

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
config		config
model		model
my_audio		my_audio
task		task
utils		utils
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
LICENSE		LICENSE
README.md		README.md
continue_train_both.py		continue_train_both.py
continue_train_single.py		continue_train_single.py
infer.py		infer.py
requirements.txt		requirements.txt
roll2midi.ipynb		roll2midi.ipynb
sampling.py		sampling.py
test.py		test.py
train_spec_roll.py		train_spec_roll.py
visualization_master.ipynb		visualization_master.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Content

Installation

Training

Supervised training

Unsupervised pretraining

Step 1: Pretraining on MAESTRO using only piano rolls

Step 2

Option A: pre-DiffRoll (p=0.1)

License

sony/DiffRoll

Folders and files

Latest commit

History

Repository files navigation

Table of Content

Installation

Training

Supervised training

Unsupervised pretraining

Step 1: Pretraining on MAESTRO using only piano rolls

Step 2

Option A: pre-DiffRoll (p=0.1)