Skip to content

Improving generalization via style transfer-based data augmentation: Novel regularization method

Notifications You must be signed in to change notification settings

AgaMiko/neural-based-data-augmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

Improving generalization via style transfer-based data augmentation: Novel regularization method

Generated skin lesions: an example

Introduction

Currently, deep learning algorithms are considered as state-of-the-art in many classification tasks, and yet the problem of weak generalization is very common, widely mentioned, and still up-to-date.

The present paper focuses most on the data augmentation. In our method, new images are synthetized with neural style transfer (NST), and the generated images are then used to train the convolutional neural network (CNN) in order to improve its generalization abilities.
The main contributions of this paper are:

  • The proposition of using neural style transfer for the data augmentation (ST-DA). This approach is presented on the skin lesion case study by transforming a benign skin lesion to a malignant lesion, and tested with dataset enrichment evaluation;
  • Incorporating unlabeled, synthesized data into training by adding pseudo-labels generated by another CNN;
  • Limiting the problem of noisy pseudo-labels in synthetic images used as a CNN training set by using only real images in validation and test sets;
  • Evaluating the ability to enrich the training dataset with artificially generated data with Deep Taylor Decomposition,
  • Proving that the ST-DA method significantly improves the performance and repeatability of training for deep neural networks.

ST-DA

How-to

Short and friendly how-to tutorial will be soon available here

Details

The result and details of the method will be able to be find soon in the original paper here: soon You can check instead our previous papers about data augmentation:

Database

Download

The total databse size is 248 489 unalabeled generated dermoscopic images of skin lesions (224x224 px).

  • Few full-size examples can be found here
  • Database can be download soon here (soon)

If you use this database please star the repository and cite the following paper (soon):

"Improving generalization via style transfer-based data augmentation: Novel regularization method", by Agnieszka Mikołajczyk , Michał Grochowski, Arkadiusz Kwasigroch

Sources

The database was generated using following sources: