Optimizing Reusable Knowledge for Continual Learning via Metalearning

This repository contains the code for the paper Optimizing Reusable Knowledge for Continual Learning via Metalearning.

Paper

When learning tasks over time, artificial neural networks suffer from a problem known as Catastrophic Forgetting (CF). This happens when the weights of a network are overwritten during the training of a new task causing forgetting of old information. To address this issue, we propose MetA Reusable Knowledge or MARK, a new method that fosters weight reusability instead of overwriting when learning a new task. Specifically, MARK keeps a set of shared weights among tasks. We envision these shared weights as a common Knowledge Base (KB) that is not only used to learn new tasks, but also enriched with new knowledge as the model learns new tasks. Key components behind MARK are two-fold. On the one hand, a metalearning approach provides the key mechanism to incrementally enrich the KB with new knowledge and to foster weight reusability among tasks. On the other hand, a set of trainable masks provides the key mechanism to selectively choose from the KB relevant weights to solve each task.

Main Idea

A schematic view of our proposal is shown in the following image:

[Figure: MARK architecture diagram]

The flow of information in MARK is as follows. Input X_i goes into F_t to extract the representation F_t(X_i). This representation is then used by M_t to produce the set of masks that condition each of the blocks in the KB. The same input X_i enters the mask-conditioned KB, leading to the vector F_KB(X_i) used by the classification head. Finally, classifier C_t generates the model prediction, where t is the task ID associated with input X_i.

The motivation behind this flow of information is that MARK learns to reuse information stored in the KB. Through M_t, MARK weights the information in the KB, amplifying relevant features and suppressing irrelevant ones.
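
To make this flow concrete, below is a minimal PyTorch sketch of the forward pass. The class and module names (MARKSketch, feature_extractors, mask_generators) are illustrative assumptions, as is the use of simple fully-connected layers; the paper uses stacked KB blocks, and the repository's actual modules may differ.

# Illustrative sketch of MARK's forward flow -- not the repo's actual code.
import torch
import torch.nn as nn

class MARKSketch(nn.Module):
    """Illustrative MARK-style model: per-task F_t, M_t, C_t and a shared KB."""

    def __init__(self, in_dim, kb_dim, num_tasks, num_classes):
        super().__init__()
        # F_t: per-task feature extractors producing F_t(X_i)
        self.feature_extractors = nn.ModuleList(
            [nn.Linear(in_dim, kb_dim) for _ in range(num_tasks)]
        )
        # M_t: per-task mask generators conditioning the KB output
        self.mask_generators = nn.ModuleList(
            [nn.Sequential(nn.Linear(kb_dim, kb_dim), nn.Sigmoid())
             for _ in range(num_tasks)]
        )
        # Shared Knowledge Base (a single block here; the paper stacks several)
        self.kb = nn.Linear(in_dim, kb_dim)
        # C_t: per-task classification heads
        self.classifiers = nn.ModuleList(
            [nn.Linear(kb_dim, num_classes) for _ in range(num_tasks)]
        )

    def forward(self, x, t):
        feats = self.feature_extractors[t](x)    # F_t(X_i)
        mask = self.mask_generators[t](feats)    # masks from M_t
        kb_out = self.kb(x) * mask               # mask-conditioned KB output
        return self.classifiers[t](kb_out)       # prediction from C_t

model = MARKSketch(in_dim=32, kb_dim=64, num_tasks=5, num_classes=10)
logits = model(torch.randn(8, 32), t=0)   # batch of 8 inputs for task 0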

To encourage the reuse of knowledge, the KB must contain knowledge that is relevant across tasks. To this end, MARK uses a metalearning strategy that improves the KB's ability to generalize to future tasks.
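
As a rough illustration of this strategy, the sketch below performs a schematic first-order meta-update of the shared KB, reusing the MARKSketch model from the sketch above. The support/query split, the restriction of updates to the KB weights, and the step sizes are assumptions made for clarity, not the exact procedure from the paper or this repository.

# Schematic first-order meta-update of the shared KB (illustrative only).
import copy
import torch

def meta_update_kb(model, support_batch, query_batch, t,
                   inner_lr=0.01, outer_lr=0.001):
    """One schematic meta-step: adapt a KB copy on support data for task t,
    then update the real KB using the adapted copy's query loss."""
    loss_fn = torch.nn.CrossEntropyLoss()

    # Inner step: adapt a copy of the model on the support batch of task t.
    fast_model = copy.deepcopy(model)
    x_s, y_s = support_batch
    support_loss = loss_fn(fast_model(x_s, t), y_s)
    grads = torch.autograd.grad(support_loss, list(fast_model.kb.parameters()))
    with torch.no_grad():
        for p, g in zip(fast_model.kb.parameters(), grads):
            p -= inner_lr * g

    # Outer step: evaluate the adapted KB on held-out query data and move
    # the real KB's weights in a direction that generalizes (first-order).
    x_q, y_q = query_batch
    query_loss = loss_fn(fast_model(x_q, t), y_q)
    q_grads = torch.autograd.grad(query_loss, list(fast_model.kb.parameters()))
    with torch.no_grad():
        for p, g in zip(model.kb.parameters(), q_grads):
            p -= outer_lr * g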

More details in the paper.

Code

To run the code, first install the libraries listed in requirement.txt. Then, run the following command:

python main.py --config ./configs_file.yml

where "./configs_file.yml" is the corresponding configuration file. For example, to run experiments in CIFAR100:

python main.py --config ./configs/config_cifar100.yml

The configuration files (.yml) set the hyperparameters of each experiment: for example, the number of epochs, whether to use Meta-Learning or the Mask Functions, and whether to use a pre-trained ResNet as F_t.
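
As an illustration only, a configuration file might look like the following. The key names here are hypothetical and may not match the repo's actual schema; check the files under ./configs/ (e.g., ./configs/config_cifar100.yml) for the real options.

# Hypothetical example -- these key names are illustrative and may not
# match the repo's actual schema; see ./configs/config_cifar100.yml.
dataset: cifar100
epochs: 100
use_meta_learning: true
use_mask_functions: true
pretrained_resnet: true   # use a pre-trained ResNet as F_t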

If you have any questions, do not hesitate to write to me.

To cite our work:

@article{hurtado2021optimizing,
  title={Optimizing Reusable Knowledge for Continual Learning via Metalearning},
  author={Hurtado, Julio and Raymond-Saez, Alain and Soto, Alvaro},
  journal={arXiv preprint arXiv:2106.05390},
  year={2021}
}
