Self-improving generative artificial neural network for pseudorehearsal incremental class learning

Diego Mellado, Carolina Saavedra, Steren Chabert, Romina Torres, Rodrigo Salas

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Deep learning models are part of the family of artificial neural networks and, as such, they suffer catastrophic interference when learning sequentially. In addition, the greater number of these models have a rigid architecture which prevents the incremental learning of new classes. To overcome these drawbacks, we propose the Self-Improving Generative Artificial Neural Network (SIGANN), an end-to-end deep neural network system which can ease the catastrophic forgetting problem when learning new classes. In this method, we introduce a novel detection model that automatically detects samples of new classes, and an adversarial autoencoder is used to produce samples of previous classes. This system consists of three main modules: a classifier module implemented using a Deep Convolutional Neural Network, a generator module based on an adversarial autoencoder, and a novelty-detection module implemented using an OpenMax activation function. Using the EMNIST data set, the model was trained incrementally, starting with a small set of classes. The results of the simulation show that SIGANN can retain previous knowledge while incorporating gradual forgetfulness of each learning sequence at a rate of about 7% per training step. Moreover, SIGANN can detect new classes that are hidden in the data with a median accuracy of 43% and, therefore, proceed with incremental class learning.

Original languageEnglish
Article number206
JournalAlgorithms
Volume12
Issue number10
DOIs
StatePublished - 2019
Externally publishedYes

Keywords

  • Artificial neural networks
  • Catastrophic interference
  • Deep learning
  • Generative neural networks
  • Incremental learning
  • Novelty detection

Fingerprint

Dive into the research topics of 'Self-improving generative artificial neural network for pseudorehearsal incremental class learning'. Together they form a unique fingerprint.

Cite this