A manifold learning perspective on representation learning: Learning decoder and representations without an encoder

Publication: Contribution to journal › Journal article › Research › peer-reviewed

Standard

A manifold learning perspective on representation learning: Learning decoder and representations without an encoder. / Schuster, Viktoria; Krogh, Anders.

In: Entropy, Vol. 23, No. 11, 1403, 11.2021.


Harvard

Schuster, V & Krogh, A 2021, 'A manifold learning perspective on representation learning: Learning decoder and representations without an encoder', Entropy, bind 23, nr. 11, 1403. https://doi.org/10.3390/e23111403

APA

Schuster, V., & Krogh, A. (2021). A manifold learning perspective on representation learning: Learning decoder and representations without an encoder. Entropy, 23(11), [1403]. https://doi.org/10.3390/e23111403

Vancouver

Schuster V, Krogh A. A manifold learning perspective on representation learning: Learning decoder and representations without an encoder. Entropy. 2021 Nov;23(11):1403. https://doi.org/10.3390/e23111403

Author

Schuster, Viktoria; Krogh, Anders. / A manifold learning perspective on representation learning: Learning decoder and representations without an encoder. In: Entropy. 2021; Vol. 23, No. 11.

Bibtex

@article{864aba41d2324f6dbf931ef93537832a,
title = "A manifold learning perspective on representation learning: Learning decoder and representations without an encoder",
abstract = "Autoencoders are commonly used in representation learning. They consist of an encoder and a decoder, which provide a straightforward method to map n-dimensional data in input space to a lower m-dimensional representation space and back. The decoder itself defines an m-dimensional manifold in input space. Inspired by manifold learning, we showed that the decoder can be trained on its own by learning the representations of the training samples along with the decoder weights using gradient descent. A sum-of-squares loss then corresponds to optimizing the manifold to have the smallest Euclidean distance to the training samples, and similarly for other loss functions. We derived expressions for the number of samples needed to specify the encoder and decoder and showed that the decoder generally requires much fewer training samples to be well-specified compared to the encoder. We discuss the training of autoencoders in this perspective and relate it to previous work in the field that uses noisy training examples and other types of regularization. On the natural image data sets MNIST and CIFAR10, we demonstrated that the decoder is much better suited to learn a low-dimensional representation, especially when trained on small data sets. Using simulated gene regulatory data, we further showed that the decoder alone leads to better generalization and meaningful representations. Our approach of training the decoder alone facilitates representation learning even on small data sets and can lead to improved training of autoencoders. We hope that the simple analyses presented will also contribute to an improved conceptual understanding of representation learning.",
keywords = "Autoencoders, Manifold learning, Neural networks, Representation learning",
author = "Viktoria Schuster and Anders Krogh",
note = "Publisher Copyright: {\textcopyright} 2021 by the authors. Licensee MDPI, Basel, Switzerland.",
year = "2021",
month = nov,
doi = "10.3390/e23111403",
language = "English",
volume = "23",
journal = "Entropy",
issn = "1099-4300",
publisher = "MDPI AG",
number = "11",

}

RIS

TY - JOUR

T1 - A manifold learning perspective on representation learning

T2 - Learning decoder and representations without an encoder

AU - Schuster, Viktoria

AU - Krogh, Anders

N1 - Publisher Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland.

PY - 2021/11

Y1 - 2021/11

N2 - Autoencoders are commonly used in representation learning. They consist of an encoder and a decoder, which provide a straightforward method to map n-dimensional data in input space to a lower m-dimensional representation space and back. The decoder itself defines an m-dimensional manifold in input space. Inspired by manifold learning, we showed that the decoder can be trained on its own by learning the representations of the training samples along with the decoder weights using gradient descent. A sum-of-squares loss then corresponds to optimizing the manifold to have the smallest Euclidean distance to the training samples, and similarly for other loss functions. We derived expressions for the number of samples needed to specify the encoder and decoder and showed that the decoder generally requires much fewer training samples to be well-specified compared to the encoder. We discuss the training of autoencoders in this perspective and relate it to previous work in the field that uses noisy training examples and other types of regularization. On the natural image data sets MNIST and CIFAR10, we demonstrated that the decoder is much better suited to learn a low-dimensional representation, especially when trained on small data sets. Using simulated gene regulatory data, we further showed that the decoder alone leads to better generalization and meaningful representations. Our approach of training the decoder alone facilitates representation learning even on small data sets and can lead to improved training of autoencoders. We hope that the simple analyses presented will also contribute to an improved conceptual understanding of representation learning.

AB - Autoencoders are commonly used in representation learning. They consist of an encoder and a decoder, which provide a straightforward method to map n-dimensional data in input space to a lower m-dimensional representation space and back. The decoder itself defines an m-dimensional manifold in input space. Inspired by manifold learning, we showed that the decoder can be trained on its own by learning the representations of the training samples along with the decoder weights using gradient descent. A sum-of-squares loss then corresponds to optimizing the manifold to have the smallest Euclidean distance to the training samples, and similarly for other loss functions. We derived expressions for the number of samples needed to specify the encoder and decoder and showed that the decoder generally requires much fewer training samples to be well-specified compared to the encoder. We discuss the training of autoencoders in this perspective and relate it to previous work in the field that uses noisy training examples and other types of regularization. On the natural image data sets MNIST and CIFAR10, we demonstrated that the decoder is much better suited to learn a low-dimensional representation, especially when trained on small data sets. Using simulated gene regulatory data, we further showed that the decoder alone leads to better generalization and meaningful representations. Our approach of training the decoder alone facilitates representation learning even on small data sets and can lead to improved training of autoencoders. We hope that the simple analyses presented will also contribute to an improved conceptual understanding of representation learning.

KW - Autoencoders

KW - Manifold learning

KW - Neural networks

KW - Representation learning

UR - http://www.scopus.com/inward/record.url?scp=85117901514&partnerID=8YFLogxK

U2 - 10.3390/e23111403

DO - 10.3390/e23111403

M3 - Journal article

C2 - 34828101

AN - SCOPUS:85117901514

VL - 23

JO - Entropy

JF - Entropy

SN - 1099-4300

IS - 11

M1 - 1403

ER -
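
Illustrative example

The abstract describes training the decoder alone by learning per-sample representations jointly with the decoder weights via gradient descent, where a sum-of-squares loss fits the decoder's manifold to the training samples in the Euclidean sense. The following is a minimal PyTorch sketch of that idea, not the authors' implementation; the architecture, optimizer, learning rate, and random placeholder data are assumptions made only so the example runs.

```python
# Minimal sketch (not the authors' code): decoder-only representation learning.
# Per-sample representations z are free parameters optimized jointly with the
# decoder weights by gradient descent under a sum-of-squares reconstruction loss.
import torch
import torch.nn as nn

n_samples, n_dim, m_dim = 256, 784, 2        # sample count, input dim n, latent dim m (assumed)
x = torch.rand(n_samples, n_dim)             # placeholder for real training data

decoder = nn.Sequential(                     # maps an m-dim representation back to input space
    nn.Linear(m_dim, 128), nn.ReLU(),
    nn.Linear(128, n_dim),
)
z = nn.Parameter(torch.randn(n_samples, m_dim) * 0.01)   # learnable representation per sample

opt = torch.optim.Adam(list(decoder.parameters()) + [z], lr=1e-2)

for epoch in range(200):
    opt.zero_grad()
    recon = decoder(z)                               # decode all current representations
    loss = ((recon - x) ** 2).sum(dim=1).mean()      # sum-of-squares loss per sample
    loss.backward()                                  # gradients reach both decoder weights and z
    opt.step()

# After training, z holds the learned low-dimensional representations and the decoder
# defines an m-dimensional manifold in input space. A new sample could be represented
# by optimizing a fresh z vector with the decoder weights kept frozen.
```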
