Learning a Restricted Boltzmann Machine using biased Monte Carlo sampling

Nicolas Béreux; Aurélien Decelle; Cyril Furtlehner; Beatriz Seoane

doi:10.21468/SciPostPhys.14.3.032

Journal article, SciPost Physics, Year: 2022

Nicolas Béreux

Role: Author

Departamento de Física Teórica

Aurélien Decelle

Role: Author
PersonId: 1079959

Laboratoire Interdisciplinaire des Sciences du Numérique

TAckling the Underspecified

Departamento de Física Teórica

Cyril Furtlehner

Role: Author
PersonId: 838835
IdHAL: cyril-furtlehner

Laboratoire Interdisciplinaire des Sciences du Numérique

TAckling the Underspecified

Beatriz Seoane

Role: Author

Departamento de Física Teórica

Abstract

Restricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice training is often unstable, and it is difficult to assess the quality of the trained models because the dynamics are affected by extremely slow time dependencies. This situation becomes critical when dealing with low-dimensional clustered datasets, where the time required to sample the trained models ergodically becomes computationally prohibitive. In this work, we show that this divergence of Monte Carlo mixing times is related to a phenomenon of phase coexistence, similar to that which occurs in physics near a first-order phase transition. We show that sampling the equilibrium distribution using the Markov chain Monte Carlo method can be dramatically accelerated when using biased sampling techniques, in particular the Tethered Monte Carlo (TMC) method. This sampling technique efficiently solves the problem of evaluating the quality of a given trained model and generating new samples in a reasonable amount of time. Moreover, we show that this sampling technique can also be used to improve the computation of the log-likelihood gradient during training, leading to dramatic improvements in training RBMs with artificial clustered datasets. On real low-dimensional datasets, this new training method fits RBM models with significantly faster relaxation dynamics than those obtained with standard PCD recipes. We also show that TMC sampling can be used to recover the free-energy profile of the RBM. This proves to be extremely useful to compute the probability distribution of a given model and to improve the generation of new decorrelated samples in slow PCD-trained models.
The main limitations of this method are, first, the restriction to effectively low-dimensional datasets and, second, the fact that the Tethered MC method breaks the possibility of performing parallel alternative Monte Carlo updates, which limits the size of the systems we can consider in practice.

Domains

Disordered systems and neural networks [cond-mat.dis-nn]; Machine learning [cs.LG]; Statistical mechanics [cond-mat.stat-mech]

Main file

art_tmc.pdf (5.2 MB)

Origin: Files produced by the author(s)

Contributor: Aurélien Decelle

https://inria.hal.science/hal-03795598

Submitted on: Friday, 4 November 2022, 10:37:49

Last modified on: Friday, 9 June 2023, 15:22:31

Long-term archiving on: Monday, 6 February 2023, 11:20:00

Dates and versions

hal-03795598, version 1 (04-11-2022)

Identifiers

HAL Id: hal-03795598, version 1
arXiv: 2206.01310v2
DOI: 10.21468/SciPostPhys.14.3.032

Cite

Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane. Learning a Restricted Boltzmann Machine using biased Monte Carlo sampling. SciPost Physics, In press, 14, ⟨10.21468/SciPostPhys.14.3.032⟩. ⟨hal-03795598⟩

Collections

CNRS INRIA CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY LISN GS-COMPUTER-SCIENCE LISN-AO
