词条 | Wake-sleep algorithm |
释义 |
The wake-sleep algorithm[1] is an unsupervised learning algorithm for a stochastic multilayer neural network. The algorithm adjusts the parameters so as to produce a good density estimator.[2] There are two learning phases, the “wake” phase and the “sleep” phase, which are performed alternately.[3] It was first designed as a model for brain functioning using variational Bayesian learning. After that, the algorithm was adapted to machine learning. It can be viewed as a way to train a Helmholtz Machine[4][5]. It can also be used in Deep Belief Networks(DBN). DescriptionThe wake-sleep algorithm is visualized as a stack of layers containing representations of data.[6] Layers above represent data from the layer below it. Actual data is placed below the bottom layer, causing layers on top of it to become gradually more abstract. Between each pair of layers there is a recognition weight and generative weight, which are trained to improve reliability during the algorithm runtime.[7] The wake-sleep algorithm is convergent[8] and can be stochastic[9] if alternated appropriately. TrainingTraining consists of two phases – the “wake” phase and the “sleep” phase. The "wake" phaseNeurons are fired by recognition connections (from what would be input to what would be output). Generative connections (leading from outputs to inputs) are then modified to increase probability that they would recreate the correct activity in the layer below – closer to actual data from sensory input.[10] The "sleep" phaseThe process is reversed in the “sleep” phase – neurons are fired by generative connections while recognition connections are being modified to increase probability that they would recreate the correct activity in the layer above – further to actual data from sensory input.[11] Potential risksVariational Bayesian learning is based on probabilities. There is a chance that an approximation is performed with mistakes, damaging further data representations. Another downside pertains to complicated or corrupted data samples, making it difficult to infer a representational pattern. The wake-sleep algorithm has been suggested not to be powerful enough for the layers of the inference network in order to recover a good estimator of the posterior distribution of latent variables.[12] See also
References1. ^{{Cite journal|title = The wake-sleep algorithm for unsupervised neural networks|journal = Science|date = 1995-05-26|pages = 1158–1161|volume = 268|issue = 5214|doi = 10.1126/science.7761831|first = Geoffrey E.|last = Hinton|first2 = Peter|last2 = Dayan|first3 = Brendan J.|last3 = Frey|first4 = Radford|last4 = Neal|bibcode = 1995Sci...268.1158H}} 2. ^{{Cite web|url = http://papers.nips.cc/paper/1153-does-the-wake-sleep-algorithm-produce-good-density-estimators.pdf|title = Does the wake-sleep algorithm produce good density estimators?|date = 1996-05-01|publisher = Advances in Neural Information Processing Systems |volume = 8|last = Frey|first = Brendan J.|last2 = Hinton|first2 = Geoffrey E.|last3 = Dayan|first3 = Peter}} 3. ^{{Cite journal|title = Models of MT and MST areas using wake–sleep algorithm|journal = Neural Networks|date = 2004-04-01|pages = 339–351|volume = 17|issue = 3|doi = 10.1016/j.neunet.2003.07.004|pmid = 15037352|first = Katsuki|last = Katayama|first2 = Masataka|last2 = Ando|first3 = Tsuyoshi|last3 = Horiguchi}} 4. ^{{Cite journal|title = The wake-sleep algorithm for unsupervised neural networks|journal = Science|date = 1995-05-26|pages = 1158–1161|volume = 268|issue = 5214|doi = 10.1126/science.7761831|first = Geoffrey E.|last = Hinton|first2 = Peter|last2 = Dayan|first3 = Brendan J.|last3 = Frey|first4 = Radford|last4 = Neal|bibcode = 1995Sci...268.1158H}} 5. ^{{Cite journal|title = Varieties of Helmholtz Machine|journal = Neural Networks|date = 1996-11-01|pages = 1385–1403|volume = 9|series = Four Major Hypotheses in Neuroscience|issue = 8|doi = 10.1016/S0893-6080(96)00009-3|first = Peter|last = Dayan|first2 = Geoffrey E.|last2 = Hinton|citeseerx = 10.1.1.29.1677}} 6. ^{{Cite web|url = http://www.iro.umontreal.ca/~lisa/seminaires/25-01-2007.ppt|title = Wake-sleep algorithm for representational learning|date = 2007-01-25|accessdate = 2011-11-01|website = |publisher = University of Montreal|last = Maei|first = Hamid Reza}} 7. ^{{Cite web|url = http://www.cs.toronto.edu/~radford/ftp/ws-fa.pdf|title = Factor Analysis Using Delta Rules Wake-Sleep Learning|date = 1996-11-24|accessdate = 2015-11-01|website = |publisher = University of Toronto|last = Neal|first = Radford M.|last2 = Dayan|first2 = Peter}} 8. ^{{Cite web|url = http://www.ism.ac.jp/~shiro/papers/books/nips1998.pdf|title = Convergence of The Wake-Sleep Algorithm|date = |accessdate = 2015-11-01|website = |publisher = The Institute of Statistical Mathematics|last = Ikeda|first = Shiro|last2 = Amari|first2 = Shun-ichi|last3 = Nakahara|first3 = Hiroyuki}} 9. ^{{Cite book|title = A framework for a discrete valued Helmholtz machine|url = http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=819540|journal = Artificial Neural Networks, 1999. ICANN 99. Ninth International Conference on (Conf. Publ. No. 470)|date = 1999-01-01|pages = 49–54 vol.1|volume = 1|doi = 10.1049/cp:19991083|first = R.W.H.|last = Dalzell|first2 = A.F.|last2 = Murray|isbn = 0 85296 721 7}} 10. ^{{Cite web|url = http://www.gatsby.ucl.ac.uk/~dayan/papers/hdfn95.pdf|title = The wake-sleep algorithm for unsupervised neural networks|date = 1995-04-03|accessdate = 2015-11-01|website = |publisher = |last = Hinton|first = Geoffrey|last2 = Dayan|first2 = Peter|last3 = Frey|first3 = Brendan J|last4 = Neal|first4 = Radford M}} 11. ^{{Cite web|url = http://www.gatsby.ucl.ac.uk/~dayan/papers/d2000a.pdf|title = Helmholtz Machines and Wake-Sleep Learning|date = |accessdate = 2015-11-01|website = |publisher = |last = Dayan|first = Peter}} 12. ^{{cite arxiv|title = Reweighted Wake-Sleep|eprint= 1406.2751|date = 2014-06-10|first = Jörg|last = Bornschein|first2 = Yoshua|last2 = Bengio|class= cs.LG}} 1 : Machine learning algorithms |
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。