“AlexNet”的意思、由来-开放百科全书

AlexNet competed in the ImageNet Large Scale Visual Recognition Challenge^[5] on September 30, 2012. The network achieved a top-5 error of 15.3%, more than 10.8 percentage points lower than that of the runner up. The original paper's primary result was that the depth of the model was essential for its high performance, which was computationally expensive, but made feasible due to the utilization of graphics processing units (GPUs) during training.^[5]

Historic context

AlexNet was not the first fast GPU-implementation of a CNN to win an image recognition contest. A CNN on GPU by K. Chellapilla et al. (2006) was 4 times faster than an equivalent implementation on CPU.^[6] A deep CNN of Dan Ciresan et al. (2011) at IDSIA was already 60 times faster^[7] and achieved superhuman performance in August 2011.^[8] Between May 15, 2011 and September 10, 2012, their CNN won no less than four image competitions.^[9]^[10] They also significantly improved on the best performance in the literature for multiple image databases.^[11]

According to the AlexNet paper,^[5] Ciresan's earlier net is "somewhat similar." Both were originally written with CUDA to run with GPU support. In fact, both are actually just variants of the CNN designs introduced by Yann LeCun et al. (1989)^[12]^[13] who applied the backpropagation algorithm to a variant of Kunihiko Fukushima's original CNN architecture called "neocognitron."^[14]^[15] The architecture was later modified by J. Weng's method called max-pooling.^[16]^[10]

In 2015, AlexNet was outperformed by Microsoft's very deep CNN with over 100 layers, which won the ImageNet 2015 contest.^[17]

Network design

AlexNet contained eight layers; the first five were convolutional layers, some of them followed by max-pooling layers, and the last three were fully connected layers.^[5] It used the non-saturating ReLU activation function, which showed improved training performance over tanh and sigmoid.^[5]

Influence

AlexNet is considered one of the most influential papers published in computer vision, having spurred many more papers published employing CNNs and GPUs to accelerate deep learning.^[18] {{As of|2018}}, the Alexnet paper has been cited over 30,000 times.

Alex Krizhevsky

Alex Krizhevsky (born in the Ukraine, raised in Canada) is a computer scientist most noted for his work on artificial neural networks and deep learning.

Shortly after having won the ImageNet challenge 2012 through AlexNet, he and his colleagues sold their startup DNN Research Inc. to Google.^[1]

Krizhevsky left Google in September 2017 when he lost interest in the work.^[1] At the company Dessa, Krizhevsky will advise and help research new deep-learning techniques.^[1]

Many of his numerous papers on machine learning and computer vision are frequently cited by other researchers.^[19]

References

1. ^¹²³⁴{{cite web|website=Quartz|author=Dave Gershgorn|title=The inside story of how AI got good enough to dominate Silicon Valley|url=https://qz.com/1307091/the-inside-story-of-how-ai-got-good-enough-to-dominate-silicon-valley/|date=18 June 2018|accessdate=5 October 2018}}
2. ^{{cite web|url=https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-possibly-the-world/|title=The data that transformed AI research—and possibly the world}}
3. ^{{cite web|url=http://www.image-net.org/challenges/LSVRC/2012/results.html|title=ILSVRC2012 Results}}
4. ^{{cite web|url=http://www.image-net.org/challenges/LSVRC/2012/supervision.pdf|title=ImageNet Classification with Deep Convolutional Neural Networks|last=Krizhevsky|first=Alex|accessdate=17 November 2013}}
5. ^¹²³⁴{{Cite journal|last=Krizhevsky|first=Alex|last2=Sutskever|first2=Ilya|last3=Hinton|first3=Geoffrey E.|date=2017-05-24|title=ImageNet classification with deep convolutional neural networks|url=https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf|journal=Communications of the ACM|volume=60|issue=6|pages=84–90|doi=10.1145/3065386|issn=0001-0782|via=}}
6. ^{{cite book|author1=Kumar Chellapilla|author2=Sid Puri|author3=Patrice Simard|editor1-last=Lorette|editor1-first=Guy|title=Tenth International Workshop on Frontiers in Handwriting Recognition|date=2006|publisher=Suvisoft|chapter-url=https://hal.inria.fr/inria-00112631/document|archivedate=2016-03-14|chapter=High Performance Convolutional Neural Networks for Document Processing}}
7. ^{{cite journal|last=Ciresan|first=Dan|author2=Ueli Meier |author3=Jonathan Masci |author4=Luca M. Gambardella |author5=Jurgen Schmidhuber |title=Flexible, High Performance Convolutional Neural Networks for Image Classification|journal=Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence-Volume Volume Two|year=2011|volume=2|pages=1237–1242|url=http://www.idsia.ch/~juergen/ijcai2011.pdf|accessdate=17 November 2013}}
8. ^{{Cite web|url=http://benchmark.ini.rub.de/?section=gtsrb&subsection=results|title=IJCNN 2011 Competition result table|website=OFFICIAL IJCNN2011 COMPETITION|language=en-US|access-date=2019-01-14|date=2010}}
9. ^{{Cite web|url=http://people.idsia.ch/~juergen/computer-vision-contests-won-by-gpu-cnns.html|last1=Schmidhuber|first1=Jürgen|title=History of computer vision contests won by deep CNNs on GPU|language=en-US|access-date=14 January 2019|date=17 March 2017}}
10. ^¹{{cite journal|last1=Schmidhuber|first1=Jürgen|title=Deep Learning|journal=Scholarpedia|url=http://www.scholarpedia.org/article/Deep_Learning|date=2015|volume=10|issue=11|pages=1527–54|pmid=16764513|doi=10.1162/neco.2006.18.7.1527|citeseerx=10.1.1.76.1541}}
11. ^{{cite book |last1=Ciresan |first1=Dan |first2=Ueli |last2=Meier |first3=Jürgen |last3=Schmidhuber |title=Multi-column deep neural networks for image classification |journal=2012 IEEE Conference on Computer Vision and Pattern Recognition |date=June 2012 |pages=3642–3649 |doi=10.1109/CVPR.2012.6248110 |arxiv=1202.2745 |isbn=978-1-4673-1226-4 |oclc=812295155 |publisher=Institute of Electrical and Electronics Engineers (IEEE) |location=New York, NY|citeseerx=10.1.1.300.3283 }}
12. ^Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, Backpropagation Applied to Handwritten Zip Code Recognition; AT&T Bell Laboratories
13. ^{{cite journal|last=LeCun|first=Yann|author2=Léon Bottou |author3=Yoshua Bengio |author4=Patrick Haffner |title=Gradient-based learning applied to document recognition|journal=Proceedings of the IEEE|year=1998|volume=86|issue=11|pages=2278–2324|url=http://yann.lecun.com/exdb/publis/pdf/lecun-01a.pdf|accessdate=October 7, 2016|doi=10.1109/5.726791|citeseerx=10.1.1.32.9552}}
14. ^{{cite journal | last1 = Fukushima | first1 = K. | year = 2007 | title = Neocognitron | url = | journal = Scholarpedia | volume = 2 | issue = 1| page = 1717 | doi=10.4249/scholarpedia.1717}}
15. ^{{cite journal|last=Fukushima|first=Kunihiko|title=Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position|journal=Biological Cybernetics|year=1980|volume=36|issue=4|pages=193–202|url=http://www.cs.princeton.edu/courses/archive/spr08/cos598B/Readings/Fukushima1980.pdf|accessdate=16 November 2013|doi=10.1007/BF00344251|pmid=7370364}}
16. ^{{cite journal |first1=J |last1=Weng |first2=N |last2=Ahuja |first3=TS |last3=Huang |title=Learning recognition and segmentation of 3-D objects from 2-D images |journal=Proc. 4th International Conf. Computer Vision |year=1993 |pages=121–128 }}
17. ^{{cite journal|last1=He|first1=Kaiming|last2=Zhang|first2=Xiangyu|last3=Ren|first3=Shaoqing|last4=Sun|first4=Jian|title=Deep Residual Learning for Image Recognition.|journal= 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)|date=2016}}
18. ^{{Cite web|url=https://adeshpande3.github.io/adeshpande3.github.io/The-9-Deep-Learning-Papers-You-Need-To-Know-About.html|title=The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)|last=Deshpande|first=Adit|website=adeshpande3.github.io|access-date=2018-12-04}}
19. ^{{cite web | title=Alex Krizhevsky | website=Google Scholar Citations | url=https://scholar.google.com/citations?user=xegzhJcAAAAJ}}