CUDA: meaning and origin - Open Encyclopedia

The CUDA platform is designed to work with programming languages such as C, C++, and Fortran. This accessibility makes it easier for specialists in parallel programming to use GPU resources, in contrast to prior APIs like Direct3D and OpenGL, which required advanced skills in graphics programming.^[3] Also, CUDA supports programming frameworks such as OpenACC and OpenCL.^[2] When it was first introduced by Nvidia, the name CUDA was an acronym for Compute Unified Device Architecture,^[4] but Nvidia subsequently dropped the use of the acronym.

Background

The graphics processing unit (GPU), as a specialized computer processor, addresses the demands of real-time, high-resolution 3D graphics, which are compute-intensive tasks. By 2012, GPUs had evolved into highly parallel multi-core systems allowing very efficient manipulation of large blocks of data. This design is more effective than general-purpose central processing units (CPUs) for algorithms that process large blocks of data in parallel.
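
As a rough illustration of this data-parallel style, the sketch below simulates a kernel launch on the CPU in plain Python. It is not CUDA code and uses no GPU. The names `scale_kernel`, `launch`, `scale`, `grid_dim` and `block_dim` are hypothetical and chosen for this example. The only part taken from CUDA's execution model is the indexing rule: each logical thread derives one global element index from its block index and thread index.

```python
# CPU-only simulation of a data-parallel kernel launch; no GPU is involved.
# All function and parameter names here are hypothetical, for illustration.

def scale_kernel(block_idx, thread_idx, block_dim, data, out, factor):
    """Body run by one logical thread: it handles exactly one element."""
    i = block_idx * block_dim + thread_idx  # global element index
    if i < len(data):                       # guard: the last block may be partial
        out[i] = data[i] * factor


def launch(kernel, grid_dim, block_dim, *args):
    """Invoke the kernel once per (block, thread) pair, sequentially.

    On a GPU these invocations would run concurrently. Running them in a
    loop gives the same result because no invocation depends on another.
    """
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            kernel(block_idx, thread_idx, block_dim, *args)


def scale(data, factor, block_dim=4):
    """Multiply every element of data by factor, one logical thread each."""
    out = [0.0] * len(data)
    grid_dim = (len(data) + block_dim - 1) // block_dim  # ceiling division
    launch(scale_kernel, grid_dim, block_dim, data, out, factor)
    return out


if __name__ == "__main__":
    print(scale([1.0, 2.0, 3.0, 4.0, 5.0], 2.0))
```

Because every element is computed independently, the work can be split across as many hardware threads as are available, which is the property that makes such algorithms a good fit for GPUs.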

Programming abilities

The CUDA platform is accessible to software developers through CUDA-accelerated libraries, compiler directives such as OpenACC, and extensions to industry-standard programming languages including C, C++ and Fortran. C/C++ programmers can use 'CUDA C/C++', compiled with nvcc, Nvidia's LLVM-based C/C++ compiler.^[5] Fortran programmers can use 'CUDA Fortran', compiled with the PGI CUDA Fortran compiler from The Portland Group.

In addition to libraries, compiler directives, CUDA C/C++ and CUDA Fortran, the CUDA platform supports other computational interfaces, including the Khronos Group's OpenCL,^[6] Microsoft's DirectCompute, OpenGL Compute Shaders (https://www.khronos.org/opengl/wiki/Compute_Shader) and C++ AMP.^[7] Third-party wrappers are also available for Python, Perl, Fortran, Java, Ruby, Lua, Common Lisp, Haskell, R, MATLAB, IDL and Julia, and Mathematica has native support.

In the computer game industry, GPUs are used for graphics rendering, and for game physics calculations (physical effects such as debris, smoke, fire, fluids); examples include PhysX and Bullet. CUDA has also been used to accelerate non-graphical applications in computational biology, cryptography and other fields by an order of magnitude or more.^[8]^[9]^[10]^[11]^[12]

CUDA provides both a low-level API (CUDA Driver API, non single-source) and a higher-level API (CUDA Runtime API, single-source). The initial CUDA SDK was made public on 15 February 2007, for Microsoft Windows and Linux. Mac OS X support was later added in version 2.0,^[13] which supersedes the beta released February 14, 2008.^[14] CUDA works with all Nvidia GPUs from the G8x series onwards, including GeForce, Quadro and the Tesla line. CUDA is compatible with most standard operating systems. Nvidia states that programs developed for the G8x series will also work without modification on all future Nvidia video cards, due to binary compatibility.^[citation needed]

CUDA 8.0 comes with the following libraries (for compilation & runtime, in alphabetical order):

Advantages

CUDA has several advantages over traditional general-purpose computation on GPUs (GPGPU) using graphics APIs:

Limitations

GPUs supported

Version features and specifications

Note: Any missing lines or empty entries reflect a lack of information on that item.

For more information, see the article "(GPU Computing) NVIDIA CUDA Compute Capability Comparative Table" by JeGX (Geeks3D, 2010-06-06, http://www.geeks3d.com/20100606/gpu-computing-nvidia-cuda-compute-capability-comparative-table/, accessed 2017-08-08) and read the Nvidia CUDA programming guide.^[41]

Example

This example code in C++ loads a texture from an image into an array on the GPU:

Below is an example given in Python that computes the product of two arrays on the GPU. The unofficial Python language bindings can be obtained from PyCUDA.^[42]
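
A minimal sketch of such a computation follows. It is not the article's original listing. It assumes PyCUDA, NumPy, the CUDA toolkit and a CUDA-capable device are installed; when any of them is missing, it falls back to plain Python so the function still returns a result. The function name `multiply_arrays`, the kernel name `multiply_them` and the `threads_per_block` default are illustrative choices.

```python
# Sketch: element-wise product of two arrays, on the GPU when PyCUDA, NumPy
# and a CUDA device are all available, otherwise in plain Python.
# multiply_arrays and multiply_them are illustrative names.

KERNEL_SOURCE = """
__global__ void multiply_them(float *dest, const float *a, const float *b, int n)
{
    const int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        dest[i] = a[i] * b[i];
}
"""

try:
    import numpy
    import pycuda.autoinit  # noqa: F401  (creates a CUDA context on import)
    import pycuda.driver as drv
    from pycuda.compiler import SourceModule
    HAVE_GPU = True
except Exception:  # PyCUDA or NumPy missing, or no usable CUDA device
    HAVE_GPU = False


def multiply_arrays(a, b, threads_per_block=256):
    """Return the element-wise product of two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("arrays must have the same length")
    if len(a) == 0:
        return []
    if not HAVE_GPU:
        # CPU fallback: same result, computed in Python floats.
        return [x * y for x, y in zip(a, b)]

    a_host = numpy.asarray(a, dtype=numpy.float32)
    b_host = numpy.asarray(b, dtype=numpy.float32)
    dest = numpy.zeros_like(a_host)

    # Compile the kernel and look it up by name.
    kernel = SourceModule(KERNEL_SOURCE).get_function("multiply_them")

    # One thread per element; round the block count up.
    blocks = (len(a) + threads_per_block - 1) // threads_per_block
    kernel(drv.Out(dest), drv.In(a_host), drv.In(b_host), numpy.int32(len(a)),
           block=(threads_per_block, 1, 1), grid=(blocks, 1))
    return dest.tolist()


if __name__ == "__main__":
    print(multiply_arrays([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]))
```

`drv.In` and `drv.Out` copy the host arrays to the device before the launch and copy the result back afterwards, so no explicit memory management appears in the sketch. The GPU path computes in single precision, so for inputs that are not exactly representable as 32-bit floats its results can differ slightly from the CPU fallback.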

Additional Python bindings to simplify matrix multiplication operations can be found in the program pycublas.^[43]

Benchmarks

Language bindings

Current and future usages of CUDA architecture

See also

References

1. ^ "Nvidia CUDA Home Page". http://www.nvidia.com/object/cuda_home_new.html
2. ^ Abi-Chahla, Fedy (June 18, 2008). "Nvidia's CUDA: The End of the CPU?". Tom's Hardware. http://www.tomshardware.com/reviews/nvidia-cuda-gpu,1954.html. Accessed May 17, 2015.
3. ^ Zunitch, Peter (2018-01-24). "CUDA vs. OpenCL vs. OpenGL". Videomaker. https://www.videomaker.com/article/c15/19313-cuda-vs-opencl-vs-opengl. Accessed 2018-09-16.
4. ^ Shimpi, Anand Lal; Wilson, Derek (November 8, 2006). "Nvidia's GeForce 8800 (G80): GPUs Re-architected for DirectX 10". AnandTech. http://www.anandtech.com/show/2116/8. Accessed May 16, 2015.
5. ^ "CUDA LLVM Compiler". http://developer.nvidia.com/cuda/cuda-llvm-compiler
6. ^ "First OpenCL demo on a GPU". YouTube, video ID r1sN1ELJfNo.
7. ^ "DirectCompute Ocean Demo Running on Nvidia CUDA-enabled GPU". YouTube, video ID K1I4kts5mqc.
8. ^ Vasiliadis, Giorgos; Antonatos, Spiros; Polychronakis, Michalis; Markatos, Evangelos P.; Ioannidis, Sotiris (September 2008). "Gnort: High Performance Network Intrusion Detection Using Graphics Processors". Proceedings of the 11th International Symposium on Recent Advances in Intrusion Detection (RAID). http://www.ics.forth.gr/dcs/Activities/papers/gnort.raid08.pdf
9. ^ Schatz, Michael C.; Trapnell, Cole; Delcher, Arthur L.; Varshney, Amitabh (2007). "High-throughput sequence alignment using Graphics Processing Units". BMC Bioinformatics. 8: 474. doi:10.1186/1471-2105-8-474. PMID 18070356. PMC 2222658.
10. ^ Manavski, Svetlin A.; Giorgio, Valle (2008). "CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment". BMC Bioinformatics. 10: S10. doi:10.1186/1471-2105-9-S2-S10. PMID 18387198. PMC 2323659.
11. ^ "Pyrit – Google Code". https://code.google.com/p/pyrit/
12. ^ "Use your Nvidia GPU for scientific computing". BOINC. 2008-12-18. Archived from the original (http://boinc.berkeley.edu/cuda.php) on 2008-12-28: https://web.archive.org/web/20081228022142/http://boinc.berkeley.edu/cuda.php. Accessed 2017-08-08.
13. ^ "Nvidia CUDA Software Development Kit (CUDA SDK) – Release Notes Version 2.0 for MAC OS X". Archived from the original (http://developer.download.nvidia.com/compute/cuda/sdk/website/doc/CUDA_SDK_release_notes_macosx.txt) on 2009-01-06: https://web.archive.org/web/20090106020401/http://developer.download.nvidia.com/compute/cuda/sdk/website/doc/CUDA_SDK_release_notes_macosx.txt
14. ^ "CUDA 1.1 – Now on Mac OS X". February 14, 2008. Archived from the original (http://news.developer.nvidia.com/2008/02/cuda-11---now-o.html) on November 22, 2008: https://web.archive.org/web/20081122105633/http://news.developer.nvidia.com/2008/02/cuda-11---now-o.html
15. ^ Silberstein, Mark; Schuster, Assaf; Geiger, Dan; Patney, Anjul; Owens, John D. (2008). "Efficient computation of sum-products on GPUs through software-managed cache". Proceedings of the 22nd annual international conference on Supercomputing – ICS '08. pp. 309–318. doi:10.1145/1375527.1375572. ISBN 978-1-60558-158-3.
16. ^ "CUDA Toolkit Documentation". nVidia Developer Zone - CUDA C Programming Guide v8.0. January 2017. Section 3.1.5, p. 19. http://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf. Accessed 22 March 2017.
17. ^ "NVCC forces c++ compilation of .cu files". https://devtalk.nvidia.com/default/topic/508479/cuda-programming-and-performance/nvcc-forces-c-compilation-of-cu-files/#entry1340190
18. ^ "CUDA-Enabled Products". CUDA Zone. Nvidia Corporation. http://www.nvidia.com/object/cuda_learn_products.html. Accessed 2008-11-03.
19. ^ Whitehead, Nathan; Fit-Florea, Alex. "Precision & Performance: Floating Point and IEEE 754 Compliance for Nvidia GPUs". Nvidia. https://developer.nvidia.com/sites/default/files/akamai/cuda/files/NVIDIA-CUDA-Floating-Point.pdf. Accessed November 18, 2014.
20. ^ http://developer.download.nvidia.com/compute/cuda/1.0/NVIDIA_CUDA_Programming_Guide_1.0.pdf
21. ^ http://developer.download.nvidia.com/compute/cuda/2_1/toolkit/docs/NVIDIA_CUDA_Programming_Guide_2.1.pdf
22. ^ http://developer.download.nvidia.com/compute/cuda/2_2/toolkit/docs/NVIDIA_CUDA_Programming_Guide_2.2.pdf
23. ^ http://developer.download.nvidia.com/compute/cuda/2_21/toolkit/docs/NVIDIA_CUDA_Programming_Guide_2.2.1.pdf
24. ^ http://developer.download.nvidia.com/compute/cuda/2_3/toolkit/docs/NVIDIA_CUDA_Programming_Guide_2.3.pdf
25. ^ http://developer.download.nvidia.com/compute/cuda/3_0/toolkit/docs/NVIDIA_CUDA_ProgrammingGuide.pdf
26. ^ http://developer.download.nvidia.com/compute/cuda/3_1/toolkit/docs/NVIDIA_CUDA_C_ProgrammingGuide_3.1.pdf
27. ^ http://developer.download.nvidia.com/compute/cuda/3_2_prod/toolkit/docs/CUDA_C_Programming_Guide.pdf
28. ^ https://developer.nvidia.com/cuda-toolkit-archive
29. ^ https://www.techpowerup.com/gpu-specs/quadro-nvs-420.c1448
30. ^ Larabel, Michael (March 29, 2017). "NVIDIA Rolls Out Tegra X2 GPU Support In Nouveau". Phoronix. http://www.phoronix.com/scan.php?page=news_item&px=Tegra-X2-Nouveau-Support. Accessed August 8, 2017.
31. ^ "Nvidia Xavier Specs" on TechPowerUp (preliminary). https://www.techpowerup.com/gpudb/3232/xavier
32. ^ "H.1. Features and Technical Specifications - Table 13. Feature Support per Compute Capability". https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications
33. ^ https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications
34. ^ "H.1. Features and Technical Specifications - Table 14. Technical Specifications per Compute Capability". https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications
35. ^ ALUs perform only single-precision floating-point arithmetics. There is 1 double-precision floating-point unit.
36. ^ "Inside Volta" on Nvidia DevBlogs. https://devblogs.nvidia.com/inside-volta/
37. ^ No more than one scheduler can issue 2 instructions at once. The first scheduler is in charge of warps with odd IDs. The second scheduler is in charge of warps with even IDs.
38. ^ "Inside Volta" on Nvidia DevBlogs. https://devblogs.nvidia.com/inside-volta/
39. ^ https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#architecture-7-x
40. ^ "H.6. Compute Capability 7.x". https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capability-7-x
41. ^ "Appendix F. Features and Technical Specifications" (3.2 MiB), page 148 of 175 (Version 5.0, October 2012). http://developer.download.nvidia.com/compute/DevZone/docs/html/C/doc/CUDA_C_Programming_Guide.pdf
42. ^ "PyCUDA". http://mathema.tician.de/software/pycuda
43. ^ "pycublas". Archived from the original (http://kered.org/blog/2009-04-13/easy-python-numpy-cuda-cublas/) on 2009-04-20: https://web.archive.org/web/20090420124748/http://kered.org/blog/2009-04-13/easy-python-numpy-cuda-cublas/. Accessed 2017-08-08.
44. ^ https://devblogs.nvidia.com/gpu-computing-julia-programming-language/
45. ^ "MATLAB Adds GPGPU Support". 2010-09-20. Archived from the original (http://www.hpcwire.com/features/MATLAB-Adds-GPGPU-Support-103307084.html) on 2010-09-27: https://web.archive.org/web/20100927155948/http://www.hpcwire.com/features/MATLAB-Adds-GPGPU-Support-103307084.html