词条 | Feature extraction |
释义 |
In machine learning, pattern recognition and in image processing, feature extraction starts from an initial set of measured data and builds derived values (features) intended to be informative and non-redundant, facilitating the subsequent learning and generalization steps, and in some cases leading to better human interpretations. Feature extraction is a dimensionality reduction process, where an initial set of raw variables is reduced to more manageable groups (features) for processing, while still accurately and completely describing the original data set.[1] When the input data to an algorithm is too large to be processed and it is suspected to be redundant (e.g. the same measurement in both feet and meters, or the repetitiveness of images presented as pixels), then it can be transformed into a reduced set of features (also named a feature vector). Determining a subset of the initial features is called feature selection.[2] The selected features are expected to contain the relevant information from the input data, so that the desired task can be performed by using this reduced representation instead of the complete initial data. GeneralFeature extraction involves reducing the amount of resources required to describe a large set of data. When performing analysis of complex data one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power, also it may cause a classification algorithm to overfit to training samples and generalize poorly to new samples. Feature extraction is a general term for methods of constructing combinations of the variables to get around these problems while still describing the data with sufficient accuracy. Many machine learning practitioners believe that properly optimized feature extraction is the key to effective model construction.[3] Results can be improved using constructed sets of application-dependent features, typically built by an expert. One such process is called feature engineering. Alternatively, general dimensionality reduction techniques are used such as:
Image processingOne very important area of application is image processing, in which algorithms are used to detect and isolate various desired portions or shapes (features) of a digitized image or video stream. It is particularly important in the area of optical character recognition. Low-level
Curvature
Image motion
Shape based
Flexible methods
Feature extraction in softwareMany data analysis software packages provide for feature extraction and dimension reduction. Common numerical programming environments such as MATLAB, SciLab, NumPy and the R language provide some of the simpler feature extraction techniques (e.g. principal component analysis) via built-in commands. More specific algorithms are often available as publicly available scripts or third-party add-ons. There are also software packages targeting specific software machine learning applications that specialize in feature extraction.[4] See also
References1. ^{{Cite web|url=https://deepai.org/machine-learning-glossary-and-terms/feature-extraction|title=What is Feature Extraction?|last=|first=|date=|website=deepai.org|archive-url=|archive-date=|dead-url=|access-date=}} {{More citations needed|date=January 2016}}2. ^{{Cite book|author=Alpaydin, Ethem|title=Introduction to Machine Learning|url=https://books.google.com/books?id=7f5bBAAAQBAJ&printsec=frontcover#v=onepage&q=%22feature%20(extraction%20OR%20selection)%22&f=false |year=2010 |publisher=The MIT Press |place=London|page=110 |isbn= 978-0-262-01243-0 |access-date=4 February 2017 }} 3. ^Reality AI Blog, "Its all about the features," September 2017, https://reality.ai/it-is-all-about-the-features/ 4. ^See, for example, https://reality.ai/ 2 : Feature detection (computer vision)|Dimension reduction |
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。