请输入您要查询的百科知识:

 

词条 Chroma feature
释义

  1. Definition

  2. Applications

  3. Computation of audio chromagrams

  4. See also

  5. References

  6. External links

In music, the term chroma feature or chromagram closely relates to the twelve different pitch classes. Chroma-based features, which are also referred to as "pitch class profiles", are a powerful tool for analyzing music whose pitches can be meaningfully categorized (often into twelve categories) and whose tuning approximates to the equal-tempered scale. One main property of chroma features is that they capture harmonic and melodic characteristics of music, while being robust to changes in timbre and instrumentation.

Definition

The underyling observation is that humans perceive two musical pitches as similar in color if they differ by an octave. Based on this observation, a pitch can be separated into two components, which are referred to as tone height and chroma.[1] Assuming the equal-tempered scale, one considers twelve chroma values represented by the set

{C, C{{music|#}}, D, D{{music|#}}, E ,F, F{{music|#}}, G, G{{music|#}}, A, A{{music|#}}, B}

that consists of the twelve pitch spelling attributes as used in Western music notation. Note that in the equal-tempered scale different pitch spellings such

C{{music|#}} and D{{music|b}} refer to the same chroma. Enumerating the chroma values, one can identify the set of chroma values with the set of integers {1,2,...,12}, where 1 refers to chroma C, 2 to C{{music|#}}, and so on. A pitch class is defined as the set of all pitches that share the same chroma. For example, using the scientific pitch notation, the pitch class corresponding to the chroma C is the set

{..., C−2, C−1, C0, C1, C2, C3 ...}

consisting of all pitches separated by an integer number of octaves. Given a music representation (e.g. a musical score or an audio recording), the main idea of chroma features is to aggregate for a given local time window (e.g. specified in beats or in seconds) all information that relates to a given chroma into a single coefficient. Shifting the time window across the music representation results in a sequence of chroma features each expressing how the representation's pitch content within the time window is spread over the twelve chroma bands. The resulting time-chroma representation is also referred to as chromagram. The figure above shows chromagrams for a C-major scale, once obtained from a musical score and once from an audio recording. Because of the close relation between the terms chroma and pitch class, chroma features are also referred to as pitch class profiles.

Applications

Identifying pitches that differ by an octave, chroma features show a high degree of robustness to variations in timbre and closely correlate to the musical aspect of harmony. This is the reason why chroma features are a well-established tool for processing and analyzing music data.[2] For example, basically every chord recognition procedure relies on some kind of chroma representation.[3][4][5][6] Also, chroma features have become the de facto standard for tasks such as music alignment and synchronization[7][8] as well as audio structure analysis.[9] Finally, chroma features have turned out to be a powerful mid-level feature representation in content-based audio retrieval such as cover song

identification[10][11] or audio matching.[12][13]

Computation of audio chromagrams

There are many ways for converting an audio recording into a chromagram. For example, the conversion of an audio recording into a chroma representation (or chromagram) may be performed either by using short-time Fourier transforms in combination with binning strategies[14][15][16] or by employing suitable multirate filter banks.[12]

Furthermore, the properties of chroma features can be significantly changed by

introducing suitable pre- and post-processing steps modifying spectral, temporal,

and dynamical aspects. This leads to a large number of chroma variants, which

may show a quite different behavior in the context of a specific music analysis scenario.[17]

See also

  • Time-frequency analysis
  • Time-frequency analysis for music signal
  • Pitch (music)
  • Musical theory

References

1. ^{{cite journal|last=Shepard|first=Roger N.|title=Circularity in judgments of relative pitch|journal=Journal of the Acoustical Society of America|volume=36|issue=212|date=1964|pages=2346–2353|doi=10.1121/1.1919362}}
2. ^{{cite book| last = Müller| first = Meinard| title = Fundamentals of Music Processing| url = http://www.music-processing.de| publisher = Springer| year = 2015| doi = 10.1007/978-3-319-21945-5| isbn = 978-3-319-21944-8}}
3. ^{{cite journal|last1=Cho|first1=Taemin|last2=Bello|first2=Juan Pablo |title=On the Relative Importance of Individual Components of Chord Recognition Systems|journal=IEEE/ACM Transactions on Audio, Speech, and Language Processing|volume=22|issue=2|year=2014|pages=477–4920|doi=10.1109/TASLP.2013.2295926}}
4. ^{{cite journal|last1=Mauch|first1=Matthias|last2=Dixon|first2=Simon|title=Simultaneous estimation of chords and musical context from audio|journal=IEEE Transactions on Audio, Speech, and Language Processing|volume=18|issue=6|year=2010|pages=138–153|doi=10.1109/TASL.2009.2032947|citeseerx=10.1.1.414.7800}}
5. ^{{cite journal|last=Fujishima|first=Takuya|title=Realtime Chord Recognition of Musical Sound: a System Using Common Lisp Music|journal=Proceedings of the International Computer Music Conference|year=1999|pages=464–467}}
6. ^{{cite journal|last1=Jiang|first1=Nanzhu |last2=Grosche|first2=Peter|last3=Konz|first3=Verena|last4=Müller|first4=Meinard|title=Analyzing Chroma Feature Types for Automated Chord Recognition|url = https://www.audiolabs-erlangen.de/content/05-fau/professor/00-mueller/03-publications/2011_JiangGroscheKonzMueller_ChordRecognitionEvaluation_AES42-Ilmenau.pdf |journal=Proceedings of the AES Conference on Semantic Audio|year=2011}}
7. ^{{cite journal|last1=Hu|first1=Ning |last2=Dannenberg|first2=Roger B. |last3=Tzanetakis|first3=George |title=Polyphonic Audio Matching and Alignment for Music Retrieval|journal=Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics|year=2003}}
8. ^{{cite journal|last1= Ewert |first1=Sebastian |last2=Müller |first2=Meinard|last3=Grosche|first3=Peter|title=High Resolution Audio Synchronization Using Chroma Onset Features|url = https://www.audiolabs-erlangen.de/content/05-fau/professor/00-mueller/03-publications/2009_EwertMuellerGrosche_HighResAudioSync_ICASSP.pdf |journal=Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing|year=2009|pages=1869–1872}}
9. ^{{cite journal|last1=Paulus|first1=Jouni |last2=Müller|first2=Meinard|last3=Klapuri|first3=Anssi |title=Audio-based Music Structure Analysis|url = http://ismir2010.ismir.net/proceedings/ismir2010-107.pdf |journal=Proceedings of the International Conference on Music Information Retrieval|year=2010|pages=625–636}}
10. ^{{cite journal|last1=Ellis|first1=Daniel P.W.|last2=Poliner|first2=Graham |title=Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking|journal=Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing|year=2007}}
11. ^{{cite journal|last1=Serrà|first1=Joan|last2=Gómez|first2=Emilia|last3=Herrera|first3=Perfecto|last4=Serra|first4=Xavier |title=Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification|journal=IEEE Transactions on Audio, Speech, and Language Processing|volume=16|issue=6|year=2008|pages=1138–1151|doi=10.1109/TASL.2008.924595}}
12. ^{{cite journal|last1=Müller|first1=Meinard|last2=Kurth|first2=Frank|last3=Clausen|first3=Michael|title=Audio Matching via Chroma-Based Statistical Features|url = http://ismir2005.ismir.net/proceedings/1019.pdf|journal=Proceedings of the International Conference on Music Information Retrieval |year=2005|pages=288–295}}
13. ^{{cite journal|last1=Kurth|first1=Frank|last2=Müller|first2=Meinard|title=Efficient Index-Based Audio Matching|journal=IEEE Transactions on Audio, Speech, and Language Processing|volume=16|issue=2|year=2008|pages=382–395|doi=10.1109/TASL.2007.911552}}
14. ^{{cite journal|last1=Bartsch|first1=Mark A. |last2=Wakefield|first2=Gregory H. |title=Audio thumbnailing of popular music using chroma-based representations|journal=IEEE Transactions on Multimedia|volume=7|number=1|year=2005|pages=96–104|doi=10.1109/TMM.2004.840597 |citeseerx=10.1.1.379.3293 }}
15. ^{{cite journal|last=Gómez|first=Emilia |title=Tonal Description of Music Audio Signals|journal=PhD Thesis, UPF Barcelona, Spain|year=2006}}
16. ^{{cite book| last = Müller| first = Meinard| title = Music Synchronization. In Fundamentals of Music Processing, chapter 3, pages 115-166| url = http://www.music-processing.de| publisher = Springer| year = 2015| isbn = 978-3-319-21944-8 }}
17. ^{{cite journal|last1=Müller|first1=Meinard|last2=Ewert|first2=Sebastian |title=Chroma Toolbox: MATLAB Implementations For Extracting Variants of Chroma-Based Audio Features|url = https://www.audiolabs-erlangen.de/content/05-fau/professor/00-mueller/03-publications/2011_MuellerEwert_ChromaToolbox_ISMIR.pdf|journal=Proceedings of the International Society for Music Information Retrieval Conference|year=2011|pages=215–220}}

External links

  • [https://www.audiolabs-erlangen.de/resources/MIR/chromatoolbox Chroma Toolbox] Free MATLAB implementations of various chroma types of pitch-based and chroma-based audio features
  • Harmonic Pitch Class Profile plugin

4 : Music information retrieval|Music technology|Musicology|Time–frequency analysis

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/9/23 8:21:55