请输入您要查询的百科知识:

 

词条 Horizontal correlation
释义

  1. References

{{multiple issues|{{Orphan|date=February 2009}}{{Refimprove|date=January 2008}}
}}

Horizontal correlation is a methodology for gene sequence analysis. Rather than referring to one specific technique, horizontal correlation instead encompasses a variety of approaches to sequence analysis that are unified by two specific themes:

  • Sequence analysis is performed by making comparisons horizontally, along the length of a single genetic sequence; this is in contrast to vertical methods that make comparisons across several different genetic sequences.
  • The comparisons made generally measure information theoretic quantities such as value of the mutual information function between two regions of the sequence.

The core ideas of the horizontal correlation approach were first presented in a year 2000 paper by Grosse, Herzel, Buldyrev, and Stanley (Grosse, et al. 2000). In this first formulation, Grosse and colleagues sought to characterize a large genetic sequence by dividing the sequence into coding and non-coding regions. Whereas traditional approaches to the coding-vs.-non-coding problem generally relied on sophisticated pattern recognition systems that were first trained on small inputs and then run over the entire sequence (Ohler, et al. 1999), the horizontal correlation approach of Grosse and colleagues worked instead by breaking the sequence into many relatively short sequence fragments, each only 500 base pairs in length. They then sought to characterize each of these fragments as either coding or non-coding. This was accomplished by comparing each size 3 window along the length of a fragment with the first size 3 window in that fragment, then measuring the value of the mutual information function between the two windows. Coding sequences were found to display a stylized pattern of 3-periodicity that non-coding sequences did not. Such a pattern was easy to recognize, and enabled significantly more rapid, more species-independent identification of coding regions (Grosse, et al. 2000).

Since 2000, horizontal correlation methodologies emphasizing the measurement of information theoretic quantities along the length of a gene sequence have been put to widespread use, and have even found application in shotgun sequencing fragment assembly (Otu & Sayood, 2004).

References

  • I. Grosse, H. Herzel, S. Buldyrev, H. Stanley: "Species Independence of Mutual Information in Coding and non-Coding DNA," Physical Review E, Vol. 61, No. 5 (2000)
  • U. Ohler, S. Harbeck, H. Niemann, E. Noth, and M. Reese: "Interpolated Markov Chains for Eukaryotic Promoter Recognition," Bioinformatics, Vol. 15, pp. 362–369 (1999)
  • H. Otu, K. Sayood: "A Divide and Conquer Approach to Fragment Assembly," Bioinformatics, Vol. 19, No. 1 pp. 22–29 (2004)

1 : Bioinformatics

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/16 5:57:02