请输入您要查询的百科知识:

 

词条 ProbCons
释义

  1. Algorithm

     Step 1: Reliability of an alignment edge  Step 2: Maximum expected accuracy  Step 3: Probabilistic Consistency Transformation  Step 4: Computation of guide tree  Step 5: Compute MSA 

  2. See also

  3. References

  4. External links

ProbCons is an open source probabilistic consistency-based multiple alignment of amino acid sequences. It is an efficient protein multiple sequence alignment program, which has demonstrated a statistically significant improvement in accuracy compared to several leading alignment tools.[1][2]

Algorithm

The following describes the basic outline of the ProbCons algorithm.[3]

Step 1: Reliability of an alignment edge

For every pair of sequences compute the probability that letters and are paired in an alignment that is generated by the model.

(Where is equal to 1 if and are in the alignment and 0 otherwise.)

Step 2: Maximum expected accuracy

The accuracy of an alignment with respect to another alignment is defined as the number of common aligned pairs divided by the length of the shorter sequence.

Calculate expected accuracy of each sequence:

This yields a maximum expected accuracy (MEA) alignment:

Step 3: Probabilistic Consistency Transformation

All pairs of sequences x,y from the set of all sequences are now re-estimated using all intermediate sequences z:

This step can be iterated.

Step 4: Computation of guide tree

Construct a guide tree by hierarchical clustering using MEA score as sequence similarity score. Cluster similarity is defined using weighted average over pairwise sequence similarity.

Step 5: Compute MSA

Finally compute the MSA using progressive alignment or iterative alignment.

See also

  • Sequence alignment software
  • Clustal
  • MUSCLE
  • AMAP
  • T-Coffee
  • Probalign

References

1. ^{{cite journal |doi=10.1101/gr.2821705 |vauthors=Do CB, Mahabhashyam MS, Brudno M, Batzoglou S |year=2005 |title=PROBCONS: Probabilistic Consistency-based Multiple Sequence Alignment |journal=Genome Research |volume=15 |issue=2 |pages=330–340 |pmid=15687296 |pmc=546535}}
2. ^{{Cite book|title=Multiple Sequence Alignment Methods|volume = 1079|last=Roshan|first=Usman|date=2014-01-01|publisher=Humana Press|isbn=9781627036450|editor-last=Russell|editor-first=David J|series=Methods in Molecular Biology|pages=147–153|language=English|doi=10.1007/978-1-62703-646-7_9|pmid = 24170400|chapter = Multiple Sequence Alignment Using Probcons and Probalign}}
3. ^Lecture "Bioinformatics II" at University of Freiburg

External links

  • {{Official website|http://probcons.stanford.edu/}}

2 : Bioinformatics|Computational phylogenetics

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/11 21:41:53