“CS23D”的意思、由来-开放百科全书

CS23D is a web server to generate 3D structural models from NMR chemical shifts.^[1] CS23D combines maximal fragment assembly with chemical shift threading, de novo structure generation, chemical shift-based torsion angle prediction, and chemical shift refinement. CS23D makes use of RefDB and ShiftX.

CS23D input formats

CS23D options

CS23D output

CS23D output consists of a set of 10 best-score PDB coordinates. A hyperlink to the single best score structure is also provided. The overall CS23D score, knowledge-based score, chemical shift score, Ramachandran plot statistics, correlations between the observed and calculated shifts before and after refinement are displayed. A conclusion about structure reliability is given to the user.

CS23D protocol

Homology search: The query sequence is used to find homologous proteins or/and protein fragments in a non-redundant database of PDB sequences and secondary structures of PPT-DB using BLAST.

Homology modelling: Homology modelling is done by the Homodeller program, which is a part of the PROTEUS2 program.^[2] The proteins that are identified during the homology search step are used as the templates in homology modelling.

Chemical shift re-referencing: Chemical shifts are re-referenced by the RefCor,^[3] which is a part of the RCI webserver backend.

Secondary structure prediction from chemical shifts: Secondary structure is predicted from chemical shifts by CSI.

Chemical shift threading: Backbone Phi and Psi torsion angles predicted from chemical shifts by PREDITOR^[4] are mapped into nine different regions in Ramachandran space, each of which are assigned specific letters. A protein can represented by a sequence of these nine "torsion angle letters". Thrifty is using these sequences of torsion angle letters to identify good templates in a database of ∼18 500 nonredundant PDB structures that have had their structures converted to the nine-letter Ramachandran "alphabet".

In a similar manner, chemical shift threading is additionally done using three-letter secondary structure alphabet (H for helix, B for beta-strand, C for coil) and secondary structure predicted from chemical shifts by the CSI program.

Subfragments identified by homology modelling and chemical shift threading steps are assembled into initial 3D models using CS23D SFassembler (SubFragment assembler). The initial models are evaluated by the GAFolder scoring function (see below) and the best model is further refined by GAFolder (see more info about GAFolder below).

Ab initio folding: Ab initio folding is done by Rosetta^[5] when no template was identified by the homology modelling and chemical shift threading steps. Rosetta models are evaluated by GAFolder scoring function and the best Rosetta models are refined by GAFolder (see below).

Model optimization: Model optimization in CS23D is done by a torsion-angle-based minimizer GAfolder (Genetic Algorithm folder) that uses a genetic algorithm to sample conformation space. The method is similar to that employed by GENFOLD.^[5] GAFolder makes torsion angles moves within the ranges defined by the values and uncertainties of torsion angles predicted by PREDITOR.^[4] GAFolder evaluates protein models by the scoring function described below.

Scoring function: Scoring function of GAFolder consists of knowledge based scores and chemical shift scores.

CS23D sub-programs

CS23D dependence on template sequence identity

CS23D is a template-based method. Therefore, its performance depends on sequence identity of the selected template(s), see the adjacent picture. Likewise, Rosetta is a fragment-biased method. Its performance depends on the quality of selected fragments. Fragment quality and, thus, Rosetta performance can be improved by using chemical shifts during the fragment selection step (e.g. in CS-Rosetta protocol). For a structural solution that is not biased by a template structure or fragment structure, one may want to consider obtaining NOE-based distance restraints (8-10 per residue) and using them with the GeNMR program in its ab initio mode.

See also

References

1. ^{{cite journal |vauthors=Wishart DS, Arndt D, Berjanskii M, Tang P, Zhou J, Lin G |title=CS23D: a web server for rapid protein structure generation using NMR chemical shifts and sequence data |journal=Nucleic Acids Research |volume=36 |issue=Web Server issue |pages=W496–502 |date=July 2008 |pmid=18515350 |pmc=2447725 |doi=10.1093/nar/gkn305}}
2. ^¹{{cite journal |vauthors=Montgomerie S, Cruz JA, Shrivastava S, Arndt D, Berjanskii M, Wishart DS |title=PROTEUS2: a web server for comprehensive protein structure prediction and structure-based annotation |journal=Nucleic Acids Research |volume=36 |issue=Web Server issue |pages=W202–9 |date=July 2008 |pmid=18483082 |pmc=2447806 |doi=10.1093/nar/gkn255}}
3. ^{{cite journal |vauthors=Berjanskii M, Wishart DS |title=NMR: prediction of protein flexibility |journal=Nature Protocols |volume=1 |issue=2 |pages=683–8 |year=2006 |pmid=17406296 |doi=10.1038/nprot.2006.108}}
4. ^¹²³{{cite journal |vauthors=Berjanskii MV, Neal S, Wishart DS |title=PREDITOR: a web server for predicting protein torsion angle restraints |journal=Nucleic Acids Research |volume=34 |issue=Web Server issue |pages=W63–9 |date=July 2006 |pmid=16845087 |pmc=1538894 |doi=10.1093/nar/gkl341}}
5. ^¹²{{cite journal |vauthors=Rohl CA, Strauss CE, Misura KM, Baker D |title=Protein structure prediction using Rosetta |journal=Methods in Enzymology |volume=383 |issue= |pages=66–93 |year=2004 |pmid=15063647 |doi=10.1016/S0076-6879(04)83004-0}}
6. ^{{cite journal |vauthors=Bryant SH, Lawrence CE |title=An empirical energy function for threading protein sequence through the folding motif |journal=Proteins |volume=16 |issue=1 |pages=92–112 |date=May 1993 |pmid=8497488 |doi=10.1002/prot.340160110}}