请输入您要查询的百科知识:

 

词条 Coding region
释义

  1. Structure

  2. Difference with cDNA

  3. Coding sequence annotation

  4. See also

  5. References

{{refimprove|date=August 2018}}

The coding region of a gene, also known as the CDS (from coding sequence), is that portion of a gene's DNA or RNA that codes for protein. The region usually begins at the 5' end by a start codon and ends at the 3' end with a stop codon.

Structure

The coding region in an mRNA is flanked by the five prime untranslated region (5'-UTR) and the three prime untranslated region (3'-UTR). [1] The CDS is that portion of an mRNA transcript that is translated by a ribosome. CDS is a keyword (feature-key) used to denote the 'protein-coding sequence' in a gene feature table by the major sequence databases INSDC. They also read CDS as both coding sequence and [https://ncbiinsights.ncbi.nlm.nih.gov/tag/cds/ coding region].

Difference with cDNA

A cDNA sequence is derived from the transcript by reverse transcription, but in this case it also contains the 5' and 3' UTRs, which are not part of the CDS (they are transcribed, but not translated). A CDS will almost always start with an AUG initiation codon in eukaryotes and stop at one of the three stop codons (UAA, UGA, UAG).

Coding sequence annotation

While identification of open reading frames within a DNA sequence is straightforward, identifying coding sequences is not, because the cell translates only a subset of all open reading frames to proteins.[2]

Currently CDS prediction uses sampling and sequencing of mRNA from cells, although there is still the problem of determining which parts of a given mRNA are actually translated to protein. CDS prediction is a subset of gene prediction, the latter also including prediction of DNA sequences that code not only for protein but also for other functional elements such as RNA genes and regulatory sequences.

See also

  • Coding strand The strand that codes for a protein
  • Gene structure The other elements that make up a gene
  • Non-coding DNA Parts of genomes that don't encode genes
  • Non-coding RNA Genes that do not encode proteins, have no CDS

References

1. ^{{cite web|url=http://genome.wellcome.ac.uk/doc_WTD020755.html|title=Gene Structure|last=Twyman|first=Richard|date=1 August 2003|publisher=The Wellcome Trust|archiveurl=https://web.archive.org/web/20070328214808/http://genome.wellcome.ac.uk/doc_WTD020755.html|archivedate=28 March 2007|deadurl=yes|accessdate=6 April 2003|df=}}
2. ^{{Cite journal | last1 = Furuno | first1 = Masaaki | last2 = Kasukawa | first2 = Takeya | last3 = Saito | first3 = Rintaro | last4 = Adachi | first4 = Jun | last5 = Suzuki | first5 = Harukazu | last6 = Baldarelli | first6 = Richard | last7 = Hayashizaki | first7 = Yoshihide | last8 = Okazaki | first8 = Yasushi | title = CDS Annotation in Full-Length cDNA Sequence | journal = Genome Research | volume = 21 | issue = 9 | pages = 1478–1487 | publisher = Cold Spring Harbor Laboratory Press | date = September 2011 | url = http://genome.cshlp.org/content/13/6b/1478.full.pdf+html | doi = 10.1101/gr.1060303 | accessdate = 18 September 2011 | postscript = {{inconsistent citations}} | pmc = 403693}}

2 : DNA|Biochemistry

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/11 19:27:47