Evola CSM (http://www.h-invitational.jp/csm/) contains the results of cross-species mapping (CSM) of human transcripts onto non-human primate genomes. The purpose of this database is to provide annotation of the coding potential of unknown human transcripts (whether they code a protein or not). In the coding sequence (CDS), synonymous substitution (amino acid not-changing) tends to occur more frequently than nonsynonymous (amino acid changing) substitution. Pairwise alignments of human transcripts and non-human primate genomes were examined by window analysis (20-codon window with 1-codon step). If the statistical difference (P < 0.01, Fisher's exact test) between synonymous and nonsynonymous substitutions was observed, the human transcripts were identified as "Protein-coding [conserved]". If insertions of frame-shift or stop-codon into the non-human primate genomes resulted in shortening (< 50%) of human CDS, the human transcripts were identified as "Protein-coding [partially conserved]". [ Sample ] |
|
|
|