H-InvDB topic annotation

NEDO FL-cDNA PJ: annotation of NEDO full-length cDNA project [Download]

Description: We provide the detailed annotation data for the H-InvDB_6.0 representative transcripts and NEDO human full-length cDNA sequences (rep. + NEDO-PJ).

  1. summary of annotation
  2. evolutionary conservation and coding-potential
  3. expression frequency based on EST and CAGE tags

1. summary of annotation
[Download (1)] Annotation summary
File name: hinvdb_nedo_summary.txt.gz [Download]
No. | HIX | HIT | ACC# | Chr. number | Evolutionary conservation | Expression frequency | Transcript length (bp) | CDS length (aa) | Similarity category | Definition
1 | HIX0001452 | HIT000000166 | AB007918 | 1 | 1 | High | 9895 | 1625 | I | Protein OSCP1 (Organic solute transport protein 1) (hOSCP1) (Oxidored-nitro domain-containing protein 1).
2 | HIX0000960 | HIT000000171 | AB007923 | 1 | 2 | High | 8018 | 2241 | I | Fibroblast growth factor 10 precursor (FGF-10) (Keratinocyte growth factor 2).
3 | HIX0022179 | HIT000000174 | AB007926 | 1 | 1 | High | 6833 | 833 | I | Sialyltransferase 6 isoform j.
4 | HIX0001545 | HIT000000180 | AB007932 | 1 | 1 | High | 6263 | 1910 | I | Palmitoyl-protein thioesterase 1 precursor (EC (PPT-1) (Palmitoyl-protein hydrolase 1).
5 | HIX0001257 | HIT000000181 | AB007933 | 1 | 1 | High | 2879 | 507 | I | Peflin (PEF protein with a long N-terminal hydrophobic domain) (Penta-EF hand domain-containing protein 1).
6 | HIX0000672 | HIT000000190 | AB007942 | 1 | 1 | High | 5747 | 914 | I | Eukaryotic translation initiation factor 1A, X-chromosomal (eIF-1A X isoform) (eIF-4C).
7 | HIX0001372 | HIT000000192 | AB007944 | 1 | 1 | High | 5983 | 410 | I | Polyhomeotic-like protein 2 (hPH2) (Early development regulatory protein 2).
8 | HIX0000646 | HIT000000201 | AB007954 | 1 | 3 | NA | 6565 | 90 | I | Mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferase, isoenzyme B isoform 1.
9 | HIX0114185 | HIT000000208 | AB007961 | 1 | 3 | Medium | 5929 | 69 | I | Antigen peptide transporter 2 (APT2) (ATP-binding cassette sub-family B member 3) (Peptide transporter TAP2) (Peptide transporter PSF2) (Peptide supply factor 2) (PSF-2) (Peptide transporter involved in antigen processing 2).
10 | HIX0000998 | HIT000000209 | AB007962 | 1 | 3 | High | 5732 | 282 | I | Histone H3.1 (H3/a) (H3/b) (H3/c) (H3/d) (H3/f) (H3/h) (H3/i) (H3/j) (H3/k) (H3/l).
11 | HIX0000555 | HIT000000210 | AB007963 | 1 | 3 | High | 5766 | 496 | I | RING finger protein 44.
12 | HIX0001365 | HIT000000217 | AB007970 | 1 | 3 | NA | 4756 | 152 | I | 40S ribosomal protein S18 (Ke-3) (Ke3).
13 | HIX0001159 | HIT000000218 | AB007971 | 1 | 3 | Low | 5584 | 145 | I | Olfactory receptor, family 2, subfamily J, member 3.
14 | HIX0018156 | HIT000000221 | AB007974 | 1 | 3 | Low | 5809 | 228 | I | Ubiquitin-conjugating enzyme E2 D2 (EC (Ubiquitin-protein ligase D2) (Ubiquitin carrier protein D2) (Ubiquitin-conjugating enzyme E2-17 kDa 2) (E2(17)KB 2).
15 | HIX0001244 | HIT000000222 | AB007975 | 1 | 3 | Medium | 5951 | 141 | I | Tubulin beta chain (Tubulin beta-5 chain).
16 | HIX0001353 | HIT000000226 | AB007979 | 1 | 3 | High | 5596 | 96 | I | Major histocompatibility complex, class II, DM alpha precursor.
17 | HIX0000617 | HIT000000432 | AB014607 | 1 | 1 | High | 6359 | 608 | I | Chloride channel protein ClC-Ka (Chloride channel Ka) (ClC-K1).
18 | HIX0001253 | HIT000058452 | AB015856 | 1 | 1 | High | 2509 | 671 | I | Tubulointerstitial nephritis antigen-like precursor (Tubulointerstitial nephritis antigen-related protein) (TIN Ag-related protein) (TIN-Ag-RP) (Glucocorticoid-inducible protein 5) (Oxidized LDL-responsive gene 2 protein) (OLRG-2).
19 | HIX0000182 | HIT000058537 | AB017919 | 1 | 3 | High | 2263 | 664 | I | Actin-binding LIM protein 3 (Actin-binding LIM protein family member 3) (abLIM-3).
20 | HIX0001006 | HIT000000461 | AB018279 | 1 | 1 | High | 4353 | 743 | I | Mannosyl-oligosaccharide 1,2-alpha-mannosidase IC (EC (Processing alpha-1,2-mannosidase IC) (Alpha-1,2-mannosidase IC) (Mannosidase alpha class 1C member 1) (HMIC).
21 | HIX0000412 | HIT000058567 | AB018739 | 1 | 1 | High | 2413 | 730 | I | DNA fragmentation factor subunit alpha (DNA fragmentation factor 45 kDa subunit) (DFF-45) (Inhibitor of CAD) (ICAD).
22 | HIX0000915 | HIT000000602 | AB020692 | 1 | 3 | High | 4076 | 799 | I | Dysbindin (Dystrobrevin-binding protein 1) (Hermansky-Pudlak syndrome 7 protein homolog) (Hps7-like protein).
23 | HIX0000836 | HIT000000610 | AB020700 | 1 | 3 | High | 4195 | 920 | I | RAS-responsive element-binding protein 1 (RREB-1) (Raf-responsive zinc finger protein LZ321) (Zinc finger motif-enhancer binding-protein 1) (Zep-1) (Finger protein in nuclear bodies).
24 | HIX0000108 | HIT000000628 | AB020718 | 1 | 1 | High | 5219 | 982 | I | Integrator complex subunit 11 (EC 3.1.27.-) (Int11) (Cleavage and polyadenylation-specific factor 3-like protein) (CPSF3-like protein) (Protein related to CPSF subunits of 68 kDa) (RC-68).
25 | HIX0000180 | HIT000000711 | AB023211 | 1 | 1 | High | 4343 | 666 | I | Protein RER1.
26 | HIX0000124 | HIT000001052 | AB037758 | 1 | 1 | High | 5181 | 1393 | I | Von Willebrand factor A domain-related protein isoform 1.
27 | HIX0001701 | HIT000001098 | AB037804 | 1 | 1 | High | 4904 | 906 | I | Neurotrophic factor artemin isoform 1, precursor.
28 | HIX0000932 | HIT000001151 | AB037857 | 1 | 1 | High | 6160 | 880 | I | Thioredoxin family Trp26.
29 | HIX0001055 | HIT000001156 | AB037862 | 1 | 1 | High | 5378 | 1238 | I | Exostoses (multiple)-like 1.
30 | HIX0028498 | HIT000059328 | AB042410 | 1 | 1 | Medium | 3517 | 385 | I | Dolichyl-diphosphooligosaccharide-protein glycosyltransferase precursor.
31 | HIX0001643 | HIT000059351 | AB043587 | 1 | 1 | High | 3598 | 529 | I | Claudin-19. Isoform 2.
32 | HIX0000682 | HIT000059404 | AB044343 | 1 | 3 | High | 1272 | 356 | I | Alpha-1,3-mannosyl-glycoprotein 2-beta-N-acetylglucosaminyltransferase (EC (N-glycosyl-oligosaccharide-glycoprotein N-acetylglucosaminyltransferase I) (GNT-I) (GlcNAc-T I).
33 | HIX0023719 | HIT000059426 | AB044805 | 1 | 1 | High | 3103 | 472 | I | L-myc-1 proto-oncogene protein. Isoform 2.
34 | HIX0023517 | HIT000059435 | AB045116 | 1 | 3 | High | 1286 | 373 | I | Endothelin-1 precursor (Preproendothelin-1) (PPET1) [Contains: Endothelin-1 (ET-1); Big endothelin-1].
35 | HIX0001392 | HIT000001329 | AB046834 | 1 | 1 | High | 4143 | 1191 | I | Gap junction beta-3 protein (Connexin-31) (Cx31).
36 | HIX0021590 | HIT000241990 | AB057595 | 1 | 1 | High | 1622 | 429 | I | Alcohol dehydrogenase [NADP+] (EC (Aldehyde reductase) (Aldo-keto reductase family 1 member A1).
37 | HIX0000530 | HIT000059785 | AB057597 | 1 | 1 | High | 3071 | 572 | I | Lung type-I cell membrane-associated glycoprotein isoform c.
38 | HIX0000110 | HIT000059826 | AB060688 | 1 | 3 | High | 1181 | 191 | I | Protocadherin gamma-A1 precursor (PCDH-gamma-A1).
39 | HIX0028880 | HIT000383156 | AB065439 | 1 | 3 | NA | 1339 | 313 | I | Apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 2.
40 | HIX0028944 | HIT000383158 | AB065441 | 1 | 3 | NA | 1330 | 339 | I | UNC5C-like protein (Protein unc-5 homolog C-like) (ZU5 and death domain-containing protein).
41 | HIX0029602 | HIT000383162 | AB065447 | 1 | 3 | NA | 1337 | 243 | I | Myelin-oligodendrocyte glycoprotein precursor.
42 | HIX0028956 | HIT000383265 | AB065597 | 1 | 1 | NA | 1339 | 313 | I | Cytochrome b-c1 complex subunit 6, mitochondrial precursor (Ubiquinol-cytochrome c reductase complex 11 kDa protein) (Cytochrome c1 non-heme 11 kDa protein) (Mitochondrial hinge protein) (Complex III subunit VIII) (Complex III subunit 6).
43 | HIX0028930 | HIT000383276 | AB065611 | 1 | 3 | NA | 1354 | 318 | I | Trem-like transcript 2 protein precursor (TLT-2) (Triggering receptor expressed on myeloid cells-like protein 2).
44 | HIX0028899 | HIT000383279 | AB065614 | 1 | 3 | NA | 1351 | 317 | I | Triggering receptor expressed on myeloid cells 2 precursor (TREM-2) (Triggering receptor expressed on monocytes 2).
45 | HIX0028898 | HIT000383280 | AB065616 | 1 | 3 | NA | 1339 | 313 | I | Trem-like transcript 1 protein precursor (TLT-1) (Triggering receptor expressed on myeloid cells-like protein 1).
46 | HIX0028925 | HIT000383289 | AB065630 | 1 | 1 | NA | 1354 | 318 | I | Protein 4.1 (Band 4.1) (P4.1) (EPB4.1) (4.1R). Isoform 3.
47 | HIX0029603 | HIT000383291 | AB065632 | 1 | 1 | NA | 1400 | 308 | I | Delta-type opioid receptor (DOR-1).
48 | HIX0028878 | HIT000383299 | AB065642 | 1 | 1 | NA | 2084 | 313 | I | Glucocorticoid modulatory element-binding protein 1 (GMEB-1) (Parvovirus initiation factor p96) (PIF p96) (DNA-binding protein p96PIF). Isoform 2.
49 | HIX0028896 | HIT000383302 | AB065645 | 1 | 1 | NA | 1357 | 332 | I | Cytochrome P450 4B1 (EC (CYPIVB1) (P450-HP).
50 | HIX0028936 | HIT000383303 | AB065646 | 1 | 1 | NA | 1369 | 323 | I | Phosphatidylinositol 3-kinase regulatory subunit gamma (PI3-kinase p85 subunit gamma) (PtdIns-3-kinase p85-gamma) (p55PIK).
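For readers who want to process the summary file, the following is a minimal Python sketch of parsing one record. It assumes the file is tab-delimited with one transcript per line and the ten columns in the order of the table header (without a leading row number); that layout is not stated on this page. The names `SummaryRow` and `parse_summary_line` are hypothetical, and the sample line is built from the RER1 entry (transcript length 4343, CDS length 666).

```python
from typing import NamedTuple


class SummaryRow(NamedTuple):
    """One record of the annotation summary (hypothetical record type)."""
    hix: str
    hit: str
    acc: str
    chromosome: str
    evolutionary_conservation: str
    expression_frequency: str
    transcript_length_bp: int
    cds_length_aa: int
    similarity_category: str
    definition: str


def parse_summary_line(line: str) -> SummaryRow:
    """Split one line on tabs (assumed delimiter) and convert the two lengths to int."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 10:
        raise ValueError(f"expected 10 tab-separated fields, got {len(fields)}")
    return SummaryRow(
        fields[0], fields[1], fields[2], fields[3], fields[4], fields[5],
        int(fields[6]), int(fields[7]), fields[8], fields[9],
    )


# Sample line assembled by hand from the RER1 entry; not read from the real file.
example = "\t".join([
    "HIX0000180", "HIT000000711", "AB023211", "1", "1", "High",
    "4343", "666", "I", "Protein RER1.",
])
row = parse_summary_line(example)
print(row.hit, row.transcript_length_bp, row.cds_length_aa)
```

A simple consistency check on parsed rows is that the CDS length in amino acids, multiplied by three, should not exceed the transcript length in base pairs.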
2. evolutionary conservation and coding-potential
Result of dN/dS window analysis (20-codon window with 1-codon step)
Not-parenthesized: Selection found (P<0.01)
Parenthesized: No-selection found
   0/0: dN=dS=0
   inf: dS=0 (infinite)
   sat: too many substitutions (saturated)
   <20: sequence shorter than 20 aa
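The notation above could be decoded as in the following Python sketch. It assumes each dN/dS cell holds either a number or one of the four special codes, optionally wrapped in parentheses; the exact cell format in the file is not documented here. The names `DnDs` and `parse_dnds` are hypothetical, and treating the selection status of a special code as unknown is an assumption of this sketch.

```python
from typing import NamedTuple, Optional

# Special codes from the legend and their meanings.
SPECIAL_CODES = {
    "0/0": "dN = dS = 0",
    "inf": "dS = 0 (ratio is infinite)",
    "sat": "too many substitutions (saturated)",
    "<20": "sequence shorter than 20 aa",
}


class DnDs(NamedTuple):
    value: Optional[float]            # numeric dN/dS, or None for a special code
    selection_found: Optional[bool]   # None when a special code is present (assumption)
    note: Optional[str]               # meaning of the special code, if any


def parse_dnds(cell: str) -> DnDs:
    """Decode one dN/dS cell: parentheses mean no selection was found."""
    text = cell.strip()
    parenthesized = text.startswith("(") and text.endswith(")")
    if parenthesized:
        text = text[1:-1].strip()
    if text in SPECIAL_CODES:
        return DnDs(None, None, SPECIAL_CODES[text])
    return DnDs(float(text), not parenthesized, None)


# Made-up cell values, for illustration only.
for cell in ["0.12", "(0.85)", "inf", "(sat)"]:
    print(cell, parse_dnds(cell))
```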
[Download (2)]
File name: hinvdb_nedo_evo_conservation.txt.gz [Download]

01: HIT
02: Coding potential [1: coding (conserved), 2: coding (partially conserved), 3: not significantly conserved, ND: not detectable]
03: Coding potential by comparison with chimpanzee [1, 2, 3, ND]
04: Coding potential by comparison with orangutan [1, 2, 3, ND]
05: Coding potential by comparison with macaque [1, 2, 3, ND]
06: dN/dS window analysis [Selection-found, No-selection-found]
07: Average dN/dS* - chimpanzee
08: Average dN/dS* - orangutan
09: Average dN/dS* - macaque
10: Coverage (COV) analyzed for dN/dS - chimpanzee
11: Coverage (COV) analyzed for dN/dS - orangutan
12: Coverage (COV) analyzed for dN/dS - macaque
13: Amino acid identity x coverage (IDxCOV) by fasty - chimpanzee
14: Amino acid identity x coverage (IDxCOV) by fasty - orangutan
15: Amino acid identity x coverage (IDxCOV) by fasty - macaque
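The 15 fields listed above could be read into named records as sketched below in Python. The sketch assumes the file is gzip-compressed, tab-delimited text with no header line; neither is confirmed on this page. The field names in `EVO_FIELDS` and the function `read_evo` are hypothetical, and the demonstration runs on a synthetic in-memory record with a placeholder ID rather than on the real download.

```python
import csv
import gzip
import io

# Hypothetical short names for the 15 documented fields, in listed order.
EVO_FIELDS = [
    "hit",
    "coding_potential",
    "coding_potential_chimp",
    "coding_potential_orang",
    "coding_potential_macaque",
    "dnds_window",
    "avg_dnds_chimp",
    "avg_dnds_orang",
    "avg_dnds_macaque",
    "cov_chimp",
    "cov_orang",
    "cov_macaque",
    "idxcov_chimp",
    "idxcov_orang",
    "idxcov_macaque",
]

# Meaning of the coding-potential codes given in the field list.
CODING_POTENTIAL = {
    "1": "coding (conserved)",
    "2": "coding (partially conserved)",
    "3": "not significantly conserved",
    "ND": "not detectable",
}


def read_evo(stream):
    """Yield one dict per line, adding a readable label for field 02."""
    reader = csv.DictReader(
        stream, fieldnames=EVO_FIELDS, delimiter="\t", quoting=csv.QUOTE_NONE
    )
    for record in reader:
        code = record["coding_potential"]
        record["coding_potential_label"] = CODING_POTENTIAL.get(code, "unknown")
        yield record


# Synthetic record (placeholder ID and made-up values), compressed in memory
# so the example runs without the real file.
sample = "\t".join([
    "HIT000000000", "1", "1", "2", "ND", "Selection-found",
    "0.10", "0.20", "inf", "0.9", "0.8", "0.7", "0.95", "0.90", "0.85",
]) + "\n"
compressed = gzip.compress(sample.encode("utf-8"))

with gzip.open(io.BytesIO(compressed), mode="rt", encoding="utf-8", newline="") as handle:
    records = list(read_evo(handle))

print(records[0]["hit"], records[0]["coding_potential_label"])
```

To read the downloaded file instead, the `io.BytesIO(...)` argument would be replaced by the path to `hinvdb_nedo_evo_conservation.txt.gz`.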
3. expression frequency based on EST and CAGE tags
[Download (3)]
File name: hinvdb_nedo_cage_freq.txt.gz [Download]

01: HIT
02: HIX
03: ACC#
04: Sequence data provider
05: Sequence length (nucleotide)
06: Sequence length (translation)
07: Number of overlapping CAGE tags at HIT start (±10 bp)
08: Number of overlapping CAGE tags at HIT start (±50 bp)
09: Number of overlapping CAGE tags at HIT start (±100 bp)
10: Number of overlapping CAGE tags at HIT start (±500 bp)
11: Number of overlapping CAGE tags at HIT start (±1000 bp)
12: Number of overlapping CAGE tags at HIT start-end
13: Classification of CAGE expression frequency (high, medium, low, NA)
14: Chr. number
15: Category
16: Definition
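To illustrate what the window-based CAGE tag counts represent, here is a minimal Python sketch that counts tags within each window around a transcript start. It is not the H-InvDB pipeline. It assumes each tag is reduced to a single coordinate on the same chromosome and strand as the transcript, and that window boundaries are inclusive. The function name and all coordinates are made up for illustration.

```python
from typing import Dict, Iterable

# Window half-widths in bp, taken from the field list above.
WINDOWS_BP = (10, 50, 100, 500, 1000)


def count_tags_near_start(start: int, tag_positions: Iterable[int]) -> Dict[int, int]:
    """Count tags whose position lies within +/- w bp of the start, for each window w."""
    positions = list(tag_positions)
    return {
        w: sum(1 for p in positions if abs(p - start) <= w)
        for w in WINDOWS_BP
    }


# Made-up transcript start and tag coordinates.
counts = count_tags_near_start(5000, [4995, 5003, 5040, 5120, 5600, 7000])
print(counts)  # {10: 2, 50: 3, 100: 3, 500: 4, 1000: 5}
```

Because the windows are nested, counts computed this way can only stay equal or grow as the window widens, which gives a quick sanity check on the corresponding columns.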
[Download (4)]
File name: hinvdb_nedo_est_freq.txt.gz [Download]
01: HIT
02: HIX
03: ACC#
04: 5'-EST support (No. of overlapped ESTs)
05: 3'-EST support (No. of overlapped ESTs)
06: CDS EST support (No. of overlapped ESTs)
07: HIT EST support (No. of overlapped ESTs)
08: Classification of EST expression frequency (high, medium, low)
09: Chr. number
10: Category
11: Definition