H-InvDB x AHG DB
Transcript view
H-InvDB_9.0 released on May 27, 2015.
Search by for Advanced Search
Home Quick guide Navi BLAST Site map Download Contact us Help
H-Invitational ID: HIT000021717 Accession number: AK096862 Created date: 26-Mar-2013 Last modified: 27-May-2015
Definition: Similar to Transcription factor 4 isoform g.
 
 

Transcript original information
Accession number AK096862.1
CAGE tag ID NA
EST ID NA
Clone Number PUAEN2008939
Experimental resources NBRC: NITE Biological Resource Center  NBRC ; HGPD: Human Gene and Protein Database HGPD ; Antibody: searching human antibodies at "BIO-kaimono.com" Antibody (TCF4) ; Catalog: searching experimental product catalogs at "BIO-kaimono.com" Catalog (TCF4);
Sequence data provider Project:FLJ; Provider:FLJ/HRI; 
Annotation project H-Invitational FLcDNA
Length of cDNA 2396[bp] (No. of exon:18)[A:686 T:536 G:551 C:623]
Devision HUM
Molecular type mRNA
Library origin Cell type pulmonary artery endothelial cells (HPAEC)
Tissue type NA
Develpmental stage NA
Mini-G
Sequence quality information
CDS feature Complete CDS
Kozak sequence NA
PolyA NA
Vector/adapter sequence NA
Frame shift NA
Remaining intron NA
Splice site acceptor (NAGNAG) CAGCAG;  TAGCAG; 
Transcript quality feature NA
Notes NA

Gene structure information  G-integra H-DBAS cDNA-genome alignment
H-Inv cluster ID HIX0022913
Genomic location  G-integra Help Chromosome 18
Location 18q21.2
Position 52894920- 53178000
Strand -
Possible duplicated location(s) NA
Gene structure 18 exon(s)
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:6925
KEGG GENES KEGG GENES(6925)
GeneCard GeneCardTCF4*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links H-DBASH-DBAS G-integraG-integra cDNA-genome alignmentcDNA-genome alignment

Predicted CDS information
HIP ID HIP000045306
Predicted CDS 173..2050;  625[aa];  Orientation:+2; 
Codon Adaptation Index (CAI). 0.761
Database links RefSeq NP_001230160
UniProt NA
CCDS CCDS58628

Motif information
ORF

length(625),orf(173:2050)
MEEDSRDVEDRSSSGSWGNGGHPSPSRNYGDGTPYDHMTSRDLGSHDNLS
PPFVNSRIQSKTERGSYSSYGRESNLQGCHQQSLLGGDMDMGNPGTLSPT
KPGSQYYQYSSNNPRRRPLHSSAMEVQTKKVRKVPPGLPSSVYAPSASTA
DYNRDSPGYPSSKPATSTFPSSFFMQDGHHSSDPWSSSSGMNQPGYAGML
GNSSHIPQSSSYCSLHPHERLSYPSHSSADINSSLPPMSTFHRSGTNHYS
TSSCTPPANGTDSIMANRGSGAAGSSQTGDALGKALASIYSPDHTNNSFS
SNPSTPVGSPPSLSAGTAVWSRNGGQASSSPNYEGPLHSLQSRIEDRLER
LDDAIHVLRNHAVGPSTAMPGGHGDMHGIIGPSHNGAMGGLGSGYGTGLL
SANRHSLMVGTHREDGVALRGSHSLLPNQVPVPQLPVQSATSPDLNPPQD
PYRGMPPGLQGQSVSSGSSEIKSDDEGDENLQDTKSSEDKKLDDDKKDIK
SITSNNDDEDLTPEQKAEREKERRMANNARERLRVRDINEAFKELGRMVQ
LHLKSDKPQTKLLILHQAVAVILSLEQQVRERNLNPKAACLKRREEEKVS
SEPPPLSLAGPHPGMGDASNHMGQM*
a.a.
length
InterPro Name
length(68), motif(514:581) 68 IPR011598 Myc-type, basic helix-loop-helix (bHLH) domain [Domain]
length(54), motif(522:575) 54 IPR011598 Myc-type, basic helix-loop-helix (bHLH) domain [Domain]
length(54), motif(523:576) 54 IPR011598 Myc-type, basic helix-loop-helix (bHLH) domain [Domain]
length(60), motif(523:582) 60 IPR011598 Myc-type, basic helix-loop-helix (bHLH) domain [Domain]
length(54), motif(528:581) 54 IPR011598 Myc-type, basic helix-loop-helix (bHLH) domain [Domain]

Gene function information  Gene family PPI viewer Similarity Search Tool TACT
H-Inv ID HIT000021717
H-Inv cluster ID Locus viewHIX0022913
Accession number AK096862.1
CAGE tag ID NA
EST ID NA
Transcript feature  Help NO; 
Coding potential  Help Protein coding; 
Definition Similar to Transcription factor 4 isoform g.
Similarity category  Help Category: Similar to known protein(Category II).
Similar to known protein (NP_001230160)  [Identity/coverage = 100.0%/100.0%] to Homo sapiens protein.
Experimental evidence Protein evidence
PubMed ID NA
Gene family/group Gene family H-Inv gene family/group ID NA
Gene family/group name NA
Evidence motif (InterPro) ID NA
Gene symbol/name HGNC symbol TCF4
HGNC aliases NA
HGNC name transcription factor 4
DDBJ NA
UniProt NA
EC number NA
GGDB
(GlycoGene Database)
Gene symbol NA
Familly NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) PPI viewer H-Inv protein ID HIP000045306
No. of interaction 7
Interaction partner(s) HIP000002890HIP000034631HIP000034631HIP000055065HIP000064410HIP000176869HIP000194532
BIND 150982; 
DIP 17E;  182238E; 
MINT MINT-4508065;  MINT-4508268;  MINT-4790867; 
HPRD 00011;  00241;  00242;  00243;  00286;  00536;  00590;  01071;  01166;  01302;  01414;  01435;  01753;  02437;  02608;  03428;  04000;  04078;  04694;  08935;  08980;  08995;  09581;  10423;  13634; 
IntAct NA
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:6925
KEGG GENES KEGG GENES(6925)
GeneCard GeneCardTCF4*GeneCards is provided free to academic non-profit institutions.
etc H-GOLDHuman-Gene diversity Of Life-style related Diseases
Curation status Auto-annotated
Notes NA
Related H-InvDB links Gene familyGene family;  Similarity Search ToolSimilarity Search Tool;  TACTTACT
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome. 
NA

Gene ontology information
Molecular function protein dimerization activity (GO:0046983); 

Subcellular localization information  Last modified:27-May-2015
WoLF PSORT nuclear;  cytosol; 
Target P Other
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb LIFEdb; 
JRE-1.4.0 or later is required. Download JRE at Sun's web site.

Protein structure information (GTOP) GTOP Last modified:27-May-2015
Start End PDB_ID E-value Identity Coverage SCOP_ID
518 595 1am9A 5e-12 26.7 75/80 a.38.1.1
Related H-InvDB links GTOP GTOP

Gene expression information  H-ANGEL DNAProbeLocator Last modified:27-May-2015
Tissue-specific expression  H-ANGEL NA
Probe
information DNAProbeLocator
AceGene AGhsA090517; 
Affymetrix
GeneChip
HG-Focus 203753_at; 
HG-U133 203753_at;  222146_s_at; 
HG-U133A 203753_at;  222146_s_at; 
HG-U133A_2 203753_at;  222146_s_at; 
HG-U133B NA
HG-U133_Plus_2 203753_at;  222146_s_at; 
HG-U95 56694_at;  68263_at;  89997_at; 
HG-U95A NA
HG-U95B 56694_at; 
HG-U95C NA
HG-U95D 68263_at;  89997_at; 
HG-U95E NA
HG-U95Av2 NA
HuEx-1_0 3808862;  3808865;  3808866;  3808870;  3808871;  3808890;  3808891;  3808892;  3808897;  3808899;  3808904;  3808905;  3808909;  3808922;  3808923;  3808938;  3808949;  3808950;  3808972; 
HuGeneFL M74719_at; 
Agilent Human 1A Oligo Microarray:PGID215 A_23_P27332; 
Whole Human Genome Oligo Microarray:PGID247 A_23_P27332; 
Related H-InvDB links H-ANGELH-ANGEL DNAProbeLocatorDNAProbeLocator

Disease/pathology information  DiseaseInfo Viewer LEGENDA Last modified:27-May-2015
Disease relation Disease name: Pitt-Hopkins syndrome (610954); 
Related information in OMIM OMIM ID:  602272;  Title: TRANSCRIPTION FACTOR 4
Co-localized orphan diseases NA
Disease related mutation MutationView:  602272
JRE-1.4.0 or later is required.Download JRE at Sun's web site.
Literature-Extracted GENe-Disease Associations (LEGENDA) LEGENDA Gene name Entrez Gene ID:(6925)
Disease Entrez Gene ID:(6925)
Substance Entrez Gene ID:(6925)
Related H-InvDB links DiseaseInfo ViewerDiseaseInfo ViewerLEGENDALEGENDA

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information  VaryGene Repeat mask viewer
 Single Nucleotide Polymorphism (SNP) and indel  VaryGene
Location Variation dbSNP ID Strand CDS/UTR Translation
43 .. 43 A/G rs12956259 - 5'UTR
64 .. 64 G/T rs141065203 - 5'UTR
109 .. 109 C/T rs146425715 - 5'UTR
226 .. 226 G/A rs151203913 - CDS Synonymous[Gly18Gly]
255 .. 255 A/G rs200359873 - CDS Nonsynonymous[Asn28Ser]
276 .. 276 A/G rs76646268 - CDS Nonsynonymous[Tyr35Cys]
284 .. 284 A/G rs113943820 - CDS Nonsynonymous[Met38Val]
315 .. 315 A/G rs143244149 - CDS Nonsynonymous[Asn48Ser]
368 .. 368 T/C rs112222111 - CDS Nonsynonymous[Ser66Pro]
387 .. 387 G/A rs139876825 - CDS Nonsynonymous[Arg72Lys]
458 .. 458 A/G rs146412750 - CDS Nonsynonymous[Thr96Ala]
460 .. 460 C/G rs201061418 - CDS Synonymous[Thr96Thr]
512 .. 512 C/A rs200889338 - CDS Nonsynonymous[Pro114Thr]
622 .. 622 C/A rs143993583 - CDS Synonymous[Ala150Ala]
630 .. 630 A/G rs148573556 - CDS Nonsynonymous[Asn153Ser]
650 .. 650 C/T rs144835402 - CDS Nonsynonymous[Pro160Ser]
706 .. 706 C/T rs149861305 - CDS Synonymous[Gly178Gly]
733 .. 733 C/A rs200800656 - CDS Synonymous[Ser187Ser]
752 .. 752 C/G rs139859596 - CDS Nonsynonymous[Pro194Ala]
813 .. 813 G/C rs147305084 - CDS Nonsynonymous[Ser214Thr]
843 .. 843 C/T rs201776550 - CDS Nonsynonymous[Pro224Leu]
853 .. 853 C/T rs143896861 - CDS Synonymous[Ser227Ser]
862 .. 862 C/T rs138403996 - CDS Synonymous[Asp230Asp]
990 .. 990 C/T rs147445499 - CDS Nonsynonymous[Ala273Val]
991 .. 991 C/T rs200115299 - CDS Synonymous[Ala273Ala]
1012 .. 1012 T/C rs142998298 - CDS Synonymous[Asp280Asp]
1133 .. 1133 T/C rs113615316 - CDS Nonsynonymous[Ser321Pro]
1157 .. 1157 T/G rs75109886 - CDS Nonsynonymous[Ser329Ala]
1159 .. 1159 G/A rs148308964 - CDS Synonymous[Ser329Ser]
1199 .. 1199 C/T rs121909122 + CDS AA-STOP[Arg343*]
1203 .. 1203 T/C rs143594544 - CDS Nonsynonymous[Ile344Thr]
1240 .. 1240 T/C rs200225114 - CDS Synonymous[His356His]
1264 .. 1264 C/T rs201481787 - CDS Synonymous[Gly364Gly]
1291 .. 1291 T/C rs148909575 - CDS Synonymous[His373His]
1329 .. 1329 G/T rs186508321 - CDS Nonsynonymous[Gly386Val]
1332 .. 1332 C/T rs180788300 - CDS Nonsynonymous[Ala387Val]
1363 .. 1363 C/T rs200112082 - CDS Synonymous[Thr397Thr]
1396 .. 1396 G/A rs11660217 - CDS Nonsynonymous[Met408Ile]
1400 .. 1400 G/A rs138570124 - CDS Nonsynonymous[Gly410Arg]
1465 .. 1465 G/C rs143944746 - CDS Synonymous[Pro431Pro]
1584 .. 1584 T/C rs112174081 - CDS Nonsynonymous[Ile471Thr]
1597 .. 1597 C/T rs140862252 - CDS Synonymous[Asp475Asp]
1623 .. 1623 C/T rs202025804 - CDS Nonsynonymous[Thr484Met]
1685 .. 1685 A/G rs146454287 - CDS Nonsynonymous[Asn505Asp]
1717 .. 1717 G/A rs71368997 - CDS Synonymous[Gln515Gln]
1738 .. 1738 G/A rs144068462 - CDS Synonymous[Glu522Glu]
1767 .. 1767 G/C rs121909123 + CDS Nonsynonymous[Arg532Pro]
1772 .. 1772 C/T rs121909120 + CDS Nonsynonymous[Arg534Trp]
1773 .. 1773 G/A rs121909121 + CDS Nonsynonymous[Arg534Gln]
1900 .. 1900 G/A rs140078086 - CDS Synonymous[Glu576Glu]
1930 .. 1930 G/A rs200096408 - CDS Synonymous[Pro586Pro]
1939 .. 1939 G/A rs151150677 - CDS Synonymous[Ala589Ala]
1957 .. 1957 G/A rs76956936 - CDS Synonymous[Glu595Glu]
1975 .. 1975 A/G rs8766 + CDS Synonymous[Ser601Ser]
1986 .. 1986 C/G rs148802110 - CDS Nonsynonymous[Pro605Arg]
1999 .. 1999 C/T rs147289056 - CDS Synonymous[Ala609Ala]
2072 .. 2072 G/A rs182372608 - 3'UTR
2085 .. 2085 A/T rs190089558 - 3'UTR
2103 .. 2103 C/A rs111947783 - 3'UTR
2144 .. 2144 C/G rs141586028 - 3'UTR
2244 ^ 2245 -/A rs201242245 - 3'UTR
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat  Repeat mask viewer
No data available
Database links H-GOLDHuman-Gene diversity Of Life-style related Diseases(H-GOLD)
Related H-InvDB links VaryGeneVaryGene;  Repeat mask viewerRepeat Mask Viewer