H-InvDB_9.0 released on May 27, 2015.
Search by
Keyword
H-Inv ID (HIT)
H-Inv cluster ID (HIX)
H-Inv protein ID (HIP)
H-Inv gene family/group (HIF)
Accession number
Chromosome number
Chromosome band
Definition*
Data source ID
---
CCDS ID
dbSNP ID (rs number)
EC number
Ensembl ID
EntrezGene ID
FR ID
FR Accession number
GO ID
GO name*
HGNC gene symbol
HGNC gene name*
InterPro ID
InterPro name*
OMIM ID
OMIM title*
Pathway ID
Pathway name*
RefSeq (gene) ID
RefSeq (protein) ID
SCOP ID
UniProt
for
Advanced Search
Home
Quick guide
Navi
BLAST
Site map
Download
Contact us
Help
H-Invitational ID:
HIT000089630
Accession number:
BC018583
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Similar to Polycomb protein SUZ12; Chromatin precipitated E2F target 9 protein; ChET 9 protein; Joined to JAZF1 protein; Suppressor of zeste 12 protein homolog;
Select format
Flat file
XML file
Nucleotide sequence fasta
Protein sequence fasta
Transcript original information
Accession number
BC018583.1
CAGE tag ID
NA
EST ID
NA
Clone Number
IMAGE:4155691
Experimental resources
NBRC
;
HGPD
;
Antibody (SUZ12)
;
Catalog (SUZ12)
;
Sequence data provider
NA
Annotation project
NA
Length of cDNA
4024[bp] (No. of exon:15)[A:1294 T:1185 G:808 C:737]
Devision
HTC
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
Brain, anaplastic oligodendroglioma with 1p/19q loss
Develpmental stage
NA
Sequence quality information
CDS feature
Complete CDS
Kozak sequence
NA
PolyA
Site: 3961(+)
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NA
Notes
NA
TTTTTTTTTCCTCCCTCCTTCCCTCCTCTCCTCCTCCCTTCCCTTCCCCT CTCCTCCCCTCTCTCCTCCTTCCCCCCTCGGTCCGCCGGAGCCTGCTGGG GCGAGCGGTTGGTATTGCAGGCGCTTGCTCTCCGGGGCCGCCCGGCGGGT AGCTGGCGGGGGGAGGAGGCAGGAACCGCGATGGCGCCTCAGAAGCACGG CGGTGGGGGAGGGGGCGGCTCGGGGCCCAGCGCGGGGTCCGGGGGAGGCG GCTTCGGGGGTTCGGCGGCGGTGGCGGCGGCGACGGCTTCGGGCGGCAAA TCCGGCGGCGGGAGCTGTGGAGGGGGTGGCAGTTACTCGGCCTCCTCCTC CTCCTCCGCGGCGGCAGCGGCGGGGGCTGCGGTGTTACCGGTGAAGAAGC CGAAAATGGAGCACGTCCAGGCTGACCACGAGCTTTTCCTCCAGGCCTTT GAGAAGCCAACACAGATCTATAGATTTCTTCGAACTCGGAATCTCATAGC ACCAATATTTTTGCACAGAACTCTTACTTACATGTCTCATCGAAACTCCA GAACAAACATCAAAAGCTTGTCAGCTCATTTGCAGCTTACGTTTACTGGT TTCTTCCACAAAAATGATAAGCCATCACCAAACTCAGAAAATGAACAAAA TTCTGTTACCCTGGAAGTCCTGCTTGTGAAAGTTTGCCACAAAAAAAGAA AGGATGTAAGTTGTCCAATAAGGCAAGTTCCCACAGGTAAAAAGCAGGTG CCTTTGAATCCTGACCTCAATCAAACAAAACCCGGAAATTTCCCGTCCCT TGCAGTTTCCAGTAATGAATTTGAACCTAGTAACAGCCATATGGTGAAGT CTTACTCGTTGCTATTTAGAGTGACTCGTCCAGGAAGAAGAGAGTTTAAT GGAATGATTAATGGAGAAACCAATGAAAATATTGATGTCAATGAAGAGCT TCCAGCCAGAAGAAAACGAAATCGTGAGGATGGGGAAAAGACATTTGTTG CACAAATGACAGTATTTGATAAAAACAGGCGCTTACAGCTTTTAGATGGG GAATATGAAGTAGCCATGCAGGAAATGGAAGAATGTCCAATAAGCAAGAA AAGAGCAACATGGGAGACTATTCTTGATGGGAAGAGGCTGCCTCCATTCG AAACATTTTCTCAGGGACCTACGTTGCAGTTCACTCTTCGTTGGACAGGA GAGACCAATGATAAATCTACGGCTCCTATTGCCAAACCTCTTGCCACTAG AAATTCAGAGAGTCTCCATCAGGAAAACAAGCCTGGTTCAGTTAAACCTA CTCAAACTATTGCTGTTAAAGAATCATTGACTACAGATCTACAAACAAGA AAAGAAAAGGATACTCCAAATGAAAACCGACAAAAATTAAGAATATTTTA TCAGTTTCTCTATAACAACAATACAAGGCAACAAACTGAAGCAAGAGATG ACCTGCATTGCCCTTGGTGTACTCTGAACTGCCGCAAACTTTATAGTTTA CTCAAGCATCTTAAACTCTGCCATAGCAGATTTATCTTCAACTATGTTTA TCATCCAAAAGGTGCTAGGATAGATGTTTCTATCAATGAGTGTTATGATG GCTCCTATGCAGGAAATCCTCAGGATATTCATCGCCAACCTGGATTTGCT TTTAGTCGCAACGGACCAGTTAAGAGAACACCTATCACACATATTCTTGT GTGCAGGCCAAAACGAACAAAAGCAAGCATGTCTGAATTTCTTGAATCTG AAGATGGGGAAGTAGAACAGCAAAGAACATATAGTAGTGGCCACAATCGT CTGTATTTCCATAGTGATACCTGCTTACCTCTCCGTCCACAAGAAATGGA AGTAGATAGTGAAGATGAAAAGGATCCTGAATGGCTAAGAGAAAAAACCA TTACACAAATTGAAGAGTTTTCTGATGTTAATGAAGGAGAGAAAGAAGTG ATGAAACTCTGGAATCTCCATGTCATGAAGCATGGGTTTATTGCTGACAA TCAAATGAATCATGCCTGTATGCTGTTTGTAGAAAATTATGGACAGAAAA TAATTAAGAAGAATTTATGTCGAAACTTCATGCTTCATCTAGTCAGCATG CATGACTTTAATCTTATTAGCATAATGTCAATAGATAAAGCTGTTACCAA GCTCCGTGAAATGCAGCAAAAATTAGAAAAGGGGGAATCTGCTTCCCCTG CAAACGAAGAAATAACTGAAGAACAAAATGGGACAGCAAATGGATTTAGT GAAATTAACTCAAAAGAGAAAGCTTTGGAAACAGATAGTGTCTCAGGGGT TTCAAAACAGAGCAAAAAACAAAAACTCTGAAAAGCTCTAACCCCATGTT ATGGACAAACACTGAAATTACATTTTAGGGAATTCATCCTCTAAGAATTA TGTTTTTGTTTTTAATCATATGTTCCAAACAGGCACTGTTAGATGAAGTA AATGATTTCAACAAGGATATTTGTATCAGGGTTCTACTTCACTTCATTAT GCAGCATTACATGTATATCACTTTTATTGATGTCATTAAAACATTCTGTA CTTTAAGCATGAAAAGCAATATTTCAAAGTATTTTTAAACTCAACAAATG TCATCAAATATGTTGAATTGATCTAGAAATTATTTCATATATAAATCAGA ATTTTTTTGCATTTATGAACGGCTGTTTTTCTACTTTGTAATTGTGAGAC ATTTTCTTGGGGAGGGAAAATTGGAATGGTTCCCTTTTTTAGAAATTGAA GTGGTCTTCATATGTCAACTACAGAAAAGGAAAAAAATAGAAATTGAAGG ATTTTTATGAAATTATATTGCATTACTATTTGCAGTCAAACTTTGATCCT TGTTTTTGAAATCATTTGTCAATTCGGAATGAAAAATTATAATGTAATTT TACATTACATAAGTTCCTTTTACAATTAAAAAATAGCACTTCTTCATCTT ATGCCTGTTTGAGAAGATATTAAATTTTCACATTGTTGACAGTGAAATGC TATGTTGGTTTATAAGATTACAGACCATTTGTTTTCATGTGGATAATTTT AGTGCATTGCTCACCCGGTATGTTTTTTTTTTTTAACTTGAACATTTTGC TTGTTTTGTTTTTCTTTTTTAATTAGATAATCACACGGAAAATTAAGCTG TTCATATCTTTAAATTAGGATTGCAAACCAAGGAAAGAACGCATTTGAGA TTTTAAGATGTCACTTATAAGGGGAGAAGTGTTCTTAAAAAGTCAACCAG AAAACTGTTATGCCTTTTATTTGTTTGCAAGGATGTCTTTGTAATGTGTT TCATGAATAGAATATCCAATAGAGATAAGCTGACTTGAATCATTTTGAGC AATTTTGCCCTGTGTTATATGTGTTTCACGCACATATTTGCAGTTGGATT TTCTCCAACAGAAAGTGGATTCACTACTGGCACATTAACAAGCACCAATA GGTTTTTATTCCAACTCCGAGCACTGTGGTTGAGTAACATCACCTCAATT TTTTATTATCCTTAAAGATATTGCATTTTCATATTCTTTATTTATAAAGG ATCAATGCTGCTGTAAATACAGGTATTTTTAATTTTAAAATTTCATTCCA CCACCATCAGATGCAGTTCCCTATTTTGTTTAATGAAGGGATATATAAGC TTTCTAATGGTGTCTTCAGAAATTTATAAAATGTAAATACTGATTTGACT GGTCTTTAAGATGTGTTTAACTGTGAGGCTATTTAACGAATAGTGTGGAT GTGATTTGTCATCCAGTATTAAGTTCTTAGTCATTGATTTTTGTGTTTAA AAAAAAATAGGAAAGAGGGAAACTGCAGCTTTCATTACAGATTCCTTGAT TGGTAAGCTCTCCAAATGATGAGTTCTAGTAAACTCTGATTTTTGCCTCT GGATAGTAGATCTCGAGCGTTTATCTCGGGCTTTAATTTGCTAAAGCTGT GCACATATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAA
Gene structure information
H-Inv cluster ID
HIX0013689
Genomic location
Chromosome
17
Location
17q11.2
Position
30264086- 30327650
Strand
+
Possible duplicated location(s)
NA
Gene structure
15 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:23512
;
KEGG GENES
KEGG GENES(23512)
;
GeneCard
SUZ12
;
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS
;
G-integra
;
cDNA-genome alignment
;
Predicted CDS information
HIP ID
HIP000031496
Predicted CDS
181..2331; 716[aa]; Orientation:+1;
Codon Adaptation Index (CAI).
0.699
MAPQKHGGGGGGGSGPSAGSGGGGFGGSAAVAAATASGGKSGGGSCGGGG SYSASSSSSAAAAAGAAVLPVKKPKMEHVQADHELFLQAFEKPTQIYRFL RTRNLIAPIFLHRTLTYMSHRNSRTNIKSLSAHLQLTFTGFFHKNDKPSP NSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTK PGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNEN IDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEME ECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPI AKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENR QKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSR FIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRT PITHILVCRPKRTKASMSEFLESEDGEVEQQRTYSSGHNRLYFHSDTCLP LRPQEMEVDSEDEKDPEWLREKTITQIEEFSDVNEGEKEVMKLWNLHVMK HGFIADNQMNHACMLFVENYGQKIIKKNLCRNFMLHLVSMHDFNLISIMS IDKAVTKLREMQQKLEKGESASPANEEITEEQNGTANGFSEINSKEKALE TDSVSGVSKQSKKQKL*
Motif information
a.a.
length
InterPro
Name
24
IPR015880
Zinc finger, C2H2-like [Domain]
137
IPR019135
Polycomb protein, VEFS-Box [Domain]
Gene function information
H-Inv ID
HIT000089630
H-Inv cluster ID
HIX0013689
Accession number
BC018583.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
NO;
Splicing isoform
Coding potential
Protein coding;
Definition
Similar to Polycomb protein SUZ12; Chromatin precipitated E2F target 9 protein; ChET 9 protein; Joined to JAZF1 protein; Suppressor of zeste 12 protein homolog;
Similarity category
Category: Similar to known protein(Category II).
Similar to known protein (
Q15022
) [Identity/coverage = 100.0%/100.0%] to Homo sapiens (Human). protein.
Experimental evidence
Protein evidence
PubMed ID
8590280
;
11371647
;
11564866
;
12351676
;
12435631
;
14532106
;
15099518
;
15225548
;
15231737
;
15385962
;
15489334
;
15684044
;
16224021
;
16431907
;
16618801
;
16712789
;
17081983
;
17200670
;
17344414
;
18086877
;
18220336
;
18285464
;
18628979
;
18669648
;
18772439
;
19026781
;
19690332
;
20068231
;
21406692
;
ALL
;
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
SUZ12
HGNC aliases
"suppressor of zeste 12 homolog (Drosophila)"
HGNC name
SUZ12 polycomb repressive complex 2 subunit
DDBJ
NA
UniProt
SUZ12
EC number
NA
GGDB
(GlycoGene Database)
Gene symbol
NA
Familly
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000031496
No. of interaction
12
Interaction partner(s)
HIP000023351
;
HIP000026716
;
HIP000035533
;
HIP000042725
;
HIP000065282
;
HIP000080231
;
HIP000085996
;
HIP000094069
;
HIP000104226
;
HIP000146054
;
HIP000236471
;
HIP000253875
;
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:23512
;
KEGG GENES
KEGG GENES(23512)
;
GeneCard
SUZ12
;
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases
;
Curation status
Auto-annotated
Notes
NA
Related H-InvDB links
Gene family;
Similarity Search Tool;
TACT
;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
nuclear; cytosol;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
JRE-1.4.0 or later is required.
Download JRE at
Sun's web site.
Protein structure information (GTOP)
Last modified:27-May-2015
Start
End
PDB_ID
E-value
Identity
Coverage
SCOP_ID
522
644
1fvpA
3e-21
17.0
106/100
c.1.16.2
Related H-InvDB links
GTOP
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe
information
AceGene
AGhsB210513;
Affymetrix
GeneChip
HG-Focus
NA
HG-U133
212287_at;
HG-U133A
212287_at;
HG-U133A_2
212287_at;
HG-U133B
NA
HG-U133_Plus_2
212287_at;
HG-U95
40957_at;
HG-U95A
40957_at;
HG-U95B
NA
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
3717413; 3717420; 3717422; 3717424; 3717425; 3717426; 3717428; 3717430; 3717431; 3717432; 3752513; 3752515; 3752516; 3752517;
HuGeneFL
D63881_at;
Agilent
Human 1A Oligo Microarray:PGID215
A_23_P100883;
Whole Human Genome Oligo Microarray:PGID247
A_23_P100883; A_32_P24223;
Related H-InvDB links
H-ANGEL
;
DNAProbeLocator
;
Disease/pathology information
Last modified:27-May-2015
Disease relation
Disease name:NA
Related information in OMIM
OMIM ID:
606245
; Title: SUPPRESSOR OF ZESTE 12, DROSOPHILA, HOMOLOG OF
Co-localized orphan diseases
NA
Disease related mutation
NA
Literature-Extracted GENe-Disease Associations (LEGENDA)
Gene name
Entrez Gene ID:(23512)
Disease
Entrez Gene ID:(23512)
Substance
Entrez Gene ID:(23512)
Related H-InvDB links
DiseaseInfo Viewer
;
LEGENDA
;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location
Variation
dbSNP ID
Strand
CDS/UTR
Translation
101 .. 101
G/A
rs182208921
+
5'UTR
376 .. 376
G/A
rs199728576
+
CDS
Nonsynonymous[Ala66Thr]
391 .. 391
G/A
rs149833913
+
CDS
Nonsynonymous[Val71Met]
472 .. 472
A/G
rs79452603
+
CDS
Nonsynonymous[Arg98Gly]
481 .. 481
C/T
rs2627175
-
CDS
AA-STOP[Arg101*]
563 .. 563
A/G
rs79415083
+
CDS
Nonsynonymous[Lys128Arg]
758 .. 758
A/T
rs17339444
+
CDS
Nonsynonymous[Asn193Ile]
858 .. 858
G/A
rs145801482
+
CDS
Synonymous[Ser226Ser]
973 .. 973
C/T
rs199547628
+
CDS
Nonsynonymous[Arg265Cys]
980 .. 980
A/G
rs183650826
+
CDS
Nonsynonymous[Asp267Gly]
985 .. 985
G/A
rs148546138
+
CDS
Nonsynonymous[Glu269Lys]
1053 .. 1053
A/G
rs145438545
+
CDS
Synonymous[Glu291Glu]
1086 .. 1086
T/C
rs147681445
+
CDS
Synonymous[Cys302Cys]
1183 .. 1183
A/G
rs142354346
+
CDS
Nonsynonymous[Thr335Ala]
1189 .. 1189
C/T
rs146306622
+
CDS
Nonsynonymous[Arg337Cys]
1208 .. 1208
A/G
rs151019506
+
CDS
Nonsynonymous[Asn343Ser]
1344 .. 1344
A/C
rs140872258
+
CDS
Nonsynonymous[Gln388His]
1377 .. 1377
C/T
rs77143645
+
CDS
Synonymous[Asn399Asn]
1437 .. 1437
T/G
rs138584567
+
CDS
Synonymous[Thr419Thr]
1521 .. 1521
C/T
rs148296125
+
CDS
Synonymous[Cys447Cys]
1596 .. 1596
T/C
rs113231885
+
CDS
Synonymous[Tyr472Tyr]
1658 .. 1658
G/T
rs137920170
+
CDS
Nonsynonymous[Arg493Leu]
1661 .. 1661
A/G
rs142366304
+
CDS
Nonsynonymous[Asn494Ser]
1662 .. 1662
C/T
rs143863610
+
CDS
Synonymous[Asn494Asn]
1692 .. 1692
T/C
rs148804361
+
CDS
Synonymous[His504His]
1782 .. 1782
T/C
rs55832508
+
CDS
Synonymous[Tyr534Tyr]
1809 .. 1809
C/A
rs11551268
+
CDS
Nonsynonymous[Phe543Leu]
1818 .. 1818
T/C
rs201407775
+
CDS
Synonymous[Asp546Asp]
1995 .. 1995
T/A
rs150888728
+
CDS
Synonymous[Ala605Ala]
2043 .. 2043
A/G
rs138328609
+
CDS
Synonymous[Gly621Gly]
2064 .. 2064
T/C
rs200797131
+
CDS
Synonymous[Asn628Asn]
2135 .. 2135
A/G
rs112915103
+
CDS
Nonsynonymous[Asp652Gly]
2147 .. 2147
C/T
rs139363631
+
CDS
Nonsynonymous[Thr656Ile]
2204 .. 2204
A/G
rs145524414
+
CDS
Nonsynonymous[Asn675Ser]
2235 .. 2235
A/G
rs147697832
+
CDS
Synonymous[Thr685Thr]
2258 .. 2258
A/G
rs115100527
+
CDS
Nonsynonymous[Asn693Ser]
2288 .. 2288
G/A
rs140801412
+
CDS
Nonsynonymous[Ser703Asn]
2440 .. 2440
T/G
rs143901922
+
3'UTR
2554 .. 2554
T/C
rs115308907
+
3'UTR
2596 .. 2596
A/T
rs1061194
+
3'UTR
2664 .. 2664
T/G
rs113398624
+
3'UTR
2669 .. 2669
A/C
rs15654
-
3'UTR
2681 .. 2681
C/T
rs201628524
-
3'UTR
2736 .. 2736
T/G
rs16967070
+
3'UTR
2900 .. 2900
T/C
rs116124238
+
3'UTR
2958 .. 2958
T/C
rs79896497
+
3'UTR
3003 .. 3003
T/C
rs148348149
+
3'UTR
3025 .. 3025
C/A
rs186993715
+
3'UTR
3064 .. 3064
C/T
rs141497489
+
3'UTR
3136 .. 3136
C/T
rs537166
-
3'UTR
3322 .. 3322
G/C
rs550923
+
3'UTR
3424 .. 3424
C/T
rs191480323
+
3'UTR
3628 .. 3628
G/A
rs10627
+
3'UTR
3642 .. 3642
T/C
rs150884111
+
3'UTR
3667 .. 3667
C/T
rs185207156
+
3'UTR
3947 .. 3947
C/T
rs190103141
+
3'UTR
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
No data available
Database links
Human-Gene diversity Of Life-style related Diseases(H-GOLD)
;
Related H-InvDB links
VaryGene;
Repeat Mask Viewer
;