H-InvDB_9.0 released on May 27, 2015.
H-Invitational ID:
HIT000037802
Accession number:
BC017040
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Protein C-ets-2;
Transcript original information
Accession number
BC017040.1
CAGE tag ID
NA
EST ID
NA
Clone Number
MGC:9151 IMAGE:3852274
Experimental resources
NBRC; HGPD;
Sequence data provider
Provider: MGC/NCI;
Annotation project
H-Invitational FLcDNA
Length of cDNA
2500[bp] (No. of exon:10)[A:638 T:606 G:632 C:624]
Division
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
Colon, adenocarcinoma
Developmental stage
NA
Sequence quality information
CDS feature
Complete CDS
Kozak sequence
NA
PolyA
Site: 2482(+)
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NA
Notes
NA
GTGTCGCTCCAGCTCAGAGCTCCCGGAGCCGCCCGGCCAGCGTCCGGCCT CCCTGATCGTCTCTGGCCGGCGCCCTCGCCCTCGCCCGGCGCGCACCGAG CAGCCGCGGGCGCCGAGCAGCCACCGTCCCGACCAAGCGCCGGCCCTGCC CGCAGCGGCAGGATGAATGATTTCGGAATCAAGAATATGGACCAGGTAGC CCCTGTGGCTAACAGTTACAGAGGGACACTCAAGCGCCAGCCAGCCTTTG ACACCTTTGATGGGTCCCTGTTTGCTGTTTTTCCTTCTCTAAATGAAGAG CAAACACTGCAAGAAGTGCCAACAGGCTTGGATTCCATTTCTCATGACTC CGCCAACTGTGAATTGCCTTTGTTAACCCCGTGCAGCAAGGCTGTGATGA GTCAAGCCTTAAAAGCTACCTTCAGTGGCTTCAAAAAGGAACAGCGGCGC CTGGGCATTCCAAAGAACCCCTGGCTGTGGAGTGAGCAACAGGTATGCCA GTGGCTTCTCTGGGCCACCAATGAGTTCAGTCTGGTGAACGTGAATCTGC AGAGGTTCGGCATGAATGGCCAGATGCTGTGTAACCTTGGCAAGGAACGC TTTCTGGAGCTGGCACCTGACTTTGTGGGTGACATTCTCTGGGAACATCT GGAGCAAATGATCAAAGAAAACCAAGAAAAGACAGAAGATCAATATGAAG AAAATTCACACCTCACCTCCGTTCCTCATTGGATTAACAGCAATACATTA GGTTTTGGCACAGAGCAGGCGCCCTATGGAATGCAGACACAGAATTACCC CAAAGGCGGCCTCCTGGACAGCATGTGTCCGGCCTCCACACCCAGCGTAC TCAGCTCTGAGCAGGAGTTTCAGATGTTCCCCAAGTCTCGGCTCAGCTCC GTCAGCGTCACCTACTGCTCTGTCAGTCAGGACTTCCCAGGCAGCAACTT GAATTTGCTCACCAACAATTCTGGGACGCCCAAAGACCACGACTCCCCTG AGAACGGTGCGGACAGCTTCGAGAGCTCAGACTCCCTCCTCCAGTCCTGG AACAGCCAGTCGTCCTTGCTGGATGTGCAACGGGTTCCTTCCTTCGAGAG CTTCGAAGATGACTGCAGCCAGTCTCTCTGCCTCAATAAGCCAACCATGT CTTTCAAGGATTACATCCAAGAGAGGAGTGACCCGGTGGAGCAAGGCAAA CCAGTTATACCTGCAGCTGTGCTGGCCGGCTTCACAGGAAGTGGACCTAT TCAGCTGTGGCAGTTTCTCCTGGAGCTGCTATCAGACAAATCCTGCCAGT CATTCATCAGCTGGACTGGAGACGGATGGGAGTTTAAGCTCGCCGACCCC GATGAGGTGGCCCGCCGGTGGGGAAAGAGGAAAAATAAGCCCAAGATGAA CTACGAGAAGCTGAGCCGGGGCTTACGCTACTATTACGACAAGAACATCA TCCACAAGACGTCGGGGAAGCGCTACGTGTACCGCTTCGTGTGCGACCTC CAGAACTTGCTGGGGTTCACGCCCGAGGAACTGCACGCCATCCTGGGCGT CCAGCCCGACACGGAGGACTGAGGTCGCCGGGACCACCCTGAGCCGGCCC CAGGCTCGTGGACTGAGTGGGAAGCCCATCCTGACCAGCTGCTCCGAGGA CCCAGGAAAGGCAGGATTGAAAATGTCCAGGAAAGTGGCCAAGAAGCAGT GGCCTTATTGCATCCCAAACCACGCCTCTTGACCAGGCTGCCTCCCTTGT GGCAGCAACGGCACAGCTAATTCTACTCACAGTGCTTTTAAGTGAAAATG GTCGAGAAAGAGGCACCAGGAAGCCGTCCTGGCGCCTGGCAGTCCGTGGG ACGGGATGGTTCTGGCTGTTTGAGATTCTCAAAGGAGCGAGCATGTCGTG GACACACACAGACTATTTTTAGATTTTCTTTTGCCTTTTGCAACCAGGAA 
CAGCAAATGCAAAAACTCTTTGAGAGGGTAGGAGGGTGGGAAGGAAACAA CCATGTCATTTCAGAAGTTAGTTTGTATATATTATTATAATCTTATAATT GTTCTCAGAATCCCTTAACAGTTGTATTTAACAGAAATTGTATATTGTAA TTTAAAATAATTATATAACTGTATTTGAAATAAGAATTCAGACATCTGAG GTTTTATTTCATTTTTCAATAGCACATATGGAATTTTGCAAAGATTTAAT CTGCCAAGGGCCGACTAAGAGAAGTTGTAAAGTATGTATTATTTACATTT AATAGACTTACAGGGATAAGGCCTGTGGGGGGTAATCCCTGCTTTTTGTG TTTTTTTGTTTGTTTGTTTGTTTGTTTTTGGGGGGTTTTCTTGCCTTGGT TGTCTGGCAAGGACTTTGTACATTTGGGAGTTTTTATGAGAAACTTAAAT GTTATTATCTGGGCTTATATCTGGCCTCTGCTTTCTCCTTTAATTGTAAA GTAAAAGCTATAAAGCAGTATTTTTCTTGACAAAAAAAAAAAAAAAAAAA
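The "Length of cDNA" field above packs the sequence length, exon count, and per-base counts into one string. As a minimal sketch (the function name `parse_cdna_field` and the regular expressions are illustrative assumptions, not part of H-InvDB), the field can be parsed and checked for internal consistency, and the GC fraction derived from the reported counts:

```python
import re

# "Length of cDNA" field text copied from this record.
FIELD = "2500[bp] (No. of exon:10)[A:638 T:606 G:632 C:624]"


def parse_cdna_field(field):
    """Return (length_bp, exon_count, base_counts) parsed from the field text.

    Hypothetical helper: the patterns assume the exact layout shown in FIELD.
    """
    length = int(re.match(r"(\d+)\[bp\]", field).group(1))
    exons = int(re.search(r"No\. of exon:(\d+)", field).group(1))
    counts = {base: int(n) for base, n in re.findall(r"([ATGC]):(\d+)", field)}
    return length, exons, counts


length, exons, counts = parse_cdna_field(FIELD)
total = sum(counts.values())  # should equal the stated cDNA length
gc_fraction = (counts["G"] + counts["C"]) / total

print(length, exons, total, round(gc_fraction, 4))
```

For this record the four base counts sum to the stated 2500 bp, and G plus C make up 1256 of those bases.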
Gene structure information
H-Inv cluster ID
HIX0016112
Genomic location
Chromosome
21
Location
21q22.2
Position
40177883-40195723
Strand
+
Possible duplicated location(s)
NA
Gene structure
10 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:2114;
KEGG GENES
KEGG GENES(2114);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS; G-integra; cDNA-genome alignment;
Predicted CDS information
HIP ID
HIP000079395
Predicted CDS
163..1572; 469[aa]; Orientation:+1;
Codon Adaptation Index (CAI)
0.799
Database links
RefSeq
NP_005230;
UniProt
P15036;
CCDS
CCDS13659;
MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQ EVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIP KNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLEL APDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPHWINSNTLGFGT EQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRLSSVSVT YCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIP AAVLAGFTGSGPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVA RRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFVCDLQNLL GFTPEELHAILGVQPDTED*
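The predicted CDS coordinates (163..1572) and the stated protein length (469 aa) are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is the stop codon (the translation above ends in `*`); the function name `cds_summary` is hypothetical.

```python
def cds_summary(start, end):
    """Summarize a CDS given 1-based, inclusive transcript coordinates.

    Assumes the last codon in the interval is a stop codon, so the number
    of amino-acid residues is one less than the number of codons.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length_nt // 3
    return {"length_nt": length_nt, "codons": codons, "residues": codons - 1}


# Coordinates taken from the "Predicted CDS" field of this record.
summary = cds_summary(163, 1572)
print(summary)
```

Under those assumptions the interval spans 1410 nucleotides, or 470 codons, which is consistent with the 469 residues reported for HIP000079395.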
Motif information
a.a. length   InterPro    Name
102           IPR013761   Sterile alpha motif/pointed domain [Domain]
90            IPR013761   Sterile alpha motif/pointed domain [Domain]
86            IPR003118   Pointed domain [Domain]
84            IPR003118   Pointed domain [Domain]
82            IPR003118   Pointed domain [Domain]
121           IPR011991   Winged helix-turn-helix DNA-binding domain [Domain]
86            IPR000418   Ets domain [Domain]
14            IPR000418   Ets domain [Domain]
82            IPR000418   Ets domain [Domain]
81            IPR000418   Ets domain [Domain]
9             IPR000418   Ets domain [Domain]
19            IPR000418   Ets domain [Domain]
19            IPR000418   Ets domain [Domain]
16            IPR000418   Ets domain [Domain]
19            IPR000418   Ets domain [Domain]
Gene function information
H-Inv ID
HIT000037802
H-Inv cluster ID
HIX0016112
Accession number
BC017040.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
NO;
Coding potential
Protein coding;
Definition
Protein C-ets-2;
Similarity category
Category: Identical to known human protein (Category I).
Identical to known human protein (P15036) from Homo sapiens (Human) [Identity/coverage = 100.0%/100.0%].
Experimental evidence
Protein evidence
PubMed ID
2250910; 2847145; 2997781; 10830953; 14702039; 15489334; 18691976; ALL;
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
NA
HGNC aliases
NA
HGNC name
NA
DDBJ
ETS2
UniProt
ETS2
EC number
NA
GGDB (GlycoGene Database)
Gene symbol
NA
Family
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000079395
No. of interactions
37
Interaction partner(s)
HIP000021733; HIP000022129; HIP000022526; HIP000023055; HIP000026585; HIP000027524; HIP000028809; HIP000033301; HIP000036732; HIP000041711; HIP000043673; HIP000048613; HIP000056841; HIP000068104; HIP000078885; HIP000081026; HIP000081455; HIP000087835; HIP000093084; HIP000094122; HIP000096367; HIP000100939; HIP000100939; HIP000105122; HIP000109256; HIP000110646; HIP000116713; HIP000116926; HIP000136553; HIP000144372; HIP000147100; HIP000164074; HIP000256202; HIP000258478; HIP000336432; HIP000355042; HIP000361205;
BIND
150478; 197039; 258588; 258815; 258819;
DIP
NA
MINT
MINT-62043;
HPRD
00572; 00679; 01252; 01260; 01298; 01302; 01819; 02534; 02911; 03374; 04078; 04459; 04588; 05037; 05771; 09616; 09830;
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:2114;
KEGG GENES
KEGG GENES(2114);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases;
Curation status
Auto-annotated
Notes
NA
Related H-InvDB links
Gene family; Similarity Search Tool; TACT;
fRNAdb (the functional RNA database): fRNAdb entries that overlap H-InvDB transcripts, based on genomic location.
NA
Gene ontology information
Molecular function
sequence-specific DNA binding (GO:0043565); transcription factor activity (GO:0003700);
Biological process
regulation of transcription, DNA-dependent (GO:0006355);
Cellular component
nucleus (GO:0005634);
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
nuclear;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
Protein structure information (GTOP)
Last modified:27-May-2015
Start   End   PDB_ID   E-value   Identity   Coverage   SCOP_ID
77      166   1sxeA    1e-16     33.3       90/97      a.60.1.1
361     445   1bc7C    2e-30     56.5       85/93      a.4.5.21
Related H-InvDB links
GTOP
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe information
AceGene
AGhsA130124;
Affymetrix GeneChip
HG-Focus
201329_s_at;
HG-U133
201329_s_at;
HG-U133A
201329_s_at;
HG-U133A_2
201329_s_at;
HG-U133B
NA
HG-U133_Plus_2
201329_s_at;
HG-U95
1519_at;
HG-U95A
1519_at;
HG-U95B
NA
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
3921085; 3921093; 3921098; 3921100; 3921101; 3921102; 3921105; 3921107; 3921108; 3921109; 3921112; 3921115; 3921116; 3921117; 3931970;
HuGeneFL
J04102_at;
Agilent
Human 1A Oligo Microarray:PGID215
A_23_P257924;
Whole Human Genome Oligo Microarray:PGID247
A_23_P257924; A_24_P382661; A_32_P57877;
Related H-InvDB links
H-ANGEL; DNAProbeLocator;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location        Variation   dbSNP ID      Strand   CDS/UTR   Translation
160 .. 160      A/G         rs185407591   +        5'UTR
161 .. 161      G/C         rs190801053   +        5'UTR
205 .. 205      G/A         rs115148137   +        CDS       Nonsynonymous[Val15Met]
212 .. 212      A/G         rs150592938   +        CDS       Nonsynonymous[Asn17Ser]
235 .. 235      C/T         rs139909964   +        CDS       Nonsynonymous[Arg25Cys]
242 .. 242      C/T         rs143407258   +        CDS       Nonsynonymous[Pro27Leu]
278 ^ 279       -/T         rs66473060    +        CDS
351 .. 351      C/T         rs184634189   +        CDS       Synonymous[Ser63Ser]
352 .. 352      G/A         rs34373350    -        CDS       Nonsynonymous[Ala64Thr]
381 .. 381      G/A         rs139241184   +        CDS       Synonymous[Pro73Pro]
405 .. 405      A/G         rs11700777    +        CDS       Synonymous[Gln81Gln]
424 .. 424      A/C         rs144867115   +        CDS       Nonsynonymous[Ser88Arg]
434 .. 434      A/G         rs114481523   +        CDS       Nonsynonymous[Lys91Arg]
446 .. 446      G/A         rs149038175   +        CDS       Nonsynonymous[Arg95Gln]
508 .. 508      C/T         rs78391361    +        CDS       Nonsynonymous[Leu116Phe]
521 .. 521      A/G         rs201321686   +        CDS       Nonsynonymous[Asn120Ser]
558 .. 558      C/T         rs142958770   +        CDS       Synonymous[Phe132Phe]
559 .. 559      G/A         rs201897945   +        CDS       Nonsynonymous[Gly133Ser]
596 .. 596      A/T         rs1803557     +        CDS       Nonsynonymous[Glu145Val]
610 .. 610      C/T         rs147754962   +        CDS       Synonymous[Leu150Leu]
624 .. 624      T/A         rs142517222   +        CDS       Nonsynonymous[Phe154Leu]
657 ^ 658       -/C         rs34472454    +        CDS
771 .. 771      G/T         rs200961208   +        CDS       Synonymous[Ala203Ala]
801 .. 801      C/G         rs200893699   +        CDS       Synonymous[Pro213Pro]
808 .. 808      G/A         rs115908228   +        CDS       Nonsynonymous[Gly216Ser]
811 .. 811      C/A         rs61735785    +        CDS       Nonsynonymous[Leu217Ile]
815 .. 815      T/G         rs114460001   +        CDS       Nonsynonymous[Leu218Arg]
827 .. 827      G/T         rs115297166   +        CDS       Nonsynonymous[Cys222Phe]
830 .. 830      C/T         rs114562289   +        CDS       Nonsynonymous[Pro223Leu]
846 .. 846      C/T         rs115786160   +        CDS       Synonymous[Ser228Ser]
889 .. 889      C/T         rs150430243   +        CDS       Nonsynonymous[Arg243Trp]
901 .. 901      G/A         rs138127643   +        CDS       Nonsynonymous[Val247Ile]
907 .. 907      G/A         rs116698978   +        CDS       Nonsynonymous[Val249Ile]
978 .. 978      G/T         rs457705      +        CDS       Synonymous[Thr272Thr]
1005 .. 1005    C/T         rs201143455   +        CDS       Synonymous[Asn281Asn]
1010 .. 1010    C/T         rs138783369   +        CDS       Nonsynonymous[Ala283Val]
1011 .. 1011    G/A         rs115426813   +        CDS       Synonymous[Ala283Ala]
1058 .. 1058    A/C         rs146230611   +        CDS       Nonsynonymous[Gln299Pro]
1095 .. 1095    C/T         rs113417859   +        CDS       Synonymous[Phe311Phe]
1104 .. 1104    C/T         rs201065633   +        CDS       Synonymous[Phe314Phe]
1126 .. 1126    C/G         rs201705823   +        CDS       Nonsynonymous[Leu322Val]
1184 .. 1184    C/T         rs139305338   +        CDS       Nonsynonymous[Pro341Leu]
1185 .. 1185    G/A         rs461155      +        CDS       Synonymous[Pro341Pro]
1197 ^ 1198     -/C         rs34120017    +        CDS
1219 .. 1219    G/A         rs116771448   +        CDS       Nonsynonymous[Val353Met]
1324 .. 1324    G/A         rs116387099   +        CDS       Nonsynonymous[Gly388Arg]
1341 .. 1341    C/T         rs116476013   +        CDS       Synonymous[Leu393Leu]
1344 .. 1344    C/T         rs185172042   +        CDS       Synonymous[Ala394Ala]
1345 .. 1345    G/A         rs115880316   +        CDS       Nonsynonymous[Asp395Asn]
1391 .. 1391    C/T         rs200586471   +        CDS       Nonsynonymous[Pro410Leu]
1461 .. 1461    G/A         rs139455779   +        CDS       Synonymous[Thr433Thr]
1524 .. 1524    C/T         rs149674474   +        CDS       Synonymous[Pro454Pro]
1528 .. 1528    G/A         rs147744979   +        CDS       Nonsynonymous[Glu456Lys]
1546 .. 1546    G/A         rs142665354   +        CDS       Nonsynonymous[Gly462Ser]
1550 .. 1550    T/C         rs144935329   +        CDS       Nonsynonymous[Val463Ala]
1557 .. 1557    C/A         rs17854245    +        CDS       Synonymous[Pro465Pro]
1663 .. 1663    A/T         rs77488352    +        3'UTR
1712 .. 1712    A/C         rs75261542    +        3'UTR
1776 .. 1776    C/T         rs188181785   +        3'UTR
1818 .. 1818    A/G         rs711         +        3'UTR
1920 .. 1920    T/C         rs200475079   +        3'UTR
1982 .. 1982    G/A         rs141493936   +        3'UTR
2036 .. 2036    T/A         rs530         -        3'UTR
2042 .. 2042    C/T         rs8133034     +        3'UTR
2052 .. 2052    T/G         rs71316657    +        3'UTR
2102 .. 2102    T/A         rs147035130   +        3'UTR
2125 .. 2125    T/C         rs41276544    +        3'UTR
2213 .. 2213    G/A         rs191832014   +        3'UTR
2223 .. 2223    A/C         rs1051420     +        3'UTR
2244 .. 2244    T/A/C/G     rs1051425     +        3'UTR
2285 .. 2285    A/C         rs140874343   +        3'UTR
2292 ^ 2293     -/T         rs201404243   +        3'UTR
2299 ^ 2300     -/T         rs11422952    +        3'UTR
2301 ^ 2302     -/T         rs34882229    +        3'UTR
2378 .. 2378    G/T         rs184055024   +        3'UTR
2408 .. 2408    T/G         rs13046062    +        3'UTR
2435 .. 2435    C/T         rs3178021     +        3'UTR
2437 .. 2437    C/T         rs3178022     +        3'UTR
2438 .. 2438    C/T         rs3178023     +        3'UTR
2482 .. 2482    A/G         rs11540812    +        3'UTR
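The CDS/UTR and Translation columns of the SNP table follow from the predicted CDS interval (163..1572) reported earlier in this record. As a rough sketch, assuming 1-based transcript coordinates and a forward-strand CDS, a transcript position can be mapped to its region and, inside the CDS, to a codon number and a position within that codon. The function name `locate` is hypothetical and not part of any H-InvDB tool.

```python
# Predicted CDS interval from this record (1-based, inclusive).
CDS_START, CDS_END = 163, 1572


def locate(pos):
    """Map a 1-based transcript position to (region, codon, codon_position).

    codon and codon_position are None outside the CDS. Inside the CDS,
    codon 1 is the start codon and codon_position is 1, 2, or 3.
    """
    if pos < CDS_START:
        return ("5'UTR", None, None)
    if pos > CDS_END:
        return ("3'UTR", None, None)
    offset = pos - CDS_START  # 0-based offset into the CDS
    return ("CDS", offset // 3 + 1, offset % 3 + 1)


# A few positions taken from the SNP table.
for pos in (160, 205, 1557, 2482):
    print(pos, locate(pos))
```

Position 205 falls at the first base of codon 15, matching the table's Val15Met annotation, and position 1557 falls at the third base of codon 465, consistent with the synonymous Pro465Pro entry.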
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
No data available
Database links
Human-Gene diversity Of Life-style related Diseases (H-GOLD);
Related H-InvDB links
VaryGene; Repeat Mask Viewer;