H-InvDB_9.0 released on May 27, 2015.
Search by
Keyword
H-Inv ID (HIT)
H-Inv cluster ID (HIX)
H-Inv protein ID (HIP)
H-Inv gene family/group (HIF)
Accession number
Chromosome number
Chromosome band
Definition*
Data source ID
---
CCDS ID
dbSNP ID (rs number)
EC number
Ensembl ID
EntrezGene ID
FR ID
FR Accession number
GO ID
GO name*
HGNC gene symbol
HGNC gene name*
InterPro ID
InterPro name*
OMIM ID
OMIM title*
Pathway ID
Pathway name*
RefSeq (gene) ID
RefSeq (protein) ID
SCOP ID
UniProt
for
Advanced Search
Home
Quick guide
Navi
BLAST
Site map
Download
Contact us
Help
H-Invitational ID:
HIT000027749
Accession number:
AL832837
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Similar to N-acetyl-D-glucosamine kinase.
Select format
Flat file
XML file
Nucleotide sequence fasta
Protein sequence fasta
Transcript original information
Accession number
AL832837.1
CAGE tag ID
NA
EST ID
NA
Clone Number
DKFZp667A0125
Experimental resources
NBRC
;
HGPD
;
Sequence data provider
Provider:
DKFZ/MIPS
;
Annotation project
H-Invitational FLcDNA
Length of cDNA
4420[bp] (No. of exon:6)[A:1024 T:1152 G:1153 C:1091]
Devision
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
lymph node
Develpmental stage
adult
Sequence quality information
CDS feature
Complete CDS
Kozak sequence
NA
PolyA
Site: 4410(+) Signal: 4388-4392(+)
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NA
Notes
NA
CGGACGCGTGCGGACGCTGGGACCCACTCCGCTGTCTTCAGCTGGAGAAG CAGCTCCTTTTTCCTATTCCTGACCTCGGACAGAGTTTTGCAGAACAGGT GTCAGGTGTGAAATTGCTGAGCGCGGGCAGGGTGGGGAGTGGGAGTGGGG AGAAGCGGGACAAAGATCTTGGCTGAGGTTATCCGGGAGCTGGAGCAGCT GCGAGCCAGAGTACTAGGTTGTCCTGATGTCTGAAAACCGCTCCATCCCT GTTGGCCTTCCATGCTGCTTTCACTAGATTGCATTTTTCTGCTCAGAAGC TTGTTTACTGAACTTTTTGTAATAGCTCTGCCCCTGGGTATGACCATGGA CAAGATATTTACCCTCCCTGGGCTACAGTTACCGTATGTGCAAAGTGAGG ATAGGGATAAATAAGATCGCTATGTCCCTGCCAGTGGCAACATTCTGTGA ATCTGAGTTTGTCAAATCAGGGAATAATGGGGGTCGCTGGCCTCTCAGCC AGGGCACTGTACTTTCGAATCTGTGATTGTTTTCACTTCTCTGGCAGCCA CACCCCACTGCTGATTCACAGACCCCAAAGTCTTTTCACATGAATTGTTA GGAATGATCTTTCCCATCCTGTATTTGTAGCACTTAAAAAAAACAGTATA GGAGTCTGCATTCATCCTTGTTATAGTCATCTGTAAGATATGACCCGTTT CTTCCAGTTCGTCAAGACATGCTTTTGGATGTGGAATCTGCCACCTCATG TATTAACTACCCCCTTGAAATTTCATGTCCTCTTTGAGTTTGATGAGCGG TCTTCAGTGTCTTCTTCTGCCATTCCTAAAAGTAATGATCAATAAAGAAA CATATATCCCATCACCACAGATCTCCCTGTGAACTACTGTGACCCCCAGT AATCAACATTTGAAGGTTTAGTTGCTTAAGTATGTATGAAGTCACCCTGC CATCTGGTCAGCATTTCTAGTAAAGGTGTCTTGAAGACCTTGGCAGCTGG ATCTTGAAATGCAGTCTTTAGCTATTGTGAGGGAAGCAGAGAGATGTGCC ATTTTTCTTAGGAGTGTTTGCTGAGCATTGCCTGTGCCAGCTCCTGTGCT CGTTCTGAGAGGGTAAGTAAGTGATCAGGAGCTTGACCCAGTGGAGCTTG AAGTCTAGTGAGAGGCGGACACAAAGCGTCTGAACACACACTTCATACAG AGGAGACTTTAATAACTTTGGAGGCACTATGAAGGTTTAACTAACGGAGA CTTAACTAATTCGTTGGGGTGGGCCACACCTCAGAAAAGGTTTCGAGGAC TAGGTGAAGCGTAAGTAGAGTCTTGCAGGATAAGGAGGACATCCCCAAGG GAAGAGGAGTGAGAATTATGTTTCAGTTAAAGAGCACTCAAACACTGAAC CACAGAGGCGGGTTTGGGAGTGGTGATTGAGTTTGTCGAAGCTGTCTCCT CCATCATACTGGGCTCTGCCAGTGTGGACAGCACAAGCTTAGCTCTCTCT GTGGGTTTTTCCGAGCAAGATCAAGTGCCCTCTTGTGTTCTGTAGGGGAG GCACACGATCCGAGGTCCTTTTAGTCTCAGAGGATGGGAAGATCCTGGCA GAAGCAGATGGACTGAGCACAAACCACTGGGTAAAAACCACACTGAGGGG ATCAGAGGGCTTGGTTCTGATTTTATTCTCTGTAATTCCTGTTGAGGTGG TGGCTGGGACTCACAGAGCAGCCTGTGGGGGCAACATAGCTTCTGTAAGC CTTTGTAACTCCTTCTTCCCTTGATTGGGGCCAGCTGATCGGGACAGACA AGTGTGTGGAGAGGATCAATGAGATGGTGAACAGGGCCAAACGGAAAGCA GGGGTGGATCCTCTGGTACCGCTGCGAAGCTTGGTGAGTCTGGGGCGGAG CCTGGGAATTCAGCCATCTGTGACACTGAGACAGCTAGCAAGTTTGGACT AGATGAGTTTGATGACTGCAGAGGGAAAGACCTCCAAGACTTAGTCCCTG GTGTCAAAACTGTCATAATCTCACCCAGTCTCATGGCTTCAGTTGCCACC TACAAACTACAAATGTGTTGGAAGAGCACAGGGCTTTGGAGAGCTGCTGA GACCTTGGTTTGAATCCCAGCTTTGCTGCTCAGTATCTGCGTAACCCTGT GTAGGTTACTCACCTTCTCTAAGCCTCAGTTTTCTCATTTGGAAAATGTG AATAGTAGCTACCTCAGAGTTGTTGGGAAAGTAAAATGGCATGATACATT GCAAAGTGGTTAGTATAGAGCCTGACCCATAAGCACTCATTAAATGTTAG CTATTATTTACTCCTGGTTCAGATCTTTCTCCCAAGTTGCGGTAGCTCCA GCTACTCCCTGGACATATATGTGAGATGTTCTGCAGACACAGGCAAACCT CGTGTATACCAAACCAAGCTTACCTCTCTCCATCTCCCCTCACCTTGCCC CGCTGCCTGATCTTTCTGTTAAGAGCACTACTGCCCTTCCAAGCTCCAAA GCTGGCATTGTTGAATCCCCTCTTTCTTCTGTCCATCCCCAGTCCAGGTT GTTCTTATAACCACAGCCTTCTGAGGGGTCATTGTGTGGGGAGAAAGGAG GACTGGGGCTGGGTGAACAGGGTATGGCCAGATGGGGAGAACAGGAATCA GGCATCAGGAGGGGGCGGGGGTTGAGAAAAGAGCTGGGGCTGGGGCTCTG CACACTCGCTCACCTCCCGCGTGGCCTAGGGCCTATCTCTGAGCGGTGGG GACCAGGAGGACGCGGGGAGGATCCTGATCGAGGAGCTGAGGGACCGATT TCCCTACCTGAGTGAAAGCTACTTAATCACCACCGATGCCGCCGGCTCCA TCGCCACAGCTACACCGGATGGTGGAGTTGTGCTCATATCTGGAACAGGC TCCAACTGCAGGCTCATCAACCCTGATGGCTCCGAGAGTGGCTGCGGCGG CTGGGGCCATATGATGGGTGATGAGGGTTCAGGTGAGCTCACTGACTGGC CCAGCTCCAGGTCCTGGATCTGCTCCTTCCTTCACTCCCTGTCTTTCTCT CCTTAGCCTTGGCTCACTGAGCCCTTGGGGCTTCCTGGGGACAACTCCTG AACCTGGCCATTCCATGTGCAGGATCATGACTGAGATCATAGAGGTGCAA ATGCTGGACGAGCTGCTGACCCTGCTCTCCAGCAGTGACTTGCACCCAGG ACTCAGATTCATCCCTCAGCACAGGGGCCTGCCTGTGTTGACTTGCTGTC CTTAGGGAGGAGTTGAAACTGAGGGTAAGGCCATAGATGCCAGGAGATGC CATACCAGTCTGCTAGGGGAAAGGGTGTAGAAGCCCTTCCAACATCACCT CTCTCTAGGTAGTGTGTGTTTATTTTCATAGGGGCCCTTGAGGTCCAGTT CATGGCAGTGACTCCGGTGGTATTTGTGACAACAGACTGACCTTGCCTTA TCCTTGGGTAGGAGCTGGTGTGACACAGCCTCACCTATTCTGTTTTCCCC CTCCAACTTTCCCTTATCTTGGTCAGTCAGATACAGTTTGATAGAGACCC TCCCACCCCCCTCTCCCACCCCCTGCCACCCCTGGCTGGGAATCAGGAAA CCTAATTTGTATAAAATTCAAGCAGATCACTAACCTGTCTGGATCTGAGT TTCCCTCCTCGTGAGCTCAGGATGACTGCTCCCATCTTCTTAGCCCTCTC TGCTCCCTGCAGCCTACTGGATCGCACACCAAGCAGTGAAAATAGTGTTT GACTCCATTGACAACCTAGAGGCGGCTCCTCATGATATCGGCTACGTCAA ACAGGCCATGTTCCACTATTTCCAGGTGCCAGATCGGCTAGGGATACTCA CTCACCTGTATAGGGACTTTGATAAATGCAGGTTTGCTGGGTTTTGCCGG AAAATTGCAGAAAGTGCTCAGCAGGGAGACCCCCTTTCCCGCTATATCTT CAGGAAGGCTGGGGAGATGCTGGGCAGACACATCGTAGCAGTGTTGCCCG AGATTGACCCGGTCTTGTTCCAGGGCAAGATTGGACTCCCCATCCTGTGC GTGGGCTCTGTGTGGAAGAGCTGGGAGCTGCTGAAGGAAGGTTTTCTTTT GGCGCTGACCCAGGGCAGAGAGATCCAGGCTCAGAACTTCTTCTCCAGCT TCACCCTGATGAAGCTGAGGCACTCCTCCGCTCTGGGTGGGGCCAGCCTA GGGGCCAGGCACATCGGGCACCTCCTCCCCATGGACTATAGCGCCAATGC CATTGCCTTCTATTCCTACACCTTTTCCTAGGGGGCTGGTCCCGGCTCCA CCCCCTCCAAGCTCAGTGGACACTGGGTCTGAAAGGAAGGAGTCTTTTGC TTCCTTTCTCCTTTTTACAAAAACAAACATAGAAGAAAATAAATGCACTT TATCCACTCAAAAAAAAAAA
Gene structure information
H-Inv cluster ID
HIX0002147
Genomic location
Chromosome
2
Location
2p13.3
Position
71296108- 71305766
Strand
+
Possible duplicated location(s)
NA
Gene structure
6 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:55577
;
KEGG GENES
KEGG GENES(55577)
;
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS
;
G-integra
;
cDNA-genome alignment
;
Predicted CDS information
HIP ID
HIP000213201
Predicted CDS
3808..4281; 157[aa]; Orientation:+1;
Codon Adaptation Index (CAI).
0.813
MFHYFQVPDRLGILTHLYRDFDKCRFAGFCRKIAESAQQGDPLSRYIFRK AGEMLGRHIVAVLPEIDPVLFQGKIGLPILCVGSVWKSWELLKEGFLLAL TQGREIQAQNFFSSFTLMKLRHSSALGGASLGARHIGHLLPMDYSANAIA FYSYTFS*
Motif information
a.a.
length
InterPro
Name
74
IPR002731
ATPase, BadF/BadG/BcrA/BcrD type [Domain]
Gene function information
H-Inv ID
HIT000027749
H-Inv cluster ID
HIX0002147
Accession number
AL832837.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
NO;
Splicing isoform
Coding potential
Protein coding;
Definition
Similar to N-acetyl-D-glucosamine kinase.
Similarity category
Category: Similar to known protein(Category II).
Similar to known protein (
NP_060037
) [Identity/coverage = 98.947%/48.72%] to Homo sapiens protein.
Experimental evidence
Protein evidence
PubMed ID
NA
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
NA
HGNC aliases
NA
HGNC name
NA
DDBJ
NA
UniProt
NA
EC number
EC 2.7.1.59
N-acetylglucosamine kinase;
GGDB
(GlycoGene Database)
Gene symbol
NA
Familly
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
00251
:Glutamate metabolism;
00530
:Aminosugars metabolism;
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000213201
No. of interaction
NA
Interaction partner(s)
NA
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:55577
;
KEGG GENES
KEGG GENES(55577)
;
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases
;
Curation status
Auto-annotated
Notes
NA
Related H-InvDB links
Gene family;
Similarity Search Tool;
TACT
;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
cytosol; nuclear; cytoskeleton;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
JRE-1.4.0 or later is required.
Download JRE at
Sun's web site.
Protein structure information (GTOP)
Last modified:27-May-2015
Start
End
PDB_ID
E-value
Identity
Coverage
SCOP_ID
1
157
2ch5A1
1e-48
98.7
154/224
c.55.1.5
Related H-InvDB links
GTOP
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe
information
AceGene
AGhsA231106;
Affymetrix
GeneChip
HG-Focus
218231_at;
HG-U133
218231_at;
HG-U133A
218231_at;
HG-U133A_2
218231_at;
HG-U133B
NA
HG-U133_Plus_2
218231_at;
HG-U95
46246_at;
HG-U95A
NA
HG-U95B
46246_at;
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
2488050; 2488051; 2488052; 2488053; 2488054; 2488055; 2488056; 2488057; 2488059; 2488060; 2488061; 2488063; 2488064; 2488066; 2488067; 2488068; 2488069; 2488077;
HuGeneFL
NA
Agilent
Human 1A Oligo Microarray:PGID215
NA
Whole Human Genome Oligo Microarray:PGID247
NA
Related H-InvDB links
H-ANGEL
;
DNAProbeLocator
;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location
Variation
dbSNP ID
Strand
CDS/UTR
Translation
169 .. 169
T/G
rs2009953
+
5'UTR
234 .. 234
A/G
rs62145136
+
5'UTR
290 .. 290
T/A
rs140568635
+
5'UTR
636 .. 636
A/T
rs185683363
+
5'UTR
778 .. 778
T/C
rs190348494
+
5'UTR
943 .. 943
C/T
rs145530214
+
5'UTR
971 .. 971
T/C
rs888352
-
5'UTR
985 .. 985
A/G
rs147750938
+
5'UTR
1147 .. 1147
C/G
rs180796534
+
5'UTR
1294 .. 1294
C/T
rs142602615
+
5'UTR
1311 .. 1311
G/A
rs10496181
+
5'UTR
1343 ^ 1344
-/C
rs35785094
+
5'UTR
1376 .. 1376
G/A
rs145777845
+
5'UTR
1442 .. 1442
C/T
rs999240
-
5'UTR
1512 .. 1512
C/T
rs41285971
+
5'UTR
1532 .. 1532
C/G
rs138356329
+
5'UTR
1588 .. 1588
G/A
rs77352101
+
5'UTR
1609 .. 1609
T/C
rs2287328
-
5'UTR
1628 .. 1628
T/A
rs17856147
+
5'UTR
1631 .. 1631
G/C
rs111712781
+
5'UTR
1633 .. 1633
A/G/T
rs111607540
+
5'UTR
1664 .. 1664
G/C
rs75589581
+
5'UTR
1730 .. 1730
G/T
rs148823405
+
5'UTR
1775 .. 1775
T/C
rs185363544
+
5'UTR
1836 .. 1836
G/A
rs192172897
+
5'UTR
1842 .. 1842
C/T
rs143520899
+
5'UTR
1849 .. 1849
C/T
rs17849984
+
5'UTR
1858 .. 1858
A/T
rs150821125
+
5'UTR
1862 .. 1862
T/A
rs2418893
+
5'UTR
1865 .. 1865
G/A
rs2418894
+
5'UTR
1880 .. 1880
C/T
rs3211088
+
5'UTR
1896 .. 1896
C/T
rs2287327
-
5'UTR
2012 .. 2012
G/A
rs148003976
+
5'UTR
2141 .. 2141
G/A
rs141750673
+
5'UTR
2151 .. 2151
G/A
rs116908418
+
5'UTR
2169 .. 2169
C/G
rs145991956
+
5'UTR
2183 .. 2183
T/C
rs1861854
-
5'UTR
2231 .. 2231
G/T
rs77575917
+
5'UTR
2252 .. 2252
C/T
rs12996507
+
5'UTR
2315 .. 2315
T/G
rs115772674
+
5'UTR
2400 .. 2400
T/C
rs13023145
+
5'UTR
2451 .. 2451
C/T
rs74890112
+
5'UTR
2608 .. 2608
G/A
rs183889158
+
5'UTR
2661 .. 2661
G/A
rs201143239
+
5'UTR
2689 .. 2689
G/A
rs112024192
+
5'UTR
2695 .. 2695
G/A
rs75976980
+
5'UTR
2707 .. 2707
C/T
rs141798565
+
5'UTR
2719 .. 2719
G/T
rs146271156
+
5'UTR
2744 .. 2744
C/T
rs143484080
+
5'UTR
2745 .. 2745
G/A
rs144077605
+
5'UTR
2754 .. 2754
C/A
rs149052839
+
5'UTR
2763 .. 2763
G/C
rs200297031
+
5'UTR
2789 .. 2789
G/C
rs17849402
+
5'UTR
2840 .. 2840
C/A/T
rs11539622
+
5'UTR
2844 .. 2844
G/A
rs146931704
+
5'UTR
2932 .. 2932
C/G
rs201875949
+
5'UTR
2933 .. 2933
C/A
rs150661812
+
5'UTR
2942 .. 2942
C/G
rs1043685
+
5'UTR
2943 .. 2943
T/A
rs1043686
+
5'UTR
2945 .. 2945
C/T
rs149375267
+
5'UTR
2949 .. 2949
G/A
rs144758868
+
5'UTR
2983 .. 2983
G/C
rs112552513
+
5'UTR
3097 .. 3097
C/G
rs77907098
+
5'UTR
3213 .. 3213
C/A
rs115328847
+
5'UTR
3311 .. 3311
T/C
rs75562744
+
5'UTR
3312 .. 3312
G/A
rs191826175
+
5'UTR
3396 .. 3396
C/T
rs145571696
+
5'UTR
3416 .. 3416
G/A
rs147709165
+
5'UTR
3617 .. 3617
T/C
rs17616553
+
5'UTR
3637 .. 3637
G/C
rs77337878
+
5'UTR
3724 .. 3724
G/A
rs183509027
+
5'UTR
3753 .. 3753
C/T
rs144182116
+
5'UTR
3774 .. 3774
G/A
rs144974375
+
5'UTR
3793 .. 3793
T/C
rs139590364
+
5'UTR
3795 .. 3795
C/T
rs145487795
+
5'UTR
3822 .. 3822
C/G
rs141245714
+
CDS
Nonsynonymous[Phe5Leu]
3835 .. 3835
C/T
rs142624273
+
CDS
Nonsynonymous[Arg10Trp]
3842 .. 3842
G/C
rs201078608
+
CDS
Nonsynonymous[Gly12Ala]
3843 .. 3843
G/T
rs148350552
+
CDS
Synonymous[Gly12Gly]
3851 .. 3851
C/T
rs148054431
+
CDS
Nonsynonymous[Thr15Ile]
3889 .. 3889
G/C
rs201526773
+
CDS
Nonsynonymous[Gly28Arg]
3900 .. 3900
G/A
rs141425614
+
CDS
Synonymous[Arg31Arg]
3901 .. 3901
A/T
rs76423411
+
CDS
AA-STOP[Lys32*]
3941 .. 3941
G/A
rs142872700
+
CDS
Nonsynonymous[Arg45His]
4027 .. 4027
A/G
rs146053228
+
CDS
Nonsynonymous[Lys74Glu]
4059 .. 4059
T/C
rs112506822
+
CDS
Synonymous[Ser84Ser]
4062 .. 4062
G/A
rs1043701
+
CDS
Synonymous[Val85Val]
4099 .. 4099
T/C
rs6713
-
CDS
Synonymous[Leu98Leu]
4103 .. 4103
C/T
rs144807367
+
CDS
Nonsynonymous[Ala99Val]
4115 .. 4115
G/A
rs188706050
+
CDS
Nonsynonymous[Gly103Asp]
4146 .. 4146
C/T
rs112235207
+
CDS
Synonymous[Ser113Ser]
4198 .. 4198
C/G
rs200730544
+
CDS
Nonsynonymous[Leu131Val]
4205 .. 4205
C/T
rs139536732
+
CDS
Nonsynonymous[Ala133Val]
4207 .. 4207
A/G
rs145168741
+
CDS
Nonsynonymous[Arg134Gly]
4242 .. 4242
C/T
rs142009799
+
CDS
Synonymous[Ser145Ser]
4281 .. 4281
G/-
rs34337134
+
CDS
4303 .. 4303
C/T
rs145881172
+
3'UTR
4304 .. 4304
C/G
rs112773895
+
3'UTR
4368 .. 4368
C/A/G
rs112601471
+
3'UTR
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
Type
Start
End
Strand
MIR3
314
454
+
L3
502
1014
-
L2c
1067
1412
-
L2
2016
2058
+
MIRb
2067
2308
+
Database links
Human-Gene diversity Of Life-style related Diseases(H-GOLD)
;
Related H-InvDB links
VaryGene
;
Repeat Mask Viewer
;