H-InvDB_9.0 released on May 27, 2015.
H-Invitational ID:
HIT000020366
Accession number:
AK095511
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Conserved hypothetical protein.
Transcript original information
Accession number
AK095511.1
CAGE tag ID
NA
EST ID
NA
Clone Number
FCBBF1000270
Experimental resources
NBRC; HGPD;
Sequence data provider
Project:FLJ; Provider:FLJ/HRI;
Annotation project
H-Invitational FLcDNA
Length of cDNA
3584[bp] (No. of exon:1)[A:1361 T:985 G:586 C:652]
Division
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
brain
Developmental stage
fetal
Sequence quality information
CDS feature
N-truncated
Kozak sequence
NA
PolyA
NA
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NA
Notes
NA
CTAACACAATCTTATTTTATTCTCTTAAAAGGTTCAATACAGTATGATTA GGATAATAAAATCTAGTTTCAAAAAATATAATAAATCTAAAACAAGGAAA AAACTAAAAACTGTGCATTTGTTCTCTAATCTTTTTACCATTTTTCAATC TGGGAACACTTCTAAATTTAGGAGTAAGCTATAATAAGCCACGATCATAT GACTGCCTAGTTTCTTGAACATTAATAAATACTTAATTCCAGCTTACATG AAATATTGCATTTGGAGAGGTGGGTACTATGTAGTATAAAAACTAAGTCG AATCAGTAACTTTAACTTTTCTTCTTGTTGGTTTCCTTTTATTTTTAAAT CCCAGAACCCAAAGATTTCTAAAGATTGTGTAAGCTAAATTATTTGTACA TAACAAAAACTATATGGCTTTTCTAATACTGTAAGCTTAGAAAATCCCAG TATATTTAACCAAGCATTTTCCTTTTATCTTTTTGAATTTTAACCATTTT GAATCCTACAGCTGGTGAATTTTCATATTCTACAATTATGTTTCTCATTG CATTCTATCTTCTAGTTATGGTATAATGTTCCCACATATAAATATTATGT AAAACACAAATGCAATCAATTAATATTAACTTATTCCTAGGGATAATAAT AGCAATAATAACTCGTCATACTTTAGTGAATTCTGAATCTGAGTGAAACA AGAATACCAAGTTTCTATGATATTTCTATTGCAATATACCGAATACAACA ACTTCTCAAATATGTAAATTAATTTGTGACTCAGTTTGCTGGGTAATGTT CACATCATGACTCAAATAGATGCAACTACTTAAAAATGAATGACTTGGGA GGCTGCCATGGACTATGAAGCTGTGTGAGTAAGTAAGCTGCTTGTATCTG GCCTATATTGAGAGGAAGAAATTTTTCCATCTGACATATTTAGAGGTAAT CCCTCTATAACAGAATTAAAATATGGTATATGGATGGCATTGCTTATACA GACTTCCCACCGTTAGAATTCTGTATTGTTCCATGGATTGGGACTATTCT TGATGCTCTGGCATTTATTCTTTTAAGAATGACATCAGATGTCAAACATA TATTATTTACAGACAACCTTCCAATTTCAGTCTATCTAGATAAAAGCTAC CTTATGGATGATTGCAGAGTTTCCAGATATAGCAGTAATCATCATGTTTT ATCATGATGAACTGGAGGAAGAGTACAAATGCTTCTTTTATACAAAAATT GAGAAGACAATGTACTTCAGGTCACTTTTCTCTTATAGTCAAGGAGATAA ACTGCCAGTTCTAAAAGGTGACCATGCTACTCCTATCAAACTACCAACAT CATTTTTTCACAGAATTAGAAAAAATATTTTAAAATTCATATGGAACCAA AAAAAAGCCCCCAAAGCCAGTGCAATCCTAAGCAAAAAGAATAAAGCCAG AGGCAGCAGGCTACCTGACTTCAAACTATACTACAAGGCTACAGTAGCCA AAACAGCATGGTACTGATACAAAAACAGACACATAGACCAATGGTACAGG TTACAGAACCCAAAAATAAAGCTGCACACCTACAACCATCTGATATTTGA CAAAGTCGACAACAACAAGCAATGCAGAAAGGACTGCCTGTTCAATAAAT GGTGCTGGGTTAACAGGCTAGCCATATGCAGCAGATTGAAGCTGGACCCC TTCTTATACCATACACAAAAATCAACTCAATATGGATTAAACACTTCAAT GTAAGACCTAAAACTACAACAACCCTAGAAGAAAACCTAGGAAATACCAT TCTGGACATAGACCTTGGCAAAGAGTTCATGACAATGTCTCCAAAAGCAA ATGCAACAAAACCAAAAATACACAAATGGGACCTAATGAAACTAAAGAGC TCCTGCACAGCTAAAGAAACTATCAACAGAGTAAACAGACATCCTACAGA 
ATGGGTGAAAATATTTGCAAACTATTCATCTGACAAAATTCTAATATCCA GAATCTATAAGGAATTTAAACAAAACTCAACAACAAAAAGCAAACAACCC CATTAAAAAATGGGCAAAGGACATGAACAGACACTTCTCAAAAGAAGACA TACAAGTAGCCAACAAGCATATAAAAAAAAATGCTCAGCATGACAAATTA TTAGAGAAATGCAAATCAAAACCACAATGAGATACCATCTCACACCAGTC AGGATGGTTATTATTAAAAAGTCAGAAAATAACATGTTGGCAAGGTTACA GAGTAAAGCTTATACACTTCTAGTGGGAATGTAAATTAGTTCAGCCACTG TGGAAATCAATTTGGAGATTTCTCAAAGAACTTAAAACAGAACTACTATT CAACCCAGCAATCCCACTACTGGGTATATACTCAAAGGAATATAAATCGT TCTACCAAAAAGAAACATGTATTTGTATGTTCCTCACAGCACCATTCATA TTAGCAAAGACATTGAATCAACCTATATACCCACCAATGGTAGACTGGAT AAAGCAAATGCAGTACATATACATCATGGGATTCTATATAGCCATTAAAA AAAATCATGTCCATTGCACAACATGGATGCAGCTGGAGGCCATTATCCTA ATCAAATTAATGTAGGAACAGAAAACCAAATATCACGTTTCCACTTATAA GTAGGAAGTAAACACTGAGTACACATGGATAGAAGAAAGGGAACAATACA CACAGGGTCCTACTTGAGGGTAGGGGTTAGAAGGAGGGTGAGGATTGAAA AACTACCTACAAGGTACTATGCCACCATTAATAGCTGTTAAACTTCAAAA AAAAAAAAAAACTACTGTTTCATTATAAAAACAAACAAAAAACCTAGAGT GTCTCCCAGACTTCGCAGCGCAGACTTTGCAGCACACTGGAATCACTTAA GAATCTTTAAAAAGTACAGATATGTGTATCTCAACACAAGATACTCACAT CCTGATTTATTTGGGATGGGGTGTGAATCTGAGAATGATAATTTTTAAAA CTCCTCTGGTGCTTCTAATGTGAAGCCAATGACCTAGAGGATAGGAGAAG TCACTGCTATCTTCTTCCCTCCACTCAGCTGTGCCTTAGTGTCAGGAAAA GGAGGCAGTTCATGCAAAGAACCACACCAACAAGTGTGGGCCTGCTTATC TCCAACACCAGAAGTCACCTTGATTTTCAATAAATCAATATGCTTGGAGG AAGTGTGTGTGTGTGTGTGTGTGCGTGTGTGCATGGTGGGAGGCAGGGAG AGAAAGAGAAAGGGGAGAAAAATATGGACTGTGAAAAAAAGAAACATAAA GATAAGATAAAGTCCTAAATAAAATAAAGCAATCTTAAAGAGAAATGCCT TAATGATTTTAAAAAACAAACAAAAAAAGCCCCTGCAGATCATCATGGAA TATTGTAAAGAACAGTTTCTCCATCTTTATTCCCAATCACACTGTATGCC ATACACTCAAGTATGTTATTACATAAAAAGTGATTCCCACGATTGCTAAC ATGTATAGTCTACCATTCTAGAAATGAAATATCAAAATGTGTGTGTAGAG ACTTAAGAGGAAACAGAACATTCTAGGCTAGTTA
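The base counts given in the "Length of cDNA" field can be cross-checked against the stated length, and a GC fraction can be derived from them. The following is a minimal Python sketch that uses only the figures in this record; the variable names are illustrative and are not part of H-InvDB.

```python
# Base counts copied from the "Length of cDNA" field of this record.
counts = {"A": 1361, "T": 985, "G": 586, "C": 652}
reported_length = 3584  # bp, as stated in the same field

# The four counts should add up to the reported cDNA length.
total = sum(counts.values())
length_matches = total == reported_length

# GC fraction: share of G and C bases among all counted bases.
gc_fraction = (counts["G"] + counts["C"]) / total

print(f"sum of counts: {total} (matches reported length: {length_matches})")
print(f"GC fraction: {gc_fraction:.3f}")
```

For this entry the counts sum to the reported 3584 bp, and the GC fraction comes out at roughly 0.345.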
Gene structure information
H-Inv cluster ID
HIX0002543
Genomic location
Chromosome
2
Location
2q24.2
Position
162290385-162293969
Strand
-
Possible duplicated location(s)
NA
Gene structure
1 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:100130761;
KEGG GENES
KEGG GENES(100130761);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS; G-integra; cDNA-genome alignment;
Predicted CDS information
HIP ID
HIP000439635
Predicted CDS
1664..2056; 130[aa]; Orientation:+2;
Codon Adaptation Index (CAI)
0.719
QASHMQQIEAGPLLIPYTKINSIWIKHFNVRPKTTTTLEENLGNTILDID LGKEFMTMSPKANATKPKIHKWDLMKLKSSCTAKETINRVNRHPTEWVKI FANYSSDKILISRIYKEFKQNSTTKSKQPH*
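The predicted CDS coordinates, the reported protein length and the orientation value are mutually consistent, which can be checked with a little arithmetic. The sketch below is a hedged illustration in Python: it assumes the coordinates are 1-based and inclusive, and it assumes that "Orientation:+2" denotes the second forward reading frame. Neither assumption is stated in the record itself.

```python
import re

# Field value and protein sequence copied from this record.
cds_field = "1664..2056; 130[aa]; Orientation:+2;"
protein = (
    "QASHMQQIEAGPLLIPYTKINSIWIKHFNVRPKTTTTLEENLGNTILDID"
    "LGKEFMTMSPKANATKPKIHKWDLMKLKSSCTAKETINRVNRHPTEWVKI"
    "FANYSSDKILISRIYKEFKQNSTTKSKQPH*"
)

# Pull start, end and reported amino-acid count out of the field text.
match = re.match(r"(\d+)\.\.(\d+); (\d+)\[aa\]", cds_field)
start, end, reported_aa = (int(g) for g in match.groups())

cds_length = end - start + 1         # assumes 1-based, inclusive coordinates
codons = cds_length // 3             # includes the terminal stop codon
residues = len(protein.rstrip("*"))  # amino acids, stop symbol excluded

# Assumption: "+2" means the second forward frame for a 1-based start.
frame = (start - 1) % 3 + 1

print(cds_length, codons, residues, reported_aa, frame)
```

Under these assumptions the CDS is 393 nt long, which is 131 codons: 130 amino acids plus the stop codon, matching the 130[aa] in the record.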
Gene function information
H-Inv ID
HIT000020366
H-Inv cluster ID
HIX0002543
Accession number
AK095511.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
Representative transcript;
Coding potential
Protein coding;
Definition
Conserved hypothetical protein.
Similarity category
Category: Conserved hypothetical protein(Category IV).
Conserved hypothetical protein.
Experimental evidence
NA
PubMed ID
NA
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
NA
HGNC aliases
NA
HGNC name
NA
DDBJ
NA
UniProt
NA
EC number
NA
GGDB (GlycoGene Database)
Gene symbol
NA
Family
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000439635
No. of interactions
NA
Interaction partner(s)
NA
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:100130761;
KEGG GENES
KEGG GENES(100130761);
GeneCard
NA
etc
Human-Gene diversity Of Life-style related Diseases;
Curation status
Human curated
Notes
NA
Related H-InvDB links
Gene family; Similarity Search Tool; TACT;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
cytosol; nuclear; plasma membrane; Other;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe information
AceGene
NA
Affymetrix GeneChip
HG-Focus
NA
HG-U133
NA
HG-U133A
NA
HG-U133A_2
NA
HG-U133B
NA
HG-U133_Plus_2
NA
HG-U95
NA
HG-U95A
NA
HG-U95B
NA
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
2512779;
HuGeneFL
NA
Agilent
Human 1A Oligo Microarray:PGID215
NA
Whole Human Genome Oligo Microarray:PGID247
NA
Related H-InvDB links
H-ANGEL; DNAProbeLocator;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location      Variation  dbSNP ID     Strand  CDS/UTR  Translation
31 .. 31      G/A        rs7349346    -       5'UTR
51 .. 51      G/A        rs7349345    -       5'UTR
66 .. 66      G/A        rs114203252  -       5'UTR
69 .. 69      T/C        rs180727366  -       5'UTR
96 .. 96      G/A        rs7349344    -       5'UTR
97 ^ 98       -/G        rs35915299   -       5'UTR
191 .. 191    A/C        rs28637735   -       5'UTR
197 .. 197    A/T        rs192079127  -       5'UTR
228 .. 228    A/C        rs114397185  -       5'UTR
248 .. 248    A/G        rs187289521  -       5'UTR
256 .. 256    T/C        rs115287381  -       5'UTR
452 .. 452    A/G        rs140196298  -       5'UTR
567 .. 567    T/C        rs10930018   -       5'UTR
665 .. 665    G/A        rs181914517  -       5'UTR
701 .. 701    A/G        rs77899260   -       5'UTR
865 .. 865    A/T        rs77363740   -       5'UTR
926 .. 926    T/C        rs79626413   -       5'UTR
966 .. 966    T/C        rs114296915  -       5'UTR
1027 .. 1027  T/G        rs73971192   -       5'UTR
1209 .. 1209  G/A        rs7581103    -       5'UTR
1242 .. 1242  A/G        rs190264512  -       5'UTR
1392 .. 1392  T/C        rs184348992  -       5'UTR
1539 .. 1539  C/T        rs71597824   -       5'UTR
1566 .. 1566  A/G        rs77739172   -       5'UTR
1571 .. 1571  G/A        rs77547753   -       5'UTR
1577 .. 1577  C/T        rs75264325   +       5'UTR
1586 .. 1586  C/G        rs62376429   -       5'UTR
1607 .. 1607  C/T        rs144813480  -       5'UTR
1655 .. 1655  C/T        rs78218415   -       5'UTR
1667 .. 1667  G/A        rs79207282   -       CDS      Nonsynonymous[Ala2Thr]
1681 .. 1681  G/A        rs78390487   -       CDS      Synonymous[Gln6Gln]
1723 .. 1723  C/T        rs112339375  -       CDS      Synonymous[Ile20Ile]
1757 .. 1757  C/T        rs112817306  -       CDS      Nonsynonymous[Pro32Ser]
1785 .. 1785  A/T        rs112085071  -       CDS      Nonsynonymous[Asn41Ile]
1786 .. 1786  C/T        rs140968848  -       CDS      Synonymous[Asn41Asn]
1790 .. 1790  G/A        rs111532359  -       CDS      Nonsynonymous[Gly43Arg]
1817 .. 1817  G/C        rs144275753  -       CDS      Nonsynonymous[Gly52Arg]
1863 .. 1863  C/A        rs9683968    -       CDS      Nonsynonymous[Pro67Gln]
1910 .. 1910  G/A        rs9684646    -       CDS      Nonsynonymous[Ala83Thr]
2231 .. 2231  A/T        rs112521486  -       3'UTR
2355 .. 2355  C/T        rs74370145   +       3'UTR
2358 .. 2358  G/A        rs181341107  -       3'UTR
2579 .. 2579  G/A        rs116766036  -       3'UTR
2586 .. 2586  G/A        rs71423098   -       3'UTR
2601 .. 2601  A/C        rs74724497   -       3'UTR
2620 .. 2620  A/G        rs188535508  -       3'UTR
2623 .. 2623  A/C        rs114680284  -       3'UTR
2709 .. 2709  C/G        rs183915747  -       3'UTR
2791 .. 2791  A/C        rs72877969   -       3'UTR
2839 .. 2839  A/C        rs115360997  -       3'UTR
2941 .. 2941  A/G        rs150830316  -       3'UTR
3026 .. 3026  C/T        rs192642377  -       3'UTR
3202 ^ 3203   -/GT       rs3038155    -       3'UTR
3212 ^ 3213   -/GT/TG    rs10678987   -       3'UTR
3221 ^ 3222   -/GT       rs34218219   -       3'UTR
3223 ^ 3224   -/TG       rs149413117  -       3'UTR
3224 .. 3224  C/T        rs199858669  -       3'UTR
3251 .. 3251  A/G        rs190570926  -       3'UTR
3276 .. 3276  G/A        rs185648983  -       3'UTR
3367 .. 3367  C/A        rs55861556   -       3'UTR
3370 .. 3370  A/T        rs193020850  -       3'UTR
3465 .. 3465  G/A        rs4988784    -       3'UTR
3525 .. 3525  T/C        rs142617221  -       3'UTR
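The CDS/UTR and Translation values for the SNPs follow from each variant's transcript position relative to the predicted CDS (1664..2056). The Python sketch below shows one way to reproduce the region label and the residue number for single-base positions. It assumes 1-based inclusive coordinates; the function names are hypothetical and are not part of any H-InvDB tool. Insertion sites written as "a ^ b" are not handled.

```python
CDS_START, CDS_END = 1664, 2056  # predicted CDS of this record

def region(pos: int) -> str:
    """Classify a transcript position relative to the predicted CDS."""
    if pos < CDS_START:
        return "5'UTR"
    if pos > CDS_END:
        return "3'UTR"
    return "CDS"

def codon_number(pos: int) -> int:
    """Return the 1-based codon index of a position inside the CDS."""
    if region(pos) != "CDS":
        raise ValueError(f"position {pos} is outside the CDS")
    return (pos - CDS_START) // 3 + 1

# Positions taken from the SNP list of this record.
for pos in (31, 1667, 1681, 1786, 1910, 2231):
    label = region(pos)
    codon = codon_number(pos) if label == "CDS" else None
    print(pos, label, codon)
```

Positions 1667, 1681, 1786 and 1910 map to codons 2, 6, 41 and 83, in agreement with the Ala2Thr, Gln6Gln, Asn41Asn and Ala83Thr annotations.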
Microsatellite (Short Tandem Repeat, STR)
Location    Variation  Strand
3203..3222  (gt)10     +
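The microsatellite notation "(gt)10" describes a two-base unit repeated ten times, which should account for the 20 bases spanned by 3203..3222. A minimal Python sketch of that check follows, assuming 1-based inclusive coordinates; `expand_repeat` is a hypothetical helper written for this illustration.

```python
import re

def expand_repeat(notation: str) -> str:
    """Expand a '(unit)count' notation such as '(gt)10' into its sequence."""
    match = re.fullmatch(r"\((\w+)\)(\d+)", notation)
    if match is None:
        raise ValueError(f"unrecognised repeat notation: {notation!r}")
    unit, count = match.group(1), int(match.group(2))
    return unit * count

start, end = 3203, 3222          # location from this record
repeat = expand_repeat("(gt)10")
span = end - start + 1           # number of bases covered by the location

print(repeat, len(repeat), span)
```

Both the expanded repeat and the annotated span are 20 bases long.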
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
Type    Start  End   Strand
L1PA15  1324   2772  +
MER5B   2875   3029  -
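The repeat coordinates can be compared with the predicted CDS (1664..2056) to see how much of the coding region lies inside annotated repeats. The sketch below is an illustrative Python calculation, assuming that the repeat and CDS coordinates share the same 1-based inclusive transcript numbering; the `overlap` helper is hypothetical.

```python
def overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of positions shared by two 1-based inclusive intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)

cds = (1664, 2056)  # predicted CDS of this record
repeats = {"L1PA15": (1324, 2772), "MER5B": (2875, 3029)}

# Overlap, in bases, between each annotated repeat and the CDS.
shared = {name: overlap(*span, *cds) for name, span in repeats.items()}
print(shared)
```

Under that assumption the L1PA15 element covers all 393 bases of the predicted CDS, while MER5B does not overlap it.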
Database links
Human-Gene diversity Of Life-style related Diseases (H-GOLD);
Related H-InvDB links
VaryGene; Repeat Mask Viewer;