*--------------------------------------------------------------------------------------------------* Release information of H-EPD http://www.h-invitational.jp/hinv/hepd/ Released on 2013/09/11 ver.1.4. *--------------------------------------------------------------------------------------------------* -------------------------------------------------------------------------------------------------- Data information -------------------------------------------------------------------------------------------------- H-InvDB: release 8.0 extended, April 2012 UniProt: UniProtKB/Swiss-Prot release July 2012 RefSeq: RefSeq release 54 (August 17, 2012) -------------------------------------------------------------------------------------------------- Data format -------------------------------------------------------------------------------------------------- 1.File UniProt By comparing with 36,665 H-InvDB protein coding representative transcripts and 20,231 UniProt reviewed human proteins, 15,317 HITs were related to reviewed human proteins one-to-one except for eleven HITs that correspond to multiple Uniprot accession numbers (ACs). These proteins and 5,762 proteins unique to Uniprot are combined and assigned original Uniprot ACs in the first annotation line of each entry. These records are started with "sp". The related H-InvDB identifiers, category (cat), and chromosome number (ch) are described in this order in the last part of this line with semicolon delimited format. Example: >sp|O15013|ARHGA_HUMAN Rho guanine nucleotide exchange factor 10 OS=Homo sapiens GN=ARHGEF10 PE=1 SV=4; HIX0021591 HIT000000001 HIP000042367;cat=I;ch=8 Example of duplicated HITs on UniProt protein: >sp|A6NNC1_1|P12LL_HUMAN Putative POM121-like protein 1-like OS=Homo #sapiens PE=5 SV=3;HIX0164251 HIT000022265_02 HIP000155997;cat=II;ch=5 >sp|A6NNC1_2|P12LL_HUMAN Putative POM121-like protein 1-like OS=Homo #sapiens PE=5 SV=3;HIX0164252 HIT000022265_03 HIP000155997;cat=II;ch=5 Example of UniProt unique proteins: >sp|Q04917|1433F_HUMAN 14-3-3 protein eta OS=Homo sapiens GN=YWHAH PE=1 SV=4;NA;cat=NA;ch=22 -------------------------------------------------------------------------------------------------- 2.File RefSeq By comparing with 34,586 RefSeq proteins with UniProt proteins, 1,578 RefSeq proteins were remained to be unique and 295 RefSeq proteins were related to H-InvDB HITs. These records are started with "gi". Example: >gi|40316912|ref|NP_000322.2| serum amyloid A-1 protein preproprotein;HIX0009484 HIT000338277 HIP000068310;cat=I;ch=11 Example of duplicated HITs on RefSeq protein: >gi|4503965_1|ref|NP_000504.1| medium-wave-sensitive opsin 1;HIX0056274 HIT000501672_01 HIP000255209;cat=I;ch=X >gi|4503965_2|ref|NP_000504.1| medium-wave-sensitive opsin 1;HIX0203337 HIT000501672_02 HIP000255209;cat=III;ch=X Example of RefSEq unique proteins: >gi|157412240|ref|NP_001094800.1| C1GALT1-specific chaperone 1-like;NA;cat=NA;ch=2 -------------------------------------------------------------------------------------------------- 3.File H-InvDB By comparing with UniProt and RefSeq proteins, 17,363 H-InvDB HITs were unique to H-InvDB. These records are started with "hi". If non-human UniProt AC is related with HIT, it described in the second column. Example: >hi|HIP000065638|HIT000001367|HIX0019274| Hypothetical protein;Q9BYA9;cat=V;ch=2