H-InvDB Enrichment Analysis Tool (HEAT) is a data-mining tool for automatically identifying features specific to a given human gene set. HEAT searches for
H-InvDB annotations that are significantly enriched in a user-defined gene set, as compared with the entire H-InvDB representative transcripts. This technique is called gene set enrichment analysis (GSEA), and is popularly used in analyzing results of microarray experiments. Fisher's exact probability is used in statistical tests of HEAT.
Submit two or more human gene IDs. Acceptable IDs are HUGO Gene Symbols, Accession Numbers of INSD (DDBJ/EMBL/GenBank), H-InvDB Transcript IDs (HIT), Locus IDs (HIX), Protein IDs (HIP), UniProt Accession Numbers, RefSeq, GeneID, PDB ID, Ensembl Transcript IDs, Ensembl Gene IDs, FLJ IDs, Clone IDs, KEGG IDs, and HPRD IDs. IDs need to be separated by spaces or commas. You can insert return codes. You can upload a text file with gene IDs.
By pushing the Submit Gene-Set button, the HEAT converts given IDs into H-InvDB representative transcript IDs. Here, the HEAT system utilizes the web service of
ID Converter System to convert human gene IDs.
The submitted gene IDs are converted into corresponding Transcript IDs (HIT) of H-InvDB representative transcripts, and are shown. If there are no corresponding IDs in H-InvDB, the IDs are left blank, and will not be used in the enrichment analysis. Push the Execute button to start finding enriched features for this gene set.
Enriched features of the given gene set is shown. For each feature, the ID and name of feature, the number of occurrences/genes of a submitted gene set, the number of occurrences/genes among all H-InvDB representative transcripts, and P-values are shown. The features with P-values smaller than 0.01, sorted by P-values, are shown. The following features of H-InvDB are analyzed.
| Annotation | Number |
| InterPro | 7,532 |
| Gene Ontology: Biological Process | 638 |
| Gene Ontology: Cellular Component | 197 |
| Gene Ontology: Molecular Function | 978 |
| KEGG pathway | 175 |
| Chromosomal band | 881 |
| Gene family | 3,631 |
| SCOP (Structural domains) | 2,655 |
| Subcellular localization prediction (by using Wolf PSORT) | 11 |
| Tissue-specific gene expression (10 tissue categories defined in H-ANGEL) | 10 |
| Sequence motifs in promoter regions (JASPAR) | 130 |
| Protein-Protein Interaction (PPI) | 10,869 |
If you click the "download" button, you can download the results in text format. If you click the number of occurrences in the feature list, HITs containing the corresponding feature will be shown in pop-up windows.
HEAT conducts gene set enrichment analysis by calculating P-values using Fisher's exact probability test. Let
N be the number of all protein-coding representative transcripts in H-InvDB,
n be the number of occurrences of a particular feature,
K be the number of genes submitted, and
k be the number of occurrences of the feature in the given gene set. Then, the P-value is calculated by the following equation.
Imanishi T, Itoh T, Suzuki Y, O'Donovan C, Fukuchi S, Koyanagi KO, Barrero RA, Tamura T, Yamaguchi-Kabata Y, Tanino M, Yura K, Miyazaki S, Ikeo K, Homma K, Kasprzyk A, Nishikawa T, Hirakawa M, Thierry-Mieg J, Thierry-Mieg D, Ashurst J, Jia L, Nakao M, Thomas MA, Mulder N, Karavidopoulou Y, Jin L, Kim S, Yasuda T, Lenhard B, Eveno E, Suzuki Y, Yamasaki C, Takeda J, Gough C, Hilton P, Fujii Y, Sakai H, Tanaka S, Amid C, Bellgard M, Bonaldo Mde F, Bono H, Bromberg SK, Brookes AJ, Bruford E, Carninci P, Chelala C, Couillault C, de Souza SJ, Debily MA, Devignes MD, Dubchak I, Endo T, Estreicher A, Eyras E, Fukami-Kobayashi K, Gopinath GR, Graudens E, Hahn Y, Han M, Han ZG, Hanada K, Hanaoka H, Harada E, Hashimoto K, Hinz U, Hirai M, Hishiki T, Hopkinson I, Imbeaud S, Inoko H, Kanapin A, Kaneko Y, Kasukawa T, Kelso J, Kersey P, Kikuno R, Kimura K, Korn B, Kuryshev V, Makalowska I, Makino T, Mano S, Mariage-Samson R, Mashima J, Matsuda H, Mewes HW, Minoshima S, Nagai K, Nagasaki H, Nagata N, Nigam R, Ogasawara O, Ohara O, Ohtsubo M, Okada N, Okido T, Oota S, Ota M, Ota T, Otsuki T, Piatier-Tonneau D, Poustka A, Ren SX, Saitou N, Sakai K, Sakamoto S, Sakate R, Schupp I, Servant F, Sherry S, Shiba R, Shimizu N, Shimoyama M, Simpson AJ, Soares B, Steward C, Suwa M, Suzuki M, Takahashi A, Tamiya G, Tanaka H, Taylor T, Terwilliger JD, Unneberg P, Veeramachaneni V, Watanabe S, Wilming L, Yasuda N, Yoo HS, Stodolsky M, Makalowski W, Go M, Nakai K, Takagi T, Kanehisa M, Sakaki Y, Quackenbush J, Okazaki Y, Hayashizaki Y, Hide W, Chakraborty R, Nishikawa K, Sugawara H, Tateno Y, Chen Z, Oishi M, Tonellato P, Apweiler R, Okubo K, Wagner L, Wiemann S, Strausberg RL, Isogai T, Auffray C, Nomura N, Gojobori T, and Sugano S (2004)
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones. PLoS Biology 2: 856-875.