Published online 1 January 2004
Nucleic Acids Research, 2004, Vol. 32, Database issue D548-D551
http://nar.oupjournals.org/cgi/content/full/32/suppl_1/D548


"EICO (Expression-based Imprint Candidate Organizer): finding disease-related imprinted genes".

Itoshi Nikaido 1, 2, 3, Chika Saito 3, Akiko Wakamoto 3, Yasuhiro Tomaru 3, Takahiro Arakawa 3, Yoshihide Hayashizaki 1, 3 and Yasushi Okazaki* 2, 3

1 Division of Genomic Information Resource Exploration, Science of Biological Supramolecular Systems, Yokohama City University, Graduate School of Integrated Science, Yokohama, Kanagawa 230-0045, Japan,
2 Division of Functional Genomics and Systems Medicine, Research Center for Genomic Medicine, Saitama Medical School, Saitama 350-1241, Japan and
3 Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045, Japan

*To whom correspondence should be addressed at Division of Functional Genomics and Systems Medicine, Research Center for Genomic Medicine, 1397-1 Yamane, Hidaka City, Saitama 350-1241, Japan.
Tel: +81 429 85 7319; Fax: +81 429 85 7329; Email: okazaki@saitama-med.ac.jp



Abstract:
Introduction:
Table 1: Contents of the EICO:
References:
Additional References:
Other Links:
Further Information and Feedback:


Abstract:

We have developed an integrated database that is specialized for the study of imprinted disease genes. The database contains novel candidate imprinted genes identified by the RIKEN full-length mouse cDNA microarray study, information on validated single nucleotide polymorphisms (SNPs) to confirm imprinting using reciprocal mouse crosses and the predicted physical position of imprinting-related disease loci in the mouse and human genomes. It has two user-friendly search interfaces: the SNP-central view (MuSCAT: MoUse SNP CATalog) and the candidate gene-central view (CITE: Candidate Imprinted Transcripts by Expression). The database, EICO (Expression-based Imprint Candidate Organizer), can be accessed via the World Wide Web ( http://fantom2.gsc.riken.jp/EICODB/ ) and the DAS client software. These data and interfaces facilitate
understanding of the mechanism of imprinting in mammalian inherited traits.

INTRODUCTION

Genomic imprinting results in the expression of individual genes from only one of two parental chromosomes and affects growth and behavior after birth in mammals (1). Aberrant imprinting can lead to various diseases due to an effective doubling of gene dosage. Conversely, genetic diseases display complex inheritance patterns, through the male or female line, when the affected gene falls within a maternally or paternally imprinted locus.
Identification of the network of imprinted genes will provide insight into the molecular mechanisms that underlie
imprinting-related phenotypes and diseases. To date 60 imprinted mouse genes have been identified using various methods ( http://www.mgu.har.mrc.ac.uk/imprinting/all_impmaps.html ). Genomic imprinting involves promoter methylation and/or natural antisense transcripts (NATs) of imprinted or neighboring genes (2); however, the details are unclear. Imprinting clearly cannot be predicted from genomic sequencing and annotation alone (1). We have established an efficient method of screening for candidate imprinted transcripts, and target genes by comparing mRNA expression profiles between parthenogenotes and androgenotes using RIKEN cDNA microarrays (3,4). Although our screening method is very efficient, a fraction (32%) of the identified candidate genes proved to be non-imprinted (3). These non-imprinted genes could be regulated by imprinted genes.

To confirm the imprinted status of candidate transcripts, we performed reciprocal crosses with Mus musculus molossinus (MSM), a Japanese wild mouse strain, and analyzed the resulting transcripts for polymorphisms that distinguish paternal from maternal loci. Since MSM is phylogenetically 1 million years apart from common laboratory mouse strains, it exhibits frequent genetic polymorphisms with laboratory mice. To this end, we searched for polymorphisms in the 3'-end of the transcripts between MSM and C57BL/6J mouse lines and the results were assembled into the EICO. In this paper, we report the construction and implementation of the EICO ( http://fantom2.gsc.riken.jp/EICODB/ ), which efficiently stores and retrieves three kinds of data: (i) candidate imprinted transcripts from microarray analysis, (ii) single nucleotide polymorphisms (SNPs) between the 3'-end sequences of the RIKEN full-length cDNAs from C57BL/6J and MSM mice, and (iii) imprinting-related
disease loci extracted from OMIM (5). The relationship between disease loci and novel imprinted mRNAs identifies new candidates that may be involved causally in imprinting-related human genetic diseases.

Table 1. Contents of the EICO.
 
Contents Number
SNPs between MSM and C57BL/6J (1) 2850 (1281 genes)
Candidate imprinted genes (2) Paternal: 698; maternal: 1403
Genes overlapped between 1 and 2 (3) 243
Candidate imprinted genes on predicted imprinting-related disease loci on human genome (4) 529 genes; 65 diseases; 109 loci
Genes overlapped between 3 and 4 114 genes

The EICO has 2850 SNPs, 2101 candidate imprinted genes and 529 candidate imprinted genes within predicted imprinting-related disease loci.


REFERENCES

1. Reik,W. and Walter,J. (2001) Genomic imprinting: parental influence on the genome. Nature Rev. Genet., 2, 21–32.

2. Sleutels,F., Zwart,R. and Barlow,D.P. (2002) The non-coding Air RNA is required for silencing autosomal imprinted genes. Nature, 415, 810–813.

3. Mizuno,Y., Sotomaru,Y., Katsuzawa,Y., Kono,T., Meguro,M., Oshimura,M., Kawai,J., Tomaru,Y., Kiyosawa,H., Nikaido,I. et al. (2002) Asb4, Ata3 and Dcn are novel imprinted genes identified by high-throughput screening using RIKEN cDNA microarray. Biochem. Biophys. Res. Commun., 290, 1499–1505.

4. Nikaido,I., Saito,C., Mizuno,Y., Meguro,M., Bono,H., Kadomura,M., Kono,T., Morris,G.A., Lyons,P.A.,
Oshimura,M. et al. (2003) Discovery of imprinted transcripts in the mouse transcriptome using large-scale expression profiling. Genome Res., 13, 1402–1409.

5. Hamosh,A., Scott,A.F., Amberger,J., Bocchini,C., Valle,D. and McKusick,V.A. (2002) Online Mendelian
Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res., 30,
52–55.
6. Ewing,B., Hillier,L., Wendl,M.C. and Green,P. (1998) Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res., 8, 175–185.

7. Ewing,B. and Green,P. (1998) Base-calling of automated sequencer traces using phred. II. Error probabilities.
Genome Res., 8, 186–194.

8. Bono,H., Kasukawa,T., Furuno,M., Hayashizaki,Y. and Okazaki,Y. (2002) FANTOM DB: database of functional annotation of RIKEN mouse cDNA clones. Nucleic Acids Res., 30, 116–118.

9. Okazaki,Y., Furuno,M., Kasukawa,T., Adachi,J., Bono,H., Kondo,S., Nikaido,I., Osato,N., Saito,R., Suzuki,H. et al. (2002) Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature, 420, 563–573.

10. Kasukawa,T., Furuno,M., Nikaido,I., Bono,H., Hume,D.A., Bult,C., Hill,D.P., Baldarelli,R., Gough,J., Kanapin,A. et al. (2003) Development and evaluation of an automated annotation pipeline and cDNA annotation system. Genome Res., 13, 1542–1551.

11. Clamp,M., Andrews,D., Barker,D., Bevan,P., Cameron,G., Chen,Y., Clark,L., Cox,T., Cuff,J., Curwen,V. et al. (2003) Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res., 31, 38–42.

12. Dowell,R.D., Jokerst,R.M., Day,A., Eddy,S.R. and Stein,L. (2001) The Distributed Annotation System. BMC Bioinformatics, 2, 7.

13. Kiyosawa,H., Yamanaka,I., Osato,N., Kondo,S. and Hayashizaki,Y. (2003) Antisense transcripts with FANTOM2 clone set and their implications for gene regulation. Genome Res., 13, 1324–1334.

14. Numata,K., Kanai,A., Saito,R., Kondo,S., Adachi,J., Wilming,L.G., Hume,D.A., Hayashizaki,Y. and Tomita,M. (2003) Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Res., 13, 1301–1306.

15. Ning,Z., Cox,A.J. and Mullikin,J.C. (2001) SSAHA: a fast search method for large DNA databases. Genome Res., 11, 1725–1729.

16. Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.
 



Additional References:

1. Cao X, Aufsatz W, Zilberman D, Mette MF, Huang MS, Matzke M, and Jacobsen SE, "Role of the DRM and CMT3 Methyltransferases in RNA-Directed DNA Methylation",  Current Biology, vol 13, no. 24, pp. 2212-2217 (16 December 2003).

2. Report of "RIKEN Mouse Genome Encyclopedia" project: the whole system from mouse house to database.
    Published in Genome Research, vol. 13, no. 6b, pp. 1265-1561 (June2, 2003).

3. Topics in:  Euchromatin,  active DNA, and  RNA  ribo-regulators:

Reviews and Research:

Links to Euchromatin Activator RNA Reviews:
Links to Euchromatin Activator RNA Research:
Links to RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma Immuno-Pathology:
Links to Activated T-Lymphocyte Immunotherapy:
Links to Medical Systems Biology:

"Ultrastructural Probes of Active DNA Sites, and the RNA Activators of DNA".
 



Top of Page - Euchromatin Network - Current Research - Forums - Other Sites - Future Events -

For Further Information and Feedback:
E-mail: frenster@euchromatin.net