Itoshi Nikaido 1, 2, 3, Chika Saito 3, Akiko Wakamoto 3, Yasuhiro Tomaru 3, Takahiro Arakawa 3, Yoshihide Hayashizaki 1, 3 and Yasushi Okazaki* 2, 3
1 Division of Genomic Information Resource Exploration,
Science of Biological Supramolecular Systems, Yokohama City University,
Graduate School of Integrated Science, Yokohama, Kanagawa 230-0045, Japan,
2 Division of Functional Genomics and Systems Medicine,
Research Center for Genomic Medicine, Saitama Medical School, Saitama 350-1241,
Japan and
3 Laboratory for Genome Exploration Research Group, RIKEN
Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Yokohama, Kanagawa
230-0045, Japan
*To whom correspondence should be addressed at Division of Functional
Genomics and Systems Medicine, Research Center for Genomic Medicine, 1397-1
Yamane, Hidaka City, Saitama 350-1241, Japan.
Tel: +81 429 85 7319; Fax: +81 429 85 7329; Email: okazaki@saitama-med.ac.jp
We have developed an integrated database that is specialized for
the study of imprinted disease genes. The database contains novel candidate
imprinted genes identified by the RIKEN full-length mouse cDNA microarray
study, information on validated single nucleotide polymorphisms (SNPs)
to confirm imprinting using reciprocal mouse crosses and the predicted
physical position of imprinting-related disease loci in the mouse and human
genomes. It has two user-friendly search interfaces: the SNP-central view
(MuSCAT: MoUse SNP CATalog) and the candidate gene-central view (CITE:
Candidate Imprinted Transcripts by Expression). The database, EICO (Expression-based
Imprint Candidate Organizer), can be accessed via the World Wide Web (
http://fantom2.gsc.riken.jp/EICODB/
) and the DAS client software. These data and interfaces facilitate
understanding of the mechanism of imprinting in mammalian inherited
traits.
Genomic imprinting results in the expression of individual genes
from only one of two parental chromosomes and affects growth and behavior
after birth in mammals (1). Aberrant imprinting can lead
to various diseases due to an effective doubling of gene dosage. Conversely,
genetic diseases display complex inheritance patterns, through the male
or female line, when the affected gene falls within a maternally or paternally
imprinted locus.
Identification of the network of imprinted genes will provide insight
into the molecular mechanisms that underlie
imprinting-related phenotypes and diseases. To date 60 imprinted
mouse genes have been identified using various methods ( http://www.mgu.har.mrc.ac.uk/imprinting/all_impmaps.html
). Genomic imprinting involves promoter methylation and/or natural antisense
transcripts (NATs) of imprinted or neighboring genes (2);
however, the details are unclear. Imprinting clearly cannot be predicted
from genomic sequencing and annotation alone (1). We
have established an efficient method of screening for candidate imprinted
transcripts, and target genes by comparing mRNA expression profiles between
parthenogenotes and androgenotes using RIKEN cDNA microarrays (3,4).
Although our screening method is very efficient, a fraction (32%) of the
identified candidate genes proved to be non-imprinted (3).
These non-imprinted genes could be regulated by imprinted genes.
To confirm the imprinted status of candidate transcripts, we performed
reciprocal crosses with Mus musculus molossinus (MSM), a Japanese
wild mouse strain, and analyzed the resulting transcripts for polymorphisms
that distinguish paternal from maternal loci. Since MSM is phylogenetically
1 million years apart from common laboratory mouse strains, it exhibits
frequent genetic polymorphisms with laboratory mice. To this end, we searched
for polymorphisms in the 3'-end of the transcripts between MSM and C57BL/6J
mouse lines and the results were assembled into the EICO. In this paper,
we report the construction and implementation of the EICO ( http://fantom2.gsc.riken.jp/EICODB/
), which efficiently stores and retrieves three kinds of data: (i) candidate
imprinted transcripts from microarray analysis, (ii) single nucleotide
polymorphisms (SNPs) between the 3'-end sequences of the RIKEN full-length
cDNAs from C57BL/6J and MSM mice, and (iii) imprinting-related
disease loci extracted from OMIM (5). The relationship
between disease loci and novel imprinted mRNAs identifies new candidates
that may be involved causally in imprinting-related human genetic diseases.
Table 1. Contents of the EICO.
|
||||||||||||||
The EICO has 2850 SNPs, 2101 candidate imprinted genes and 529 candidate imprinted genes within predicted imprinting-related disease loci.
REFERENCES
1. Reik,W. and Walter,J. (2001) Genomic imprinting: parental influence
on the genome. Nature Rev. Genet., 2, 21–32.
2. Sleutels,F., Zwart,R. and Barlow,D.P. (2002) The non-coding Air
RNA is required for silencing autosomal imprinted genes. Nature, 415, 810–813.
3. Mizuno,Y., Sotomaru,Y., Katsuzawa,Y., Kono,T., Meguro,M., Oshimura,M.,
Kawai,J., Tomaru,Y., Kiyosawa,H., Nikaido,I. et al. (2002) Asb4, Ata3 and
Dcn are novel imprinted genes identified by high-throughput screening using
RIKEN cDNA microarray. Biochem. Biophys. Res. Commun., 290, 1499–1505.
4. Nikaido,I., Saito,C., Mizuno,Y., Meguro,M., Bono,H., Kadomura,M.,
Kono,T., Morris,G.A., Lyons,P.A.,
Oshimura,M. et al. (2003) Discovery of imprinted transcripts in
the mouse transcriptome using large-scale expression profiling. Genome
Res., 13, 1402–1409.
5. Hamosh,A., Scott,A.F., Amberger,J., Bocchini,C., Valle,D. and
McKusick,V.A. (2002) Online Mendelian
Inheritance in Man (OMIM), a knowledgebase of human genes and genetic
disorders. Nucleic
Acids Res., 30,
52–55.
6. Ewing,B., Hillier,L., Wendl,M.C. and Green,P. (1998) Base-calling
of automated sequencer traces using phred. I. Accuracy assessment. Genome
Res., 8, 175–185.
7. Ewing,B. and Green,P. (1998) Base-calling of automated sequencer
traces using phred. II. Error probabilities.
Genome Res., 8, 186–194.
8. Bono,H., Kasukawa,T., Furuno,M., Hayashizaki,Y. and Okazaki,Y. (2002) FANTOM DB: database of functional annotation of RIKEN mouse cDNA clones. Nucleic Acids Res., 30, 116–118.
9. Okazaki,Y., Furuno,M., Kasukawa,T., Adachi,J., Bono,H., Kondo,S., Nikaido,I., Osato,N., Saito,R., Suzuki,H. et al. (2002) Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature, 420, 563–573.
10. Kasukawa,T., Furuno,M., Nikaido,I., Bono,H., Hume,D.A., Bult,C., Hill,D.P., Baldarelli,R., Gough,J., Kanapin,A. et al. (2003) Development and evaluation of an automated annotation pipeline and cDNA annotation system. Genome Res., 13, 1542–1551.
11. Clamp,M., Andrews,D., Barker,D., Bevan,P., Cameron,G., Chen,Y., Clark,L., Cox,T., Cuff,J., Curwen,V. et al. (2003) Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res., 31, 38–42.
12. Dowell,R.D., Jokerst,R.M., Day,A., Eddy,S.R. and Stein,L. (2001) The Distributed Annotation System. BMC Bioinformatics, 2, 7.
13. Kiyosawa,H., Yamanaka,I., Osato,N., Kondo,S. and Hayashizaki,Y. (2003) Antisense transcripts with FANTOM2 clone set and their implications for gene regulation. Genome Res., 13, 1324–1334.
14. Numata,K., Kanai,A., Saito,R., Kondo,S., Adachi,J., Wilming,L.G., Hume,D.A., Hayashizaki,Y. and Tomita,M. (2003) Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Res., 13, 1301–1306.
15. Ning,Z., Cox,A.J. and Mullikin,J.C. (2001) SSAHA: a fast search method for large DNA databases. Genome Res., 11, 1725–1729.
16. Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z.,
Miller,W. and Lipman,D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation
of protein database search programs. Nucleic Acids Res., 25, 3389–3402.
1. Cao X, Aufsatz W, Zilberman D, Mette MF, Huang MS, Matzke M, and Jacobsen SE, "Role of the DRM and CMT3 Methyltransferases in RNA-Directed DNA Methylation", Current Biology, vol 13, no. 24, pp. 2212-2217 (16 December 2003).
2. Report of "RIKEN Mouse Genome Encyclopedia" project: the whole
system from mouse house to database.
Published in Genome
Research, vol. 13, no. 6b, pp. 1265-1561 (June2, 2003).
3. Topics in: Euchromatin, active DNA, and RNA ribo-regulators:
Reviews and Research:
Links to
Euchromatin Activator RNA Reviews:
Links to
Euchromatin Activator RNA Research:
Links to
RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma
Immuno-Pathology:
Links to Activated
T-Lymphocyte Immunotherapy:
Links to Medical
Systems Biology:
"Ultrastructural
Probes of Active DNA Sites, and the RNA Activators of DNA".
For Further Information and Feedback:
E-mail: frenster@euchromatin.net