Construction and application of an electronic spatiotemporal expression profile and gene ontology analysis platform based on the EST database of the silkworm, Bombyx mori
1National Engineering Laboratory for Modern Silk, Department of Applied Biology, Medical College of Soochow
University, Suzhou, 215153, P. R. China
2Biology Department, Chongqing Three Gorges University, Chongqing, 404000, China
3Bioinformatics Department, Medical College, Soochow University, Suzhou, 215153, China
Abstract
An Expressed Sequence Tag (EST) is a short sub-sequence of a transcribed cDNA sequence. ESTs represent gene expression and give good clues for gene expression analysis. Based on EST data obtained from NCBI, an EST analysis package was developed (apEST). This tool was programmed for electronic expression, protein annotation and Gene Ontology (GO) category analysis in Bombyx mori (L.) (Lepidoptera: Bombycidae). A total of 245,761 ESTs (as of 01 July 2009) were searched and downloaded in FASTA format, from which information for tissue type, development stage, sex and strain were extracted, classified and summed by running apEST. Then, corresponding distribution profiles were formed after redundant parts had been removed. Gene expression profiles for one tissue of different developmental stages and from one development stage of the different tissues were attained. A housekeeping gene and tissue-andstage- specific genes were selected by running apEST, contrasting with two other online analysis approaches, microarray-based gene expression profile on SilkDB (BmMDB) and EST profile on NCBI. A spatio-temporal expression profile of catalase run by apEST was then presented as a three-dimensional graph for the intuitive visualization of patterns. A total of 37 query genes confirmed from microarray data and RT–PCR experiments were selected as queries to test apEST. The results had great conformity among three approaches. Nevertheless, there were minor differences between apEST and BmMDB because of the unique items investigated. Therefore, complementary analysis was proposed. Application of apEST also led to the acquisition of corresponding protein annotations for EST datasets and eventually for their functions. The results were presented according to statistical information on protein annotation and Gene Ontology (GO) category. These all verified the reliability of apEST and the operability of this platform. The apEST can also be applied in other species by modifying some parameters and serves as a model for gene expression study for Lepidoptera.
Keywords: EST analysis package, UniGene, Lepidoptera
Abbreviations: EST, Expressed Sequence Tag; apEST, EST analysis package; dbEST, database EST; GO, Gene
Ontology; TPM, transcripts per million; BmMDB, Microarray-based gene expression profile on SilkDB; MSG, middle
silkgland; PSG, posterior silkgland
Correspondence:
aganmei790717@163.com,
bmichael_0214@126.com,
cdushuhuniu@hotmail.com,
dxulisd@sina.com,
e13771956726@139.com,
fsudajmm@yahoo.com.cn,
g*szsqxu@suda.edu.cn,
*Corresponding author
Associate Editor: Brad Coates was editor of this paper.
Received: 1 September 2009 | Accepted: 25 January 2010 | Published: 23 July 2010
Copyright: This is an open access paper. We use the Creative Commons Attribution 3.0 license that permits unrestricted use, provided that the paper is properly attributed.
ISSN: 1536-2442 | Volume 10, Number 114
Gan LP, Zhang WY, Niu YS, Xu L, Xi J, Ji MM, Xu SQ. 2010. Construction and application of an electronic spatiotemporal expression profile and gene ontology analysis platform based on the EST database of the silkworm, Bombyx mori. Journal of Insect Science 10:114, available online: insectscience.org/10.114



