Personal tools
You are here: Home Rätsch Group People Gunnar Rätsch, Dr. Publications RASE: Recognition of alternatively Spliced Exons in C. elegans
Navigation
 

Gunnar Rätsch, Sören Sonnenburg, and Bernhard Schölkopf (2005)

RASE: Recognition of alternatively Spliced Exons in C. elegans

In: Bioinformatics, vol. 21(Suppl. 1), pp. i369.

Motivation: Eukaryotic pre-mRNAs are spliced to form mature mRNA. Pre-mRNA alternative splicing greatly increases the \%diversity of proteins and the complexity of gene expression. Estimates show that more than half of the human genes and at least a third of the genes of less complex organisms such as nematodes or flies are alternatively spliced. In this work we consider one major form of alternative splicing, namely the exclusion of exons from the transcript. It has been shown that alternatively spliced exons have certain properties that distinguish them from constitutively spliced exons. While most recent computational studies on alternative splicing only apply to exons which are conserved among two species, our method only uses information that is available to the splicing machinery, i.e. the DNA sequence itself. We employ advanced machine learning techniques in order to answer the following two questions: a) Is a certain exon alternatively spliced? b) How can we identify yet unidentified exons within verified introns? Results: We designed a Support Vector Machine (SVM) kernel well suited for the task of classifying sequences with motifs having positional preferences. In order to solve the task a), we combine the kernel with additional local sequence information such as lengths of the exon and the flanking introns. The resulting SVM based classifier achieves a true positive rate of 48,5\% at a false positive rate of 1\%. By scanning over single EST-confirmed exons we identified 215 potentially alternatively spliced exons. For 10 randomly selected such exons we successfully performed biological verification experiments and confirmed 3 novel alternatively spliced exons. To answer question b), we additionally used SVM based predictions to recognize acceptor and donor splice sites. Combined with the above-mentioned features we were able to identify 85,2\% of skipped exons within verified introns at a false positive rate of 1\%.