Engineering support vector machine kernels that recognize translation initiation sites

Alexander Zien(Fraunhofer Institute for Algorithms and Scientific Computing), Gunnar Rätsch(Fraunhofer Institute for Open Communication Systems), S. Mika(Fraunhofer Institute for Open Communication Systems), Bernhard Schölkopf(Microsoft (United States)), Thomas Lengauer(Fraunhofer Institute for Applied Information Technology), K. Müller(Fraunhofer Institute for Open Communication Systems)
Bioinformatics
September 1, 2000
Cited by 425Open Access
Full Text

Abstract

MOTIVATION: In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS). RESULTS: The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to incorporate prior biological knowledge by engineering an appropriate kernel function. With the described techniques the recognition performance can be improved by 26% over leading existing approaches. We provide evidence that existing related methods (e.g. ESTScan) could profit from advanced TIS recognition.


Related Papers

No related papers found

Powered by citation graph analysis