InterProScan 5: genome-scale protein function classification

Philip Jones (European Bioinformatics Institute), David Binns (European Bioinformatics Institute), Hsin-Yu Chang (European Bioinformatics Institute), Matthew Fraser (European Bioinformatics Institute), Weizhong Li (European Bioinformatics Institute), Craig McAnulla (European Bioinformatics Institute), Hamish McWilliam (European Bioinformatics Institute), John Maslen (European Bioinformatics Institute), Alex Mitchell (European Bioinformatics Institute), Gift Nuka (European Bioinformatics Institute), Sebastien Pesseat (European Bioinformatics Institute), A. F. Quinn (European Bioinformatics Institute), Amaia Sangrador-Vegas (European Bioinformatics Institute), Maxim Scheremetjew (European Bioinformatics Institute), Siew-Yit Yong (European Bioinformatics Institute), Rodrigo López (European Bioinformatics Institute), Sarah Hunter (European Bioinformatics Institute)
Bioinformatics
January 23, 2014
Cited by 9,909
Open Access

Abstract

Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic science, where biologists frequently need to characterize many millions of sequences. Here, we describe a new Java-based architecture for the widely used protein function prediction software package InterProScan. Developments include improvements and additions to the outputs of the software and the complete reimplementation of the software framework, resulting in a flexible and stable system that can use multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis. InterProScan is freely available for download from the EMBL-EBI FTP site, and the open source code is hosted at Google Code.

Availability and implementation: InterProScan is distributed via FTP at ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/5/ and the source code is available from http://code.google.com/p/interproscan/.

Contact: http://www.ebi.ac.uk/support or interhelp@ebi.ac.uk or mitchell@ebi.ac.uk

