Taverna: a tool for the composition and enactment of bioinformatics workflowsAbstract Motivation: In silico experiments in bioinformatics involve the co-ordinated use of computational tools and information repositories. A growing number of these resources are being made available with programmatic access in the form of Web services. Bioinformatics scientists will need to orchestrate these Web services in workflows as part of their analyses. Results: The Taverna project has developed a tool for the composition and enactment of bioinformatics workflows for the life sciences community. The tool includes a workbench application which provides a graphical user interface for the composition of workflows. These workflows are written in a new language called the simple conceptual unified flow language (Scufl), where by each step within a workflow represents one atomic task. Two examples are used to illustrate the ease by which in silico experiments can be represented as Scufl workflows using the workbench application. Availability: The Taverna workflow system is available as open source and can be downloaded with example Scufl workflows from http://taverna.sourceforge.net
Taverna: lessons in creating a workflow environment for the life sciencesTom Oinn, Mark Greenwood, Matthew Addis et al.|Concurrency and Computation Practice and Experience|2005 Abstract Life sciences research is based on individuals, often with diverse skills, assembled into research groups. These groups use their specialist expertise to address scientific problems. The in silico experiments undertaken by these research groups can be represented as workflows involving the co‐ordinated use of analysis programs and information repositories that may be globally distributed. With regards to Grid computing, the requirements relate to the sharing of analysis and information resources rather than sharing computational power. The my Grid project has developed the Taverna Workbench for the composition and execution of workflows for the life sciences community. This experience paper describes lessons learnt during the development of Taverna. A common theme is the importance of understanding how workflows fit into the scientists' experimental context. The lessons reflect an evolving understanding of life scientists' requirements on a workflow environment, which is relevant to other areas of data intensive and exploratory science. Copyright © 2005 John Wiley & Sons, Ltd.
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.Sarcasm is a common phenomenon in social media, and is inherently difficult to analyse, not just automatically but often for humans \ntoo. It has an important effect on sentiment, but is usually ignored in social media analysis, because it is considered too tricky to handle. \nWhile there exist a few systems which can detect sarcasm, almost no work has been carried out on studying the effect that sarcasm has on \nsentiment in tweets, and on incorporating this into automatic tools for sentiment analysis. We perform an analysis of the effect of sarcasm \nscope on the polarity of tweets, and have compiled a number of rules which enable us to improve the accuracy of sentiment analysis when \nsarcasm is known to be present. We consider in particular the effect of sentiment and sarcasm contained in hashtags, and have developed \na hashtag tokeniser for GATE, so that sentiment and sarcasm found within hashtags can be detected more easily. According to our experiments, \nthe hashtag tokenisation achieves 98% Precision, while the sarcasm detection achieved 91% Precision and polarity detection 80%.
A semantic approach to IE pattern inductionThis paper presents a novel algorithm for the acquisition of Information Extraction patterns. The approach makes the assumption that useful patterns will have similar meanings to those already identified as relevant. Patterns are compared using a variation of the standard vector space model in which information from an ontology is used to capture semantic similarity. Evaluation shows this algorithm performs well when compared with a previously reported document-centric approach.
Using Semantic Web Technologies for Representing E-science ProvenanceJun Zhao, Chris Wroe, Carole Goble et al.|Lecture notes in computer science|2004