J

José Castresana

Universitat Pompeu Fabra

ORCID: 0000-0002-9391-8050

Publishes on Genomics and Phylogenetic Studies, Genetic diversity and population structure, Extraction and Separation Processes. 119 papers and 21.8k citations.

119Publications
21.8kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis
José Castresana|Molecular Biology and Evolution|2000
Cited by 10.8kOpen Access

The use of some multiple-sequence alignments in phylogenetic analysis, particularly those that are not very well conserved, requires the elimination of poorly aligned positions and divergent regions, since they may not be homologous or may have been saturated by multiple substitutions. A computerized method that eliminates such positions and at the same time tries to minimize the loss of informative sites is presented here. The method is based on the selection of blocks of positions that fulfill a simple set of requirements with respect to the number of contiguous conserved positions, lack of gaps, and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. To illustrate the efficiency of this method, alignments of 10 mitochondrial proteins from several completely sequenced mitochondrial genomes belonging to diverse eukaryotes were used as examples. The percentages of removed positions were higher in the most divergent alignments. After removing divergent segments, the amino acid composition of the different sequences was more uniform, and pairwise distances became much smaller. Phylogenetic trees show that topologies can be different after removing conserved blocks, particularly when there are several poorly resolved nodes. Strong support was found for the grouping of animals and fungi but not for the position of more basal eukaryotes. The use of a computerized method such as the one presented here reduces to a certain extent the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible, and facilitates the reproduction of the final alignment by other researchers.

Improvement of Phylogenies after Removing Divergent and Ambiguously Aligned Blocks from Protein Sequence Alignments
Gerard Talavera, José Castresana|Systematic Biology|2007
Cited by 5.8k

Alignment quality may have as much impact on phylogenetic reconstruction as the phylogenetic methods used. Not only the alignment algorithm, but also the method used to deal with the most problematic alignment regions, may have a critical effect on the final tree. Although some authors remove such problematic regions, either manually or using automatic methods, in order to improve phylogenetic performance, others prefer to keep such regions to avoid losing any information. Our aim in the present work was to examine whether phylogenetic reconstruction improves after alignment cleaning or not. Using simulated protein alignments with gaps, we tested the relative performance in diverse phylogenetic analyses of the whole alignments versus the alignments with problematic regions removed with our previously developed Gblocks program. We also tested the performance of more or less stringent conditions in the selection of blocks. Alignments constructed with different alignment methods (ClustalW, Mafft, and Probcons) were used to estimate phylogenetic trees by maximum likelihood, neighbor joining, and parsimony. We show that, in most alignment conditions, and for alignments that are not too short, removal of blocks leads to better trees. That is, despite losing some information, there is an increase in the actual phylogenetic signal. Overall, the best trees are obtained by maximum-likelihood reconstruction of alignments cleaned by Gblocks. In general, a relaxed selection of blocks is better for short alignment, whereas a stringent selection is more adequate for longer ones. Finally, we show that cleaned alignments produce better topologies although, paradoxically, with lower bootstrap. This indicates that divergent and problematic alignment regions may lead, when present, to apparently better supported although, in fact, more biased topologies.

Phylogenetic and Ecological Analysis of Novel Marine Stramenopiles
Ramón Massana, José Castresana, Vanessa Balagué et al.|Applied and Environmental Microbiology|2004
Cited by 328Open Access

Culture-independent molecular analyses of open-sea microorganisms have revealed the existence and apparent abundance of novel eukaryotic lineages, opening new avenues for phylogenetic, evolutionary, and ecological research. Novel marine stramenopiles, identified by 18S ribosomal DNA sequences within the basal part of the stramenopile radiation but unrelated to any previously known group, constituted one of the most important novel lineages in these open-sea samples. Here we carry out a comparative analysis of novel stramenopiles, including new sequences from coastal genetic libraries presented here and sequences from recent reports from the open ocean and marine anoxic sites. Novel stramenopiles were found in all major habitats, generally accounting for a significant proportion of clones in genetic libraries. Phylogenetic analyses indicated the existence of 12 independent clusters. Some of these were restricted to anoxic or deep-sea environments, but the majority were typical components of coastal and open-sea waters. We specifically identified four clusters that were well represented in most marine surface waters (together they accounted for 74% of the novel stramenopile clones) and are the obvious targets for future research. Many sequences were retrieved from geographically distant regions, indicating that some organisms were cosmopolitan. Our study expands our knowledge on the phylogenetic diversity and distribution of novel marine stramenopiles and confirms that they are fundamental members of the marine eukaryotic picoplankton.