Publishes on Machine Learning in Materials Science, Computational Drug Discovery Methods, Organic Electronics and Photovoltaics. 193 papers and 27.7k citations.
BACKGROUND: A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry data, differences in the data stored by different formats (0D versus 3D, for example), and competition between software along with a lack of vendor-neutral formats. RESULTS: We discuss, for the first time, Open Babel, an open-source chemical toolbox that speaks the many languages of chemical data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chemical and molecular data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. CONCLUSIONS: Open Babel presents a solution to the proliferation of multiple chemical file formats. In addition, it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chemical data in areas such as organic chemistry, drug design, materials science, and computational chemistry. It is freely available under an open-source license from http://openbabel.org.
BACKGROUND: The Avogadro project has developed an advanced molecule editor and visualizer designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science, and related areas. It offers flexible, high quality rendering, and a powerful plugin architecture. Typical uses include building molecular structures, formatting input files, and analyzing output of a wide variety of computational chemistry packages. By using the CML file format as its native document type, Avogadro seeks to enhance the semantic accessibility of chemical data types. RESULTS: The work presented here details the Avogadro library, which is a framework providing a code library and application programming interface (API) with three-dimensional visualization capabilities; and has direct applications to research and education in the fields of chemistry, physics, materials science, and biology. The Avogadro application provides a rich graphical interface using dynamically loaded plugins through the library itself. The application and library can each be extended by implementing a plugin module in C++ or Python to explore different visualization techniques, build/manipulate molecular structures, and interact with other programs. We describe some example extensions, one which uses a genetic algorithm to find stable crystal structures, and one which interfaces with the PackMol program to create packed, solvated structures for molecular dynamics simulations. The 1.0 release series of Avogadro is the main focus of the results discussed here. CONCLUSIONS: Avogadro offers a semantic chemical builder and platform for visualization and analysis. For users, it offers an easy-to-use builder, integrated support for downloading from common databases such as PubChem and the Protein Data Bank, extracting chemical data from a wide variety of formats, including computational chemistry output, and native, semantic support for the CML file format. For developers, it can be easily extended via a powerful plugin mechanism to support new features in organic chemistry, inorganic complexes, drug design, materials, biomolecules, and simulations. Avogadro is freely available under an open-source license from http://avogadro.openmolecules.net.
Molecular scale charge motion in disordered organic materials at ambient temperature occurs via a hopping-type mechanism with rates dictated both by the charge transfer integral and by the reorganization energy due to geometric relaxation. This contribution presents a systematic theoretical analysis of cation internal reorganization energies for a broad family of organic oligoheterocycles-variation of reorganization energy with oligomer chain length, heteroatom identity, and a range of heterocycle substituents provides key information on important structural properties governing internal reorganization energies. At room temperature, the range in reorganization energies induced by substituent variations corresponds to a >10(2)-fold variation in intrinsic hole transfer rate, suggesting that changes in reorganization energy dominate variations in charge-transfer rates for many semiconducting/conducting oligomers.
BACKGROUND: Scripting languages such as Python are ideally suited to common programming tasks in cheminformatics such as data analysis and parsing information from files. However, for reasons of efficiency, cheminformatics toolkits such as the OpenBabel toolkit are often implemented in compiled languages such as C++. We describe Pybel, a Python module that provides access to the OpenBabel toolkit. RESULTS: Pybel wraps the direct toolkit bindings to simplify common tasks such as reading and writing molecular files and calculating fingerprints. Extensive use is made of Python iterators to simplify loops such as that over all the molecules in a file. A Pybel Molecule can be easily interconverted to an OpenBabel OBMol to access those methods or attributes not wrapped by Pybel. CONCLUSION: Pybel allows cheminformaticians to rapidly develop Python scripts that manipulate chemical information. It is open source, available cross-platform, and offers the power of the OpenBabel toolkit to Python programmers.
For electron or hole transfer between neighboring conducting polymer strands or oligomers, the intrinsic charge-transfer rate is dictated by the charge-resonance integral and by the reorganization energy due to geometric relaxation. To explain conduction anisotropy and other solid-state effects, a multivariate, systematic analysis of bandwidth as a function of intermolecular orientations is undertaken for a series of oligoheterocycles, using first-principles methods. While cofacial oligomers show the greatest bandwidths at a given intermolecular C-C contact distance, for a fixed center-to-center intermolecular distance, tilted pi-stacking increases pi-overlap (particularly for LUMO orbitals) and decreases electrostatic repulsion, yielding optimum tilt angles for packing of approximately 40-60 degrees at small intermolecular separations. The calculations also reveal that bandwidths and intrinsic mobilities of holes and electrons in conjugated oligoheterocycles can be quite comparable.