MassBank: a public repository for sharing mass spectral data for life sciences

Hisayuki Horai(Keio University), Masanori Arita(Keio University), Shigehiko Kanaya(Nara Institute of Science and Technology), Yoshito Nihei(Keio University), Tasuku Ikeda(Keio University), Kazuhiro Suwa(The University of Tokyo), Yuya Ojima(Keio University), Kenichi Tanaka(University of Toyama), Satoshi Tanaka(Japan Science and Technology Agency), Ken Aoshima(Japan Science and Technology Agency), Yoshiya Oda(Japan Science and Technology Agency), Yuji Kakazu(Keio University), Miyako Kusano(RIKEN Center for Sustainable Resource Science), Takayuki Tohge(RIKEN Center for Sustainable Resource Science), Fumio Matsuda(RIKEN Center for Sustainable Resource Science), Yuji Sawada(Japan Science and Technology Agency), Masami Yokota Hirai(Japan Science and Technology Agency), Hiroki Nakanishi(Japan Science and Technology Agency), Kazutaka Ikeda(Japan Science and Technology Agency), Naoshige Akimoto(Kyoto University), Takashi Maoka(Research Institute for Production Development), Hiroki Takahashi(Nara Institute of Science and Technology), Takeshi Ara(Kazusa DNA Research Institute), Nozomu Sakurai(Kazusa DNA Research Institute), Hideyuki Suzuki(Kazusa DNA Research Institute), Daisuke Shibata(Kazusa DNA Research Institute), Steffen Neumann(Leibniz Institute of Plant Biochemistry), Takashi Iida(Nihon University), Ken Tanaka(University of Toyama), Kimito Funatsu(The University of Tokyo), Fumito Matsuura(Fukuyama University), Tomoyoshi Soga(Keio University), Ryo Taguchi(Japan Science and Technology Agency), Kazuki Saito(RIKEN Center for Sustainable Resource Science), Takaaki Nishi­oka(Keio University)
Journal of Mass Spectrometry
July 1, 2010
Cited by 2,495Open Access
Full Text

Abstract

MassBank is the first public repository of mass spectra of small chemical compounds for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry (EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MS(n) data of 2337 authentic compounds of metabolites, 11 545 EI-MS and 834 other-MS data of 10,286 volatile natural and synthetic compounds, and 3045 ESI-MS(2) data of 679 synthetic drugs contributed by 16 research groups (January 2010). ESI-MS(2) data were analyzed under nonstandardized, independent experimental conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more experimental conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calculated by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS(2) data. MassBank also provides a merged spectrum for each compound prepared by merging the analyzed ESI-MS(2) data on an identical compound under different collision-induced dissociation conditions. Data merging has significantly improved the precision of the identification of a chemical compound by 21-23% at a similarity score of 0.6. Thus, MassBank is useful for the identification of chemical compounds and the publication of experimental data.


Related Papers

No related papers found

Powered by citation graph analysis