Deep Scattering Spectrum

Joakim Andén(Centre de Mathématiques Appliquées), Stéphane Mallat(École Normale Supérieure - PSL)
IEEE Transactions on Signal Processing
July 21, 2014
Cited by 651Open Access
Full Text

Abstract

A scattering transform defines a locally translation invariant representation which is stable to time-warping deformation. It extends MFCC representations by computing modulation spectrum coefficients of multiple orders, through cascades of wavelet convolutions and modulus operators. Second-order scattering coefficients characterize transient phenomena such as attacks and amplitude modulation. A frequency transposition invariant representation is obtained by applying a scattering transform along log-frequency. State-the-of-art classification results are obtained for musical genre and phone classification on GTZAN and TIMIT databases, respectively.


Related Papers

No related papers found

Powered by citation graph analysis