TY - JOUR
T1 - Interpretable and Learnable Super-Resolution Time-Frequency Representation
AU - Balestriero, Randall
AU - Glotin, Hervé
AU - Baraniuk, Richard G.
N1 - Funding Information:
Acknowledgments: Randall Balestriero and Richard Baraniuk are supported by NSF grants CCF-1911094, IIS-1838177, and IIS-1730574; ONR grants N00014-18-12571, N00014-20-1-2787, and N00014-20-1-2534; AFOSR grant FA9550-18-1-0478; and a Vannevar Bush Faculty Fellowship, ONR grant N00014-18-1-2047. Hervé Glotin is supported by AI Chair ADSIL ANR-20-CHIA-0014, SMILES ANR-18-CE40-0014.
Publisher Copyright:
© 2021 Randall Balestriero, Hervé Glotin & Richard G. Baraniuk.
PY - 2021
Y1 - 2021
N2 - We develop a novel interpretable and learnable time-frequency representation (TFR) that produces a super-resolved quadratic signal representation for time-series analysis; the proposed TFR is a Gaussian filtering of the Wigner-Ville (WV) transform of a signal parametrized with a few interpretable parameters. Our approach has two main hallmarks. First, by varying the filters applied onto the WV, our new TFR can interpolate between known TFRs such as spectrograms, wavelet transforms, and chirplet transforms. Beyond that, our representation can also reach perfect time and frequency localization, hence super-resolution; this generalizes standard TFRs whose resolution is limited by the Heisenberg uncertainty principle. Second, our proposed TFR is interpretable thanks to an explicit low-dimensional and physical parametrization of the WV Gaussian filtering. We demonstrate that our approach enables us to learn highly adapted TFRs and is able to tackle a range of large-scale classification tasks, where we reach higher performance compared to baseline and learned TFRs. Ours is to the best of our knowledge the first learnable TFR that can continuously interpolate between super-resolution representation and commonly employed TFRs based on a few learnable parameters and which preserves full interpretability of the produced TFR, even after learning.
AB - We develop a novel interpretable and learnable time-frequency representation (TFR) that produces a super-resolved quadratic signal representation for time-series analysis; the proposed TFR is a Gaussian filtering of the Wigner-Ville (WV) transform of a signal parametrized with a few interpretable parameters. Our approach has two main hallmarks. First, by varying the filters applied onto the WV, our new TFR can interpolate between known TFRs such as spectrograms, wavelet transforms, and chirplet transforms. Beyond that, our representation can also reach perfect time and frequency localization, hence super-resolution; this generalizes standard TFRs whose resolution is limited by the Heisenberg uncertainty principle. Second, our proposed TFR is interpretable thanks to an explicit low-dimensional and physical parametrization of the WV Gaussian filtering. We demonstrate that our approach enables us to learn highly adapted TFRs and is able to tackle a range of large-scale classification tasks, where we reach higher performance compared to baseline and learned TFRs. Ours is to the best of our knowledge the first learnable TFR that can continuously interpolate between super-resolution representation and commonly employed TFRs based on a few learnable parameters and which preserves full interpretability of the produced TFR, even after learning.
KW - Audio Classification
KW - Chirplet
KW - Cohen Class
KW - Environmental Sounds
KW - Explain-ability
KW - Gabor Transform
KW - Interpretability
KW - Learnable Time-Frequency Representation
KW - Spectrogram
KW - Speech Recognition
KW - Super-Resolution
KW - Time-Series
KW - Wavelet
KW - Wigner-Ville
KW - bioacoustics
KW - biosonar
UR - http://www.scopus.com/inward/record.url?scp=85163943920&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163943920&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85163943920
SN - 2640-3498
VL - 145
SP - 118
EP - 152
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 2nd Mathematical and Scientific Machine Learning Conference, MSML 2021
Y2 - 16 August 2021 through 19 August 2021
ER -