Adaptive Front-ends for End-to-end Source Separation
Shrikant Venkataramani · Paris Smaragdis
2017 Talk
in
Workshop: Machine Learning for Audio Signal Processing (ML4Audio)
in
Workshop: Machine Learning for Audio Signal Processing (ML4Audio)
Abstract
(+ Jonah Casebeer) Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. We present an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for end-to-end supervised source separation.
Chat is not available.
Successful Page Load