Oriol Vinyals: From Speech to Text and Back: Neural Sequence Models for Speech Processing
2016 Talk
in
Workshop: End-to-end Learning for Speech and Audio Processing
in
Workshop: End-to-end Learning for Speech and Audio Processing
Abstract
In my talk, I will present recent advances in neural sequence models that our group has focused on. I will describe efforts on speech synthesis using WaveNets, a model which can generate wavefroms at 16KHz, and also seq2seq models "Listen/Watch, Attend and Spell" for audio-visual speech recognition efforts for speech recognition and lip reading.
Chat is not available.
Successful Page Load