Oriol Vinyals: From Speech to Text and Back: Neural Sequence Models for Speech Processing

2016 Talk in Workshop: End-to-end Learning for Speech and Audio Processing

Abstract

In my talk, I will present recent advances in neural sequence models that our group has focused on. I will describe our work on speech synthesis using WaveNets, a model that can generate waveforms at 16 kHz, and on seq2seq models ("Listen/Watch, Attend and Spell") for audio-visual speech recognition, covering both speech recognition and lip reading.
