Timezone: »

The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer · Jan Koehler · Thomas Pfeil

Wed Dec 05 07:45 AM -- 09:45 AM (PST) @ Room 210 #51

Deep neural networks, and in particular recurrent networks, are promising candidates to control autonomous agents that interact in real-time with the physical world. However, this requires a seamless integration of temporal features into the network’s architecture. For the training of and inference with recurrent neural networks, they are usually rolled out over time, and different rollouts exist. Conventionally during inference, the layers of a network are computed in a sequential manner resulting in sparse temporal integration of information and long response times. In this study, we present a theoretical framework to describe rollouts, the level of model-parallelization they induce, and demonstrate differences in solving specific tasks. We prove that certain rollouts, also for networks with only skip and no recurrent connections, enable earlier and more frequent responses, and show empirically that these early responses have better performance. The streaming rollout maximizes these properties and enables a fully parallel execution of the network reducing runtime on massively parallel devices. Finally, we provide an open-source toolbox to design, train, evaluate, and interact with streaming rollouts.

Author Information

Volker Fischer (Robert Bosch GmbH, Bosch Center for Artificial Intelligence)
Jan Koehler (Robert Bosch GmbH)
Thomas Pfeil (Robert Bosch GmbH)

More from the Same Authors