Add lookahead row convolution layer. #2228

Closed
xinghai-sun opened this issue May 22, 2017 · 3 comments

@xinghai-sun (Contributor)

  • Add a lookahead row convolution layer, for both the CPU and GPU versions.
  • Details can be found in the DS2 paper.
  • Please add a design doc here first.
@qingqing01 (Contributor) commented May 25, 2017

Bidirectional RNN models are challenging to deploy in an online, low-latency setting because they operate on an entire sample at once. This row convolution layer is used to build a unidirectional model containing forward-only RNN layers without any loss in accuracy. The layer uses a future context of T steps, which makes it well suited to a deployment system. The details can be found in the papers.

Difference from sequence_conv in PaddlePaddle.

In PaddlePaddle, we can use paddle.layers.context_projection and paddle.layers.fc to do sequence convolution, which is a 1D convolution. The following figure shows this connection. Assume the context length (or filter size) of the convolution kernel is 3, and the hidden size of each time step is d for both the input and output features of this operation. Thus, the weight dimension is 3d x d, and the weights are shared across all time steps.
(figure: text_conv)
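For clarity, here is a minimal NumPy sketch of that connection (not the actual PaddlePaddle implementation); the function name and the `context_start` default are assumptions for illustration:

```python
import numpy as np

def sequence_conv(h, W, context_start=-1):
    """Sequence (1D) convolution as context projection + shared FC.

    h: (T, d) input sequence.
    W: (3*d, d) weight, shared across all time steps.
    For every step t, concatenate the 3 context rows starting at
    t + context_start (zero-padded outside the sequence) into a 3*d
    vector and project it down to d with W.
    """
    T, d = h.shape
    k = W.shape[0] // d                       # context length, 3 here
    padded = np.zeros((T + k - 1, d))
    padded[-context_start:-context_start + T] = h
    out = np.empty((T, d))
    for t in range(T):
        ctx = padded[t:t + k].reshape(-1)     # concatenated context, length 3*d
        out[t] = ctx @ W                      # same 3d x d projection at every step
    return out
```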

Different from the sequence convolution, the row convolution operates row-wise on both the weight (W) and the hidden state (h): each feature column is convolved independently across time. This connection is shown in the following figure. The weight dimension is 3 x d, and the weights are shared across all time steps.

(figure: row_conv_op)
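A corresponding NumPy sketch of the row-oriented operation (again only a sketch, not the real op); for simplicity it computes only the steps that have a full future context, and the boundary is handled below:

```python
import numpy as np

def row_conv_valid(h, W):
    """Row (lookahead) convolution over the steps with full future context.

    h: (T, d) hidden states.
    W: (k, d) weight, shared across all time steps.
    out[t, i] = sum_{j=0..k-1} W[j, i] * h[t + j, i]
    i.e. each feature column is convolved with its own length-k filter.
    """
    T, d = h.shape
    k = W.shape[0]
    out = np.zeros((T - k + 1, d))
    for j in range(k):
        out += W[j] * h[j:j + T - k + 1]      # elementwise, column by column
    return out
```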

How to deal with the boundary.

Assume the context length (or filter size) of the convolution kernel is k. The paper does not mention how to do the lookahead convolution for the last k-1 time steps. I think we can pad k-1 rows of zeros after the last time step of the input feature.
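A sketch of that zero-padding scheme (an assumption, since the paper leaves the boundary unspecified), which keeps the output length equal to T:

```python
import numpy as np

def row_conv(h, W):
    """Row convolution with k-1 zero rows padded after the last time step."""
    T, d = h.shape
    k = W.shape[0]
    padded = np.vstack([h, np.zeros((k - 1, d), dtype=h.dtype)])  # pad at the end
    out = np.zeros((T, d), dtype=h.dtype)
    for j in range(k):
        out += W[j] * padded[j:j + T]         # missing future steps contribute zero
    return out
```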

@xinghai-sun (Contributor, Author)

Great doc! @qingqing01 Could you please add this part to the DS2 design doc? Thanks!

@qingqing01 (Contributor)

@xinghai-sun ok. I'll do it.
