Skip to content

Commit

Permalink
Merge pull request #4852 from neillu23/joss_paper
Browse files Browse the repository at this point in the history
Update for the latest changes in the draft (minor changes)
  • Loading branch information
mergify[bot] committed Jan 6, 2023
2 parents 698997d + 6fd0e84 commit a55853b
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions doc/paper/espnet-se++/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -127,7 +127,7 @@ The `inference` function in `enh_inference.py` creates a

class SeparateSpeech

object with the data-iterator for testing and validation. During its initialization, the class builds an SSE object `ESPnetEnhancementModel` based on a pair of configuration and a pre-trained SSE model.
object with the data-iterator for testing and validation. During its initialization, this class instantiate an SSE object `ESPnetEnhancementModel` based on a pair of configuration and a pre-trained SSE model.

#### bin/enh_scoring.py
def scoring(..., ref_scp, inf_scp, ...)
Expand All @@ -136,7 +136,7 @@ The SSE scoring functions calculates several popular objective scores such as SI
### SSE Control Class `tasks/enh.py`

class EnhancementTask(AbsTask)
`EnhancementTask` is a control class which is designed for SSE task, containing class methods for building and training an SSE model. Class method `build_model` creates and returns an SSE object `ESPnetEnhancementModel`.
`EnhancementTask` is a control class which is designed for SSE tasks. It contains class methods for building and training an SSE model. Class method `build_model` creates and returns an SSE object `ESPnetEnhancementModel`.

### SSE Modules `enh/espnet_model.py`

Expand Down Expand Up @@ -242,7 +242,7 @@ The demonstrations of ESPnet-SE can be found in the following google colab links


# Development plan
The development plan of the ESPnet-SE++ can be found in https://github.com/espnet/espnet/issues/2200. In addition, we would explore the combinations with other front-end tasks, such as using ASR as a front-end model and TTS as a back-end model for speech-to-speech conversion, making the combination more flexible.
The development plan of the ESPnet-SE++ can be found in https://github.com/espnet/espnet/issues/2200. In addition, we will explore the combinations with other front-end tasks, such as using ASR as a front-end model and TTS as a back-end model for speech-to-speech conversion.

# Conclusions
In this paper, we introduce the software structure and the user interface of ESPnet-SE++, including the SSE task and joint-task models. ESPnet-SE++ provides general recipes for training models on different corpus and a simple way for adding new recipes. The joint-task implementation further shows that the modularized design improves the flexibility of ESPnet.
Expand Down

0 comments on commit a55853b

Please sign in to comment.