Spoken Language Acquisition From Conversation Based On Reinforcement Learning

Overview

The master branch contains the code for the paper:

Continuous Action Space-based Spoken Language Acquisition Agent Using Residual Sentence Embedding and Transformer Decoder

The spolacq2.1 branch contains the code for the papers:

Spoken Language Acquisition Based on Reinforcement Learning and Word Unit Segmentation
Sound-Image Grounding Based Focusing Mechanism for Efficient Automatic Spoken Language Acquisition
Pronunciation adaptive self speaking agent using WaveGrad
Self-Supervised Spoken Question Understanding and Speaking With Automatic Vocabulary Learning
Automatic spoken language acquisition based on observation and dialogue

About Author

Shinozaki Lab, Tokyo Tech

Usage

# setup for python==3.7.12 setuptools==59.5.0 wheel==0.37.1 with CUDA 11.1
pip install -r requirements.txt -f https://download.pytorch.org/whl/torch_stable.html

# download the pretrained HiFi-GAN files from https://drive.google.com/drive/folders/1YuOoV3lO2-Hhn1F2HJ2aQ4S0LC1JdKLd
# and place them at the following paths
# - models
#   - hifi-gan
#     - config.json
#     - do_02500000
#     - g_02500000

sh egs/setup.sh
sh egs/run_spolacq3.sh
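
Before running the scripts, it can help to confirm that the HiFi-GAN files are where the layout above puts them. The following is a minimal sketch, not part of the repository: the function name `check_hifigan_files` is hypothetical, and it only assumes the three file names and the `models/hifi-gan` directory listed above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with this repository).
# Checks that the pretrained HiFi-GAN files listed in the setup
# comments exist in the given directory (default: models/hifi-gan).
check_hifigan_files() {
    dir="${1:-models/hifi-gan}"
    missing=0
    for f in config.json do_02500000 g_02500000; do
        if [ ! -e "$dir/$f" ]; then
            # Report each missing file on stderr and remember the failure.
            echo "missing: $dir/$f" >&2
            missing=1
        fi
    done
    # Exit status 0 if all files are present, 1 otherwise.
    return "$missing"
}
```

With the function defined in the current shell, `check_hifigan_files && sh egs/setup.sh && sh egs/run_spolacq3.sh` would stop early and name the missing files instead of failing later inside the scripts.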

If you have trouble building box2d-py, try installing the following packages first:

sudo apt install xvfb xorg-dev libsdl2-dev swig cmake

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
