🚀 Google Summer Of Code 2019 Project - Creation of an online Greek mail dictation system

Welcome to the home repository of "Creation of an online Greek mail dictation system using Sphinx and personalized acoustic/language model training".

This project is implemented as a Google Summer of Code 2019 project, under the auspices of Open Technologies Alliance - GFOSS.

About the project

With over 2.6 billion active users and over 4.6 billion email accounts in operation, email is one of the most important and widely used communication media on the Internet. Over the last fifteen years, the rise of social networks and chat applications has changed the role of email, which is now used mainly for business rather than casual conversation. The inbox is often the first thing people check when they arrive at work. Because email has become so important to businesses, convenience, speed and accuracy are essential. Dictation can provide all three: users can compose emails while moving from one place to another, finish them faster than by typing, and make fewer spelling mistakes, since the computer is responsible for the correct spelling.

In the modern era of Big Data, many dictation systems have already been implemented, and they reach high accuracy on the metrics commonly used to evaluate them. However, each system targets a specific language, because spoken languages differ so much from one another. A Greek mail dictation system therefore has to be built from scratch around the Greek language and its unique characteristics. A basic problem is that training requires a large set of human-transcribed recordings, while only very small Greek speech datasets are available. The purpose of this project is therefore a personalized Greek mail dictation system that is trained on the speech of each user (speaker dependent). This addresses the data problem: the user is asked for a few dictations at the start, and the system is trained on these recordings. It is worth noting that this restriction does not pose a problem in practice, since each email address corresponds to a single user. In addition, the system's performance is enhanced by adapting the language model to the user's existing emails. Extra utilities, such as special dictation commands and email replay, facilitate user interaction and make the whole procedure faster and more practical.
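To make the idea of adapting a language model to a user's emails more concrete, here is a minimal sketch in plain Python. It is not the project's implementation: the project builds its language models with SRILM and adapts them through email clustering, whereas this toy example simply interpolates unigram word frequencies. The function names, the example sentences and the weight `lam` are all assumptions chosen for illustration.

```python
from collections import Counter


def unigram_probs(sentences):
    """Return relative word frequencies over a list of sentences."""
    counts = Counter(word for s in sentences for word in s.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def interpolate(general, user, lam=0.5):
    """Linearly interpolate two unigram models.

    lam is the weight given to the user model (hypothetical default).
    """
    vocab = set(general) | set(user)
    return {
        word: lam * user.get(word, 0.0) + (1.0 - lam) * general.get(word, 0.0)
        for word in vocab
    }


# Toy corpora (illustrative only, not project data).
general_corpus = ["καλημέρα σας", "ευχαριστώ πολύ"]
user_emails = ["καλημέρα κύριε διευθυντά", "ευχαριστώ για την αναφορά"]

general_lm = unigram_probs(general_corpus)
user_lm = unigram_probs(user_emails)

# Give more weight to the user's own vocabulary.
adapted_lm = interpolate(general_lm, user_lm, lam=0.7)
```

Words that appear in the user's emails gain probability mass in `adapted_lm`, which is the intuition behind preferring a user's own vocabulary during recognition. A real system would use higher-order n-grams and smoothing.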

Demo

The project is hosted at https://snf-870149.vm.okeanos.grnet.gr.

Note: The webpage and the API currently use self-signed SSL certificates. Before using the webpage, you therefore need to accept both certificates: open https://snf-870149.vm.okeanos.grnet.gr and https://snf-870149.vm.okeanos.grnet.gr:5000 in your browser and, on each page, click Advanced and then Proceed to url.
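The same certificate issue affects scripts that talk to the API directly. The following is a minimal sketch, using only the Python standard library, of how a client might skip certificate verification for this demo host. The helper names are hypothetical, and whether the root path of the API returns a successful response is an assumption. Disabling verification is only reasonable for a demo server that you trust.

```python
import ssl
import urllib.error
import urllib.request

# API address taken from the note above.
API_URL = "https://snf-870149.vm.okeanos.grnet.gr:5000"


def make_unverified_context() -> ssl.SSLContext:
    """Build an SSL context that accepts self-signed certificates."""
    ctx = ssl.create_default_context()
    # check_hostname must be disabled before verify_mode can be CERT_NONE.
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    return ctx


def fetch_status(url: str = API_URL, timeout: float = 10.0) -> int:
    """Return the HTTP status code of a GET request to url."""
    try:
        with urllib.request.urlopen(
            url, context=make_unverified_context(), timeout=timeout
        ) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (e.g. 404).
        return err.code
```

Calling `fetch_status()` performs a network request against the demo host, so it only works while the server is online.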

Timeline and Documentation

A detailed timeline, organized according to the GSoC schedule, can be found here.
The progress of the project was tracked on a daily basis in Project.
More details can be found in the Wiki and in the Final Report.

The whole model as a block diagram follows:

Overview

Getting started
Tools
API and UI
- API Documentation
- Angular UI
Other
- Licensing
- Future Work

Technologies used

The project is written in Python 3.x and uses the Python packages listed in the requirements file.
Speech recognition is done with the pocketsphinx library from CMUSphinx.
All language models are created using SRILM.
All required user data is stored in MongoDB.
The UI is based on Angular 8.

Project Deliverables

Tool for extracting and cleaning sent emails of a Gmail user. (Code, Wiki)
Tool for creating adapted language models through email clustering. (Code, Wiki)
Tool for correcting ASR output. (Code, Wiki)
Various tools for preparing and evaluating a speech dataset. (Code, Wiki)
Simple tool for creating a speech dataset. (Code)
API written in Flask. (Code, Wiki)
Online webpage using Angular 8. (Code, Wiki)

People

Google Summer of Code 2019 Student: Panagiotis Antoniadis (PanosAntoniadis)
Mentor: Andreas Symeonidis (asymeon)
Mentor: Manos Tsardoulias (etsardou)
