Voice Enabled Natural Language Search Engine for Yelp

Dataset

This system runs on Yelp Dataset Challenge dataset (https://www.yelp.com/dataset_challenge). Download the dataset and extract the files to a folder.

Packages/tools needed to run the system:

The following packages needs to be installed for the system to work:

Speech Recognition 3.3.3 (https://pypi.python.org/pypi/SpeechRecognition/)
NLTK (http://www.nltk.org)
Geonamescache (https://pypi.python.org/pypi/geonamescache)
PyLucene (http://lucene.apache.org/pylucene/index.html)
Flask (http://flask.pocoo.org)
Bootstrap (http://getbootstrap.com)

Steps to run the system

Index the Yelp dataset:

The preprocessed data will be saved to data.json. The inverted index will be saved to ./index/ folder. This step has been done and all the output files has been uploaded to rod. You can skip to step 2.

python test_indexer.py <source to yelp dataset folder> data.json

Start the system:

This will load necessary data into memory and initialize all necessary objects and start a web server. The address to access the web server will be displayed. Enter that address in a browser to access the page.

python flask_jqry.py

Steps to test the voice search system

Index the Yelp dataset (if this has been done before, you can skip this step):

The preprocessed data will be saved to data.json. The inverted index will be saved to ./index/ folder. This step has been done and all the output files has been uploaded to rod. You can skip to step 2.

python test_indexer.py <source to yelp dataset folder> data.json

Start the voice search module using the following command and then follow on screen prompt:

python test_speech.py

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
documentation		documentation
static		static
README.md		README.md
distance.py		distance.py
documentation.pdf		documentation.pdf
flask_jqry.py		flask_jqry.py
index.html		index.html
indexer.py		indexer.py
query.py		query.py
query_speechrecognition.py		query_speechrecognition.py
searcher.py		searcher.py
test_indexer.py		test_indexer.py
test_speech.py		test_speech.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Enabled Natural Language Search Engine for Yelp

Dataset

Packages/tools needed to run the system:

Steps to run the system

Steps to test the voice search system

About

Releases

Packages

Contributors 2

Languages

UMGQ/YelpNLSearch

Folders and files

Latest commit

History

Repository files navigation

Voice Enabled Natural Language Search Engine for Yelp

Dataset

Packages/tools needed to run the system:

Steps to run the system

Steps to test the voice search system

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages