A basic command-line application that fetches a JSON catalog of products from S3, ingests the product data, and builds an information retrieval (IR) system. To keep this test short, it is fine to keep all data in memory; no database is required. You can choose the indexing method and how you prepare the data for it. The purpose of the index is to let the user search the dataset in real time, as with a search engine.
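As an illustration of the in-memory indexing idea described above, here is a minimal sketch of an inverted index with term-count scoring. It is not the project's actual implementation: the sample records, the field names (`name`, `brand`), and the helpers `tokenize`, `build_index`, and `search` are all hypothetical, and the real catalog schema may differ.

```python
import re
from collections import defaultdict

# Hypothetical sample records; the real catalog's fields are an assumption here.
PRODUCTS = [
    {"id": 1, "name": "Red running shoes", "brand": "Acme"},
    {"id": 2, "name": "Blue running jacket", "brand": "Acme"},
    {"id": 3, "name": "Red wool scarf", "brand": "Borealis"},
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(products, fields):
    """Map each token to {product_id: occurrence count} across the given fields."""
    index = defaultdict(lambda: defaultdict(int))
    for product in products:
        for field in fields:
            for token in tokenize(str(product.get(field, ""))):
                index[token][product["id"]] += 1
    return index


def search(index, query, limit=3):
    """Return up to `limit` product ids, best match first (ties broken by id)."""
    scores = defaultdict(int)
    for token in tokenize(query):
        for product_id, count in index.get(token, {}).items():
            scores[product_id] += count
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [product_id for product_id, _ in ranked[:limit]]


INDEX = build_index(PRODUCTS, fields=("name", "brand"))
print(search(INDEX, "red shoes"))
```

Summing raw term counts is the simplest possible ranking; a scheme such as TF-IDF or BM25 could be swapped in without changing the structure of the index.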
Note:
On Windows, you need Cygwin to run the Makefile.
git clone https://github.com/szepnapot/python-ir-system.git
# install pipenv
# set up virtualenv
# install dependencies
make install
To start the CLI:
make run
- settings can be found in config.ini
- unbounded cache for analyzers
- results are listed top to bottom in decreasing order of relevance; 3 are displayed by default
- basic queries, e.g.:
.[field_name] [random search query]
.help
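To make the notes above more concrete, the following sketch shows one way an unbounded analyzer cache and the dot-prefixed query syntax could fit together. It assumes the square brackets in the query example are placeholders rather than literal characters, and that a line without a leading dot searches all fields; both are assumptions, as are the function names `analyze` and `parse_query`.

```python
import re
from functools import lru_cache


@lru_cache(maxsize=None)  # maxsize=None makes the cache unbounded
def analyze(text):
    """Hypothetical analyzer: lowercase and tokenize, cached per input string."""
    return tuple(re.findall(r"[a-z0-9]+", text.lower()))


def parse_query(line):
    """Split a CLI line into (command, field, tokens).

    Assumed syntax: ".help" shows help, ".<field_name> <query>" searches one
    field, and anything else searches all fields (field is None).
    """
    line = line.strip()
    if line == ".help":
        return ("help", None, ())
    if line.startswith("."):
        field, _, rest = line[1:].partition(" ")
        return ("search", field, analyze(rest))
    return ("search", None, analyze(line))


print(parse_query(".name Red shoes"))
print(parse_query(".help"))
```

An unbounded cache never evicts entries, so memory grows with the number of distinct strings analyzed; that is acceptable for a short-lived CLI session but worth bounding in a long-running service.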