szepnapot/python-ir-system

Python IR (Information Retrieval) system

Code style: black | Python 3.7

A basic command-line application that fetches a JSON catalog of products from S3, ingests the product data, and builds an IR system over it. To keep this test short, it is fine to store all data in memory; you do not need any kind of database. You can choose the indexing method and how you prepare the data for it. The index exists so that the user can search the dataset in real time, as with a search engine.
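As a rough illustration of the in-memory indexing described above, the sketch below builds a small inverted index over product records and ranks matches by the number of query tokens they contain. The field names (`name`, `brand`), the sample records, the `tokenize` helper, and the scoring rule are all assumptions made for this example; they are not taken from this repository's code.

```python
from collections import defaultdict
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(products, fields):
    """Map each token to the set of product positions that contain it."""
    index = defaultdict(set)
    for position, product in enumerate(products):
        for field in fields:
            for token in tokenize(str(product.get(field, ""))):
                index[token].add(position)
    return index


def search(index, products, query, limit=3):
    """Return up to `limit` products, ranked by number of matching query tokens."""
    scores = defaultdict(int)
    for token in set(tokenize(query)):
        for position in index.get(token, ()):
            scores[position] += 1
    # Highest score first; ties keep catalog order.
    ranked = sorted(scores, key=lambda pos: (-scores[pos], pos))
    return [products[pos] for pos in ranked[:limit]]


# Hypothetical sample data; the real catalog is fetched from S3.
products = [
    {"name": "Red running shoes", "brand": "Acme"},
    {"name": "Blue running jacket", "brand": "Acme"},
    {"name": "Red wool scarf", "brand": "Nordic"},
]
index = build_index(products, fields=("name", "brand"))
results = search(index, products, "red running")
```

An inverted index keeps lookups proportional to the number of matching documents rather than the size of the catalog, which is one reasonable way to get interactive response times without a database.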

Prerequisites

Note:

On Windows, you need Cygwin to run the Makefile.

Usage

git clone https://github.com/szepnapot/python-ir-system.git
# install pipenv
# set up virtualenv
# install dependencies

make install

To start the CLI

make run

Info

  • settings can be found in config.ini
  • analyzers use an unbounded cache
  • results are listed top to bottom in decreasing order; the top 3 are displayed by default
  • basic queries, e.g.: .[field_name] [search query], .help
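The query syntax and the analyzer cache listed above could look roughly like the sketch below. It is only an illustration: `analyze`, `parse_query`, the default field `name`, and the handling of queries without a field prefix are hypothetical and not confirmed by the repository. The one standard-library fact it relies on is that `functools.lru_cache(maxsize=None)` gives an unbounded cache.

```python
from functools import lru_cache


@lru_cache(maxsize=None)  # maxsize=None makes the cache unbounded
def analyze(text):
    """Hypothetical analyzer: lowercase the text and split it on whitespace."""
    return tuple(text.lower().split())


def parse_query(line, default_field="name"):
    """Split a CLI input line into (command, field, terms).

    Assumed grammar: ".help" shows help, ".<field> <query>" searches one
    field, and anything else searches the (assumed) default field.
    """
    line = line.strip()
    if line == ".help":
        return ("help", None, ())
    if line.startswith("."):
        field, _, rest = line[1:].partition(" ")
        return ("search", field, analyze(rest))
    return ("search", default_field, analyze(line))
```

For example, `parse_query(".brand Acme Shoes")` yields a search on the `brand` field with the terms `acme` and `shoes`. Repeating the same query text reuses the cached analyzer output instead of recomputing it.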

Demo

CLI demo gif

