Skip to content
AMProduction edited this page Nov 24, 2021 · 13 revisions

Welcome to the SearchMyData App wiki!

1. Description

The app gives the possibility to perform a search into the Ukrainian government Open data portal datasets.
At this moment (November 2021) integrated Information on missing citizens, Information about people hiding from the authorities, Unified register of debtors, Unified State Register of Legal Entities, Individual Entrepreneurs and Public Associations and Integrated Unified State Register of Lustrated Persons.
To avoid overwhelming screen output the first 10 results are shown. Whole results datasets are saving into HTML files.


2. Requirements

Software requirements

For local run:

For Docker containers' run:

Hardware requirements

  • 16Gb+ RAM
  • SSD

3. Changelog

v1.9 Release. 24/11/2021

v1.8.5. Release. 20/11/2021

  • Refactoring to implement PEP 8

v1.8. Release. 04/11/2021

  • Performance optimization
  • Refactoring
  • Added unit tests
  • Added check if a dataset is outdated
  • Error handling added into Search functions
  • Updated dependencies

v1.7. Release. 28/10/2021

  • Dockerizing
  • Switched to Python 3.10.0
  • Added setup script for run in Docker
  • Added the run script
  • Added requirements.txt

v1.6. Release. 27/10/2021

  • Big refactoring and code optimization

v1.4. Release. 16/10/2021

v1.2. Release. 09/10/2021

  • Integrated Unified register of debtors
  • Get ZIP archive
  • Unzip resources
  • Read and parse huge CSV
  • Convert huge CSV to JSON
  • Purging temp files
  • Added performance metric into the log file: requests execution time

v1.0. First release. 26/09/2021

  • JSON downloading, parsing, and saving into MongoDB
  • Full-text search into datasets
  • Search results datasets are saving into HTML files
  • Integrated the missing person register and the wanted person register.
  • A full-text search could be performed in any of the three languages: Russian, Ukrainian, and English. Results are showing only in English.
  • Manual refreshing datasets
  • Errors handling
  • Full logging
  • Built-in managing database operations: creating/truncating collections, creating/deleting text indexes
  • Showing datasets the last update date

4. How to use

To be provided soon


5. Miscellaneous

Developed in PyCharm - The Python IDE for Professional Developers.
License kindly provided by JetBrains Community Support Team
JetBrains Logo (Main) logo PyCharm logo PyCharm logo

Clone this wiki locally