The Czech National Library Extractor is a Python-based project tailored for extracting data from the Czech National Library's catalog. It enables users to generate a CSV file containing book information derived from a search query link provided by the National Library. This tool is particularly beneficial for researchers, librarians, and anyone interested in efficiently accessing and managing book data from the Czech National Library.
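As a rough illustration of the CSV output described above, the following minimal sketch serializes a few book records to CSV text with Python's standard `csv` module. The column names (`title`, `author`, `year`), the `books_to_csv` helper, and the sample records are assumptions made for this example; the project's actual fields and code may differ.

```python
import csv
import io

# Hypothetical column names; the real extractor may export different fields.
FIELDS = ["title", "author", "year"]


def books_to_csv(books):
    """Serialize a list of book dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for book in books:
        # Missing fields become empty cells instead of raising KeyError.
        writer.writerow({field: book.get(field, "") for field in FIELDS})
    return buffer.getvalue()


# Sample records for illustration only, not real extractor output.
sample = [
    {"title": "R.U.R.", "author": "Karel Čapek", "year": "1920"},
    {"title": "Válka s mloky", "author": "Karel Čapek"},
]
csv_text = books_to_csv(sample)
print(csv_text)
```

Writing to an in-memory buffer keeps the sketch easy to test; to produce a file instead, the same writer can be pointed at `open("books.csv", "w", newline="", encoding="utf-8")`.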
- Clone the repository:
  git clone https://github.com/Boubik/CNLE.git
- Navigate to the project directory:
  cd CNLE
- If you are using Python 3.5 or older, install virtualenv:
  pip install virtualenv
- Create a virtual environment:
  python -m venv venv   # For Python 3.6 and newer
  virtualenv venv       # For Python 3.5 and older
- Activate the virtual environment:
  - For Unix-like systems:
    source venv/bin/activate
  - For Windows:
    venv\Scripts\activate.bat
- Install the required dependencies:
  pip install -r requirements.txt
- Set up the required settings in the configuration file:
  config.py
- Run the project:
  python app.py
- Open the app in your browser; it is served on localhost, port 5000: http://localhost:5000
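The environment-setup steps above depend on the Python version and the operating system. The sketch below shows one way to pick the matching commands from a script. The helper names `venv_command` and `activate_command` are hypothetical and not part of the project; the commands they return are the ones listed in the steps.

```python
import sys


def venv_command(version_info=None, env_dir="venv"):
    """Return the command that creates a virtual environment."""
    major, minor = (version_info or sys.version_info)[:2]
    if (major, minor) >= (3, 6):
        # Python 3.6 and newer: use the built-in venv module.
        return ["python", "-m", "venv", env_dir]
    # Python 3.5 and older: use the separately installed virtualenv tool.
    return ["virtualenv", env_dir]


def activate_command(platform=None, env_dir="venv"):
    """Return the shell command that activates the environment."""
    platform = platform or sys.platform
    if platform.startswith("win"):
        return env_dir + "\\Scripts\\activate.bat"
    return "source " + env_dir + "/bin/activate"


print(" ".join(venv_command()))
print(activate_command())
```

Activation has to happen in the user's own shell, so the sketch only returns the command instead of running it.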
- Extracts data from the Czech National Library.
- Saves data to a CSV file.
- Deduplication system (currently in beta).
- Hosted on cnle.boubik.cz, making it accessible as a web service.
- Sends extracted data directly to your email.
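The README does not describe how the deduplication system works. As one possible approach, the sketch below treats two records as duplicates when their title and author match after lowercasing, stripping accents, and collapsing whitespace. The `normalize` and `deduplicate` helpers and the field names are assumptions for illustration, not the project's actual implementation.

```python
import unicodedata


def normalize(text):
    """Lowercase, strip accents, and collapse whitespace for comparison."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_accents = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return " ".join(without_accents.lower().split())


def deduplicate(books):
    """Keep the first record for each normalized (title, author) pair."""
    seen = set()
    unique = []
    for book in books:
        key = (
            normalize(book.get("title", "")),
            normalize(book.get("author", "")),
        )
        if key not in seen:
            seen.add(key)
            unique.append(book)
    return unique


# Sample records for illustration only.
records = [
    {"title": "Válka s mloky", "author": "Karel Čapek"},
    {"title": "  valka s  mloky ", "author": "KAREL CAPEK"},
    {"title": "R.U.R.", "author": "Karel Čapek"},
]
unique_records = deduplicate(records)
print(len(unique_records))
```

Matching on normalized strings is deliberately strict: it catches differences in case, diacritics, and spacing, but not typos or differing editions, which would need fuzzier comparison.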
This project is licensed under the MIT License.