Skip to content

Boubik/CNLE

Repository files navigation

Czech National Library Extractor

Description

The Czech National Library Extractor is a Python-based project tailored for extracting data from the Czech National Library's catalog. It enables users to generate a CSV file containing book information derived from a search query link provided by the National Library. This tool is particularly beneficial for researchers, librarians, and anyone interested in efficiently accessing and managing book data from the Czech National Library.

Table of Contents

Installation

  1. Clone the repository:

    git clone https://github.com/Boubik/CNLE.git
  2. Navigate to the project directory: cd CNLE

    cd CNLE
  3. If you are using Python 3.5 or older, install virtualenv:

    pip install virtualenv
    
  4. Make virtual Python: python -m venv venv

     python -m venv venv  # For Python 3.6 and newer
     # or
     virtualenv venv      # For Python 3.5 and older
  5. Use virtual Python: source venv/bin/activate for Unix like systems or venv\Scripts\activate.bat for Windows

    • For Unix-like systems:

      source venv/bin/activate
    • For Windows:

      venv\Scripts\activate.bat
  6. Install the required dependencies: pip install -r requirements.txt

    pip install -r requirements.txt
  7. Setup all needed information in the configuration file: config.py

  8. Run the project:

    python app.py
  9. Connects to localhost on the port 5000: http://localhost:5000

Features

  • Extracts data from the Czech National Library.
  • Saves data to a CSV file.
  • Deduplication system (currently in beta).
  • Hosted on cnle.boubik.cz, making it accessible as a web service.
  • Sends extracted data directly to your email.

License

This project is licensed under the MIT License.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors