Lovecraft-scrapper

[Test] The goal is simple: create a Python program that retrieves data from a specific URL and prints some related statistics.

PS: This only works for web pages under https://hplovecraft.com/writings/texts/*
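
As an illustration of the restriction above, here is a minimal sketch of how a URL could be checked before scraping. The helper name `is_supported_url` and both constants are hypothetical and are not taken from the project's source; the project may validate URLs differently, or not at all.

```python
from urllib.parse import urlparse

# Hypothetical constants, derived only from the URL pattern quoted in this README.
ALLOWED_HOST = "hplovecraft.com"
ALLOWED_PREFIX = "/writings/texts/"


def is_supported_url(url: str) -> bool:
    """Return True if the URL points under hplovecraft.com/writings/texts/."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    # Assumption: treat the "www." variant as the same site.
    if host.startswith("www."):
        host = host[4:]
    return (
        parsed.scheme in ("http", "https")
        and host == ALLOWED_HOST
        and parsed.path.startswith(ALLOWED_PREFIX)
    )
```

For example, `is_supported_url("https://hplovecraft.com/writings/texts/poetry/p095.aspx")` is accepted, while a URL on another host is rejected.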

Usage

usage: main.py [-h] [url]

Scrap a HPL URL and print some data.

positional arguments:
  url         the target URL to scrap

optional arguments:
  -h, --help  show this help message and exit
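
The help text above has the shape produced by Python's `argparse` module. The following sketch shows one parser definition that would yield a similar message; it is an assumption, not the project's actual `main.py`. In particular, the `None` default for a missing `url` is a guess, since this README does not say what happens when the argument is omitted.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser matching the usage line 'main.py [-h] [url]'."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Scrap a HPL URL and print some data.",
    )
    # nargs="?" makes the positional argument optional, hence "[url]" in the usage line.
    # Assumption: default to None when no URL is given.
    parser.add_argument("url", nargs="?", default=None, help="the target URL to scrap")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.url)
```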

Requirements

Python >= 3.8 (earlier versions have not been tested)

beautifulsoup4==4.11.1

pandas==1.4.3

pre-commit==2.20.0

Install

pip install -r requirements.txt

Pre-commit install (optional)

Some hooks enforce code quality (PEP compliance, a valid requirements.txt, ...). To install them, run the following command:

pre-commit install

Test

Some basic tests can be run to check the project against various URLs with the following command:

python3 -m unittest tests/test_main.py

Output example

---------PAGE---------

Page url: https://hplovecraft.com/writings/texts/poetry/p095.aspx
Title of the story: Pacifist War Song—1917
Average letter: 2091
Average word: 388
Average paragraph by chapter: 8
Letter count: 2091
Word count: 388

---------CHAPTERS---------

Chapter: I.
Letter count: 2091
Word count: 388
Number of paragraph: 8
Average letter: 108
Average word: 19
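
To make the counters in the output above concrete, here is a small sketch that computes letter, word, and paragraph counts from a list of paragraph strings. The names `TextStats` and `compute_stats` are hypothetical, and the counting rules (alphabetic characters as letters, whitespace-separated tokens as words) are assumptions; the project may count differently. The averages are left out on purpose, because this README does not state what they are averaged over.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TextStats:
    letters: int
    words: int
    paragraphs: int


def compute_stats(paragraphs: List[str]) -> TextStats:
    """Count letters, words, and non-empty paragraphs (assumed rules)."""
    # Ignore paragraphs that contain only whitespace.
    non_empty = [p for p in paragraphs if p.strip()]
    # Assumption: a "letter" is any alphabetic character.
    letters = sum(ch.isalpha() for p in non_empty for ch in p)
    # Assumption: a "word" is any whitespace-separated token.
    words = sum(len(p.split()) for p in non_empty)
    return TextStats(letters=letters, words=words, paragraphs=len(non_empty))


if __name__ == "__main__":
    stats = compute_stats(["The Old Ones were.", "", "They are!"])
    print(f"Letter count: {stats.letters}")
    print(f"Word count: {stats.words}")
    print(f"Number of paragraph: {stats.paragraphs}")
```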
