The Pendekar Laut Comic Scraper is a Python script designed to facilitate the extraction of images from web pages containing episodes of the Hong Kong comic book series "Pendekar Laut." The comic follows the adventures of the hero Pai Cheung Lang in a futuristic world beset by wars

mrdodgerx/pendekar-laut

Pendekar Laut Comic Scraper

This Python script scrapes image URLs from a website hosting episodes of the Hong Kong comic book series "Pendekar Laut" and downloads the images into folders based on the episode URLs.

Installation

  1. Clone this repository:

    git clone https://github.com/mrdodgerx/pendekar-laut.git
    
  2. Install the required Python packages:

    pip install -r requirements.txt
    

Usage

  1. Modify the MAIN_URL variable in the script to the URL of the main page of the "Pendekar Laut" comic website.

  2. Run the script:

    python main.py
    

The script will fetch all episodes from the main page, find the "Read more" link for each episode, download the images from that link, and save them into folders named after each episode.
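The link-finding and image-finding steps described above can be sketched roughly as follows. This is a minimal illustration, not the code in main.py: it uses the standard library's html.parser instead of Beautiful Soup so that it runs without extra packages, it works on HTML strings rather than live HTTP responses, and the function names (find_read_more_links, find_image_urls) and the example.com URLs are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class _TagCollector(HTMLParser):
    """Collect 'Read more' link targets and <img> sources from one HTML page."""

    def __init__(self):
        super().__init__()
        self.read_more_links = []
        self.image_urls = []
        self._current_href = None  # href of the <a> element being read, if any
        self._link_text = []       # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self._current_href = attrs["href"]
            self._link_text = []
        elif tag == "img" and attrs.get("src"):
            self.image_urls.append(attrs["src"])

    def handle_data(self, data):
        if self._current_href is not None:
            self._link_text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            text = "".join(self._link_text).strip().lower()
            if text == "read more":
                self.read_more_links.append(self._current_href)
            self._current_href = None


def find_read_more_links(html, base_url):
    """Return absolute URLs of every link whose visible text is 'Read more'."""
    collector = _TagCollector()
    collector.feed(html)
    return [urljoin(base_url, href) for href in collector.read_more_links]


def find_image_urls(html, base_url):
    """Return absolute URLs of every <img> on the page, in document order."""
    collector = _TagCollector()
    collector.feed(html)
    return [urljoin(base_url, src) for src in collector.image_urls]


# Hypothetical sample markup standing in for the real site's pages.
main_page = '<div><a href="/episode-1/">Read more</a></div><a href="/about/">About</a>'
episode_page = '<p><img src="/img/001.jpg"><img src="https://cdn.example.com/002.jpg"></p>'

print(find_read_more_links(main_page, "https://example.com/"))
print(find_image_urls(episode_page, "https://example.com/episode-1/"))
```

Resolving every href and src against the page URL with urljoin keeps relative and absolute links on the same footing, so the download step only ever sees absolute URLs.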

Dependencies

  • requests: For making HTTP requests to fetch webpage content.
  • Beautiful Soup: For parsing HTML content.
  • fake_headers: For generating fake user-agent headers to avoid bot detection.
  • urllib: For parsing URLs.
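As an illustration of the URL-parsing role that urllib plays, the sketch below derives a folder name from an episode URL and builds a destination path for one image. The naming rule (last non-empty path segment), the "downloads" root folder, the helper names, and the example.com URLs are all assumptions for this example; the actual script may name folders differently.

```python
from pathlib import Path
from urllib.parse import urlparse


def folder_name_from_url(episode_url):
    """Use the last non-empty path segment of the URL as the folder name (assumed rule)."""
    segments = [s for s in urlparse(episode_url).path.split("/") if s]
    # Fall back to a generic name when the URL has no path segments.
    return segments[-1] if segments else "episode"


def image_destination(episode_url, image_url, root="downloads"):
    """Build the local path where an image from the given episode would be saved."""
    filename = Path(urlparse(image_url).path).name
    return Path(root) / folder_name_from_url(episode_url) / filename


# Hypothetical URLs, used only to show the shape of the result.
dest = image_destination(
    "https://example.com/pendekar-laut-episode-12/",
    "https://example.com/img/001.jpg?size=full",
)
print(dest.as_posix())
```

Parsing the URL first, rather than splitting the raw string, keeps query strings and fragments out of the folder and file names.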

Synopsis

"Pendekar Laut" (Tiger Shark) is a popular Hong Kong comic book series written and illustrated by Wan Yat Leung. The story revolves around the adventures of Pai Cheung Lang, the titular character who is a skilled martial artist . Set in a fictional world inspired by Chinese martial arts and nautical themes, Pai Cheung Lang embarks on various quests and battles against formidable adversaries while seeking justice and protecting the innocent.

License

This project is licensed under the MIT License - see the LICENSE file for details.
