A media scrapper that values simplicity and performance. Download the best story under a certain tag automatically. Builded on top of BeautifulSoup4 and Requests.
- Download all media of a story (url) in the folder under the story name.
- Create a list of stories by tag (topic), download all media of stories (urls)
- Download the repository to the local path, where the media will be saved.
- Install Packages Dependencies:
- tqdm:
pip install tqdm
- fake-useragent:
pip install fake-useragent
- BeautifulSoup4:
pip install beautifulsoup4
- Requests:
pip install requests
- Open the
MediaScrapper.ipynb
file with Jupyter Notebook. - Select the media source (code block).
- Alter the url or tag information for your need.
- Run to start scrapping media.
- Due to the high volume of traffic at night for the media sources, we suggest you to run the MediaScrapper other time.
Sharing allergic (adult) contents might be against the law. This media scrapper is purely for personal academic purpose and therefore not obliged to any legal issues related with non-personal, non-academic purposes.