Skip to content

A python web scraper for fetching summary as well as raw information table details for any query from wikipedia.

License

Notifications You must be signed in to change notification settings

gauthamp10/wikiscrape

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wikipedia Information Scrape

This is a python tool which can be made effective in fetching raw table information for any specific query from any wikipedia pages.It can also be used to fetch summary of any wiki page.The final output of the script can either be saved as a csv or txt file according to the type of data that wiki page contain.

Prerequisites

python3

Dependencies

Install the following three python modules before executing wikipedia_info_scraper

  • colorama
  • pandas
  • wikipedia
pip install colorama

pip install pandas

pip install wikipedia

Usage

python main.py

📝 Please note pip installable python package will be avialable soon after further testing.

Screenshots

📌 Extracting table information

Screenshot

📌 Extracting summary of the search term

Screenshot

Output

Screenshot

Author

Gautham Prakash

My other projects:github.com/gauthamp10

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

A python web scraper for fetching summary as well as raw information table details for any query from wikipedia.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages