Python wrapper for Wikipedia
-
Updated
Jul 1, 2024 - Python
Python wrapper for Wikipedia
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
A 🤖 which provides features from Wikipedia like summary, title searches, location API etc.
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
Collects a multimodal dataset of Wikipedia articles and their images
Music tagger with GUI that parses wikipedia for information. Can also download album art and lyrics.
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
Taxonomic trees (cladograms) from Wikipedia-scraped data.
Linked Data Knowledge Base Population (KBP) framework built on top of Snorkel. The default configuration uses Wikipedia as text corpus and DBpedia as target.
Wikipedia Entities Lexicon Extractor
Scraping logos of world football clubs from wikipedia
A Wikipedia Web Scraper used to download all the text information in a .txt file.
A minimally dependent Wikimedia CLI
python3 spiderbot which scraps a given Wikipedia URL and stores the info from the article in a text file.
A Python code for a Personal Assistant which performs various tasks. Search google,Wikipedia, and gives stock prices with Oil prices .
Scrapes data from Wikipedia and generates a Graph based on inputs
Scrape soccer data from Wikipedia across various European football leagues and perform interactive data visualizations on it.
Visualizing the phenomenon "Getting to Philosophy" that clicking the first link in the main text of a Wikipedia article, and then repeating the process, will lead to the 'Philosophy' article.
Add a description, image, and links to the wikipedia-scraper topic page so that developers can more easily learn about it.
To associate your repository with the wikipedia-scraper topic, visit your repo's landing page and select "manage topics."