This project is a slightly modified version of Wikipedia-Text-Extractor. It extracts plain text from Wikipedia for about 110 game titles listed in the pageTitleList.txt file.

anyaosborne/Wikipedia-Text-Extractor-Games

Wikipedia Text Extractor for Game Titles

This project is a slightly modified version of AnKushRR/Wikipedia-Text-Extractor. The getWikiPages.py script extracts plain text from Wikipedia for about 110 game titles listed in the pageTitleList.txt file. The output contains the extracted page content for each game title. AllGameDescriptions.txt combines all extracted content into a single plain-text file, which is convenient as training data for machine learning. The getWikiGamePlayDescription.py script extracts only the gameplay section of each Wikipedia page. An example of the output is GameplayPuzzles.txt.
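The two extraction steps described above can be sketched roughly as follows. This is a minimal illustration, not the code in getWikiPages.py or getWikiGamePlayDescription.py: it assumes the MediaWiki TextExtracts API (`prop=extracts` with `explaintext`) as the source of plain text and assumes the gameplay section appears under a top-level `== Gameplay ==` heading. The function names and the User-Agent string are hypothetical.

```python
import json
import re
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"


def build_extract_url(title):
    """Build a query URL that asks for a page as plain text (assumed approach)."""
    params = {
        "action": "query",
        "prop": "extracts",
        "explaintext": 1,  # plain text instead of HTML
        "redirects": 1,    # follow redirects to the canonical page
        "format": "json",
        "titles": title,
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def parse_extract(payload):
    """Return the extract text from a decoded API response, or '' if absent."""
    pages = payload.get("query", {}).get("pages", {})
    for page in pages.values():
        return page.get("extract", "")
    return ""


def fetch_plain_text(title):
    """Download one page as plain text (requires network access)."""
    request = urllib.request.Request(
        build_extract_url(title),
        headers={"User-Agent": "wiki-text-sketch/0.1"},  # hypothetical client name
    )
    with urllib.request.urlopen(request) as response:
        return parse_extract(json.load(response))


def extract_section(text, heading="Gameplay"):
    """Return the body of a top-level '== heading ==' section, or ''.

    Subsections ('=== ... ===') are kept as part of the section body.
    """
    pattern = re.compile(
        r"^==\s*" + re.escape(heading) + r"\s*==\s*$(.*?)(?=^==[^=]|\Z)",
        re.MULTILINE | re.DOTALL,
    )
    match = pattern.search(text)
    return match.group(1).strip() if match else ""
```

Calling `extract_section(fetch_plain_text("Tetris"))` would then return only the gameplay text for that page, or an empty string if the page has no such heading.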

Requirements

How to Execute The Code

Output

  • A new file called AllGameDescriptions.txt will be created in the main directory
  • A new directory named Output will be created
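As a rough sketch of how these outputs could be produced, the snippet below reads titles from a list file, writes one text file per title into an Output directory, and joins everything into AllGameDescriptions.txt. It assumes pageTitleList.txt holds one title per line; the per-title file naming and the function names are hypothetical and may differ from what the scripts actually do.

```python
from pathlib import Path


def read_titles(list_path):
    """Read one page title per line, skipping blank lines (assumed file layout)."""
    lines = Path(list_path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


def safe_filename(title):
    """Replace characters that are awkward in file names (hypothetical scheme)."""
    cleaned = "".join(c if c.isalnum() or c in " -_()" else "_" for c in title)
    return cleaned.strip() + ".txt"


def write_outputs(pages, base_dir="."):
    """Write one file per title into Output/ plus one combined file.

    `pages` maps each title to its extracted text.
    Returns the path of the combined file.
    """
    base = Path(base_dir)
    out_dir = base / "Output"
    out_dir.mkdir(parents=True, exist_ok=True)

    combined = []
    for title, text in pages.items():
        (out_dir / safe_filename(title)).write_text(text, encoding="utf-8")
        combined.append(text)

    combined_path = base / "AllGameDescriptions.txt"
    combined_path.write_text("\n\n".join(combined), encoding="utf-8")
    return combined_path
```

Keeping the per-title files alongside the combined file makes it easy to inspect a single game's text without searching through the full corpus.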
