ala1200/WikiScraper

WikiScraper

Final project for the Python course, 2025Z.

The wiki_scraper.py script fetches raw text from Bulbapedia articles and cleans it by removing punctuation and formatting. It then counts word frequencies and saves the results to JSON files, which serve as the data source for the language analysis.
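The clean, count and save steps described above could look roughly like the sketch below. It is illustrative only and is not the code in wiki_scraper.py: the function names (clean_text, count_words, save_counts), the lowercasing step and the letters-only tokenization rule are all assumptions, and fetching the article is left out.

```python
import json
import re
from collections import Counter
from pathlib import Path


def clean_text(raw: str) -> list[str]:
    """Lowercase the text and keep only runs of letters.

    Assumption: punctuation, digits and underscores act as separators.
    """
    return re.findall(r"[^\W\d_]+", raw.lower())


def count_words(raw: str) -> dict[str, int]:
    """Return word frequencies, most frequent first."""
    counts = Counter(clean_text(raw))
    return dict(counts.most_common())


def save_counts(counts: dict[str, int], path: str) -> None:
    """Write the frequencies to a UTF-8 JSON file (hypothetical layout)."""
    Path(path).write_text(
        json.dumps(counts, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )


if __name__ == "__main__":
    sample = "Pikachu, the Electric-type Pokémon! Pikachu evolves."
    save_counts(count_words(sample), "word_counts.json")
```

Writing with ensure_ascii=False keeps non-ASCII words such as "pokémon" readable in the JSON file, which may matter when the analysis compares several languages.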

Run wiki_scraper.py without arguments for help.
Run pytest wiki_scraper_integration_test.py or pytest wiki_scraper_unit_tests to test the script.
Run all cells in language_analysis.ipynb to see the analysis of languages.
