Skip to content

Extracting all the poems from Poetry Foundation, using Selenium, ,Beautiful Soup and Multiprocessing

License

Notifications You must be signed in to change notification settings

TGDivy/WebScrapping-PoetryFoundation

Repository files navigation

WebScrapping-PoetryFoundation

Extracting all the poems from Poetry Foundation, using Selenium, Beautiful Soup and Multiprocessing in Python.

The dataset extracted contains the:

  • Poem
  • Poem's Title
  • Poet
  • Tags


  • The dataset was created with intention for Artificial Poem Generation. It could be used for various other NLP tasks like classification, and semantic analysis. I hope, that dataset is helpful!

    The prominent tags featured in this dataset are highlighted by this word cloud:


    WordCloud Tags

    About

    Extracting all the poems from Poetry Foundation, using Selenium, ,Beautiful Soup and Multiprocessing

    Topics

    Resources

    License

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published

    Languages