Skip to content

Nebula Expired Article Hunter is a marketing tool you can use to get expired content from www.archive.org A.K.A. wayback machine, you could use this kind of content to grow up your blog with evergreen information, improve your marketing campaigns without investing in writing services, or whatever you imagine is useful for.

License

Notifications You must be signed in to change notification settings

eneiromatos/NebulaExpiredArticleHunter

Repository files navigation

Nebula Expired Article Hunter

With Nebula Expired Article Hunter you can get tons of expired content that is usually no longer indexed in search engines like Google or Bing, so you can use it for your website or marketing campaigns, all by scraping expired websites for their forgotten articles.

Features

  • Low memory consumption.
  • Configurable using config.ini file:
    • Connection parameters.
    • Files and folder names.
    • Min and max words per article.
  • Verbose interface.
  • Organize the discovered articles in a friendly way.
  • You can scrape as many expired domains as you wish.

Installation and running

To install and run this project copy or clone all the files to your preferred folder and type and execute:

pip install -r requirements.txt
python main.py

It's recommended to run in a virtual environment.

Nebula Expired Article Hunter was developed under Python 3.9.0 it should be fine in any Python 3 environment.

TO DO

  • Speed up the scraping process.
  • Add proxy rotation.
  • GUI.
  • Check the articles for plagiarism.
  • Add a expired domain scraper.

About

Nebula Expired Article Hunter is a marketing tool you can use to get expired content from www.archive.org A.K.A. wayback machine, you could use this kind of content to grow up your blog with evergreen information, improve your marketing campaigns without investing in writing services, or whatever you imagine is useful for.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages