Python Script to Scrape Pastebin with Regex.
Tu5k4rr Regex update
Overall regex improvements.
The notable improvement is the IP-WEB pattern, which now looks for IP addresses with custom ports.
Latest commit 7518f26 Oct 5, 2018
PastaBean.py Regex update Oct 4, 2018
README.md Update README.md Sep 28, 2018


PastaBean

Python script to scrape Pastebin with regex. This is by no means a finished project, and I plan to improve it over time. My goal is to make PastaBean as flexible and as simple to run as I can, with minimal requirements for capturing data.

Background

I created this script to learn Python and to capture data from the popular site https://Pastebin.com.

Features

  • Scrape Pastebin at 100 queries per 60 seconds.
  • Write matches to a text file in the same directory.
  • E-mail alerts (temporarily removed). Sender and receiver credentials have to be added to the script manually (Gmail only).
  • Logging to pasta.log.
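
The scan-and-record features above can be sketched roughly as follows. This is a minimal illustration, not the code in PastaBean.py: the two patterns, the function names, and the `matches.txt` file name are assumptions made for the example, and fetching pastes from Pastebin is left out entirely. Only the `IP-WEB` label, the 100-per-60-seconds rate, and `pasta.log` come from this README.

```python
import logging
import re
from pathlib import Path

# Hypothetical patterns for illustration; the real ones live in PastaBean.py.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    # IPv4 address followed by a custom port, e.g. 10.0.0.5:8080
    "IP-WEB": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}:\d{1,5}\b"),
}

# 100 queries per 60 seconds works out to one query every 0.6 seconds.
QUERIES_PER_WINDOW = 100
WINDOW_SECONDS = 60
MIN_DELAY = WINDOW_SECONDS / QUERIES_PER_WINDOW


def get_logger(log_path="pasta.log"):
    """Return a logger that appends timestamped lines to log_path."""
    logger = logging.getLogger("pastabean_sketch")
    if not logger.handlers:
        handler = logging.FileHandler(log_path)
        handler.setFormatter(logging.Formatter("%(asctime)s %(message)s"))
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger


def scan_paste(text):
    """Return (label, matched_text) pairs for every pattern hit in text."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.findall(text):
            hits.append((label, match))
    return hits


def record_matches(paste_id, hits, out_path="matches.txt"):
    """Append one tab-separated line per hit; return the number written."""
    with Path(out_path).open("a", encoding="utf-8") as fh:
        for label, match in hits:
            fh.write(f"{paste_id}\t{label}\t{match}\n")
    return len(hits)
```

A scraping loop would call `scan_paste` on each paste body, pass any hits to `record_matches`, and sleep for `MIN_DELAY` seconds between requests to stay within the query rate.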

Requirements

Recommendations

  • Run on VPS
  • Run script as background process: python PastaBean.py &

Future Improvements

  • Improve the current regex patterns.
  • Add more regex matches.
  • Allow the script to write to a custom file path.
  • Reduce duplication in e-mail alerts.
  • Reduce status output to one line.
  • Generate a log file for each alarm to replace e-mail alerts.
  • Expand to other sites similar to Pastebin.
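
One possible way to approach the custom file path item is a command-line option. The sketch below is only a suggestion: the `--output` flag and the `matches.txt` default are hypothetical and do not exist in PastaBean.py today.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Parse command-line options; --output is a hypothetical flag."""
    parser = argparse.ArgumentParser(
        description="Scrape Pastebin and record regex matches."
    )
    parser.add_argument(
        "--output",
        type=Path,
        default=Path("matches.txt"),  # assumed default, next to the script
        help="file that matches are appended to",
    )
    return parser.parse_args(argv)
```

With this in place the script could be started as `python PastaBean.py --output /var/log/pasta/matches.txt &`, and it would fall back to the current directory when the flag is omitted.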

Contact

Feel free to contact me for any advice, ideas or queries.