PHP Scripts to analyse the results of an Amazon.de "Data Subject Access Request" / "Datenauskunft".
I tweeted the resulsts of these scripts while developing them, so have a look at this Twitter thread to get an impression of what the analysis looks like (and what it contained for me): https://threadreaderapp.com/thread/1042392922719748097.html
- Download or clone this code
- Copy over your DSAR result to
Amazon Datenauskunft/Jan Piotrowski
or better adapt the path inincludes/path.php
- Open
index.htm
on a local webserver or upload to any host with PHP
There is a mechanism to scrape titel and author information to ASINs that appear in the data:
content.php
writes all discovered ASINs intocache/asins.json
- Run
scrape_asin_from_amazon.php
to fill that array with scraped title + author information- 30 ASINs at a time
- be careful/slow or Amazon will ban/captcha you (I could scrape ~500 ASINs with no problem)
- ASINs where it can't find any data or error out are written to
cache/asinErrors.json
- Reloading the script will scrape the next set of ASINs, until errors and ASINs unfilled are identical
- Reload
content.php
to see the scraped data appear
I wrote this for myself, it works with my data.
But of course feel free to open an issue if it doesn't work for you or your find any bugs, and I will try to help you.