Extract documents list from Amazon Kindle webpage
Java
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
bin
lib
src/kindle/jhartman/pl
.gitignore
LICENSE
README.md

README.md

Kindle-Library-List

Extract documents list from Amazon Kindle webpage and save into a txt, xml and html file.

How to use:

  1. Download (or build) KindleLibrary.jar
  2. Navigate to Manage your content and devices Amazon page
  3. Switch Show to Docs Image 1
  4. Scroll down to reach end of your list (or to see Show more button) Image 2
  5. Save the html (File -> Save Page As..., using Complete Webpage). Override the default filename with an easy name, e.g. 1.
  6. If more docs pending, press Show More button on the bottom of the page and iterate to Step 4
  7. When all pages iterated, open a command line and invoke the conversion:
Jareks-MBP:Downloads jhartman$ java -jar KindleLibrary.jar 1.htm
Amazon book list extractor
Elements found:400
Saving 1.html
Saving 1.txt
Saving 1.xml
Done!

8.Convert all html files saved earlier

Example of output html and xml looks as below

Image 3 Image 4

Libraries & References

jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.