The Amazon-Fashion dataset used for clustering can be generated with the script load_dataset.py:
=> amazon-fashion-ids.json contains the product IDs and their category class labels (int).
=> Image features are stored in a binary format: each record consists of 10 characters (the product ID) followed by 4096 floats, and this layout is repeated for every product. The data is available at https://jmcauley.ucsd.edu/data/amazon/
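
The binary layout described above (a 10-character product ID followed by 4096 floats, repeated per product) could be read with something like the following minimal sketch. It is not taken from load_dataset.py. The function name read_image_features is hypothetical, and the code assumes the floats are 32-bit little-endian values and the IDs are ASCII, neither of which is stated above.

```python
import struct

ID_LENGTH = 10       # characters in a product ID, per the format description
FEATURE_DIM = 4096   # floats stored per product, per the format description
# Assumption: floats are 32-bit little-endian; adjust if the file differs.
FLOAT_FORMAT = "<%df" % FEATURE_DIM
FLOAT_BYTES = struct.calcsize(FLOAT_FORMAT)


def read_image_features(path):
    """Yield (product_id, features) pairs from a binary feature file."""
    with open(path, "rb") as f:
        while True:
            raw_id = f.read(ID_LENGTH)
            if not raw_id:
                break  # clean end of file
            payload = f.read(FLOAT_BYTES)
            if len(raw_id) != ID_LENGTH or len(payload) != FLOAT_BYTES:
                raise ValueError("truncated record in %s" % path)
            # Assumption: product IDs are plain ASCII strings.
            product_id = raw_id.decode("ascii")
            features = struct.unpack(FLOAT_FORMAT, payload)
            yield product_id, features
```

Because the function is a generator, records are read one at a time, so the whole file does not need to fit in memory. For example, `for product_id, features in read_image_features("image_features.b"): ...` would iterate over all products, where the file name is only a placeholder.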