# Objective
The purpose of this script is to clean the output of the aewscrp tool so that only specific item listings remain. For example, from the comparable items shown on the FiiO music player product listing at https://www.aliexpress.com/item/4000589596283.html, we want to keep only the items that are genuinely comparable to that player.

The sample taken from that URL contains items we are not interested in, such as cases/covers for the player, earphones, and players from other brands. We are only interested in the comparable items that are FiiO music players.

In [1]:
# Load the pandas and re (regular expression) libraries
import pandas as pd
import re

Load the `output.csv` file that was generated by the aewscrp tool for the item at: https://www.aliexpress.com/item/4000589596283.html

In [4]:
# Read the output from aewscrp
item_list = pd.read_csv("../aewscrp/output.csv")
# Print a sample. It should contain items that are not relevant to the FiiO player
item_list.head()

Unnamed: 0,Description,URL,Price
0,FiiO M3 Pro Full Touchscreen Lossless DSD HiFi...,https://www.aliexpress.com/item/4000972188106....,79.99
1,Hidizs AP80 PRO dual ESS9218P Bluetooth Portab...,https://www.aliexpress.com/item/4000924308974....,169.0
2,FiiO M15 Flagship Android Dual AK4499 Hi-Res P...,https://www.aliexpress.com/item/4000995400006....,1349.99
3,FiiO M3Pro/M3 Pro Hi-Res MP3 Player ES9218 HiF...,https://www.aliexpress.com/item/4000960723826....,79.99
4,Hidizs AP80 PRO dual ESS921 MP3 Bluetooth Musi...,https://www.aliexpress.com/item/4000916726086....,169.0
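
The `Unnamed: 0` column in the output above is what pandas typically shows when a CSV was written with its row index and then read back without `index_col`. Assuming that is what happened with `output.csv` (the aewscrp output itself does not confirm it), the sketch below shows how `index_col=0` turns that column back into the index. The CSV text and its rows are made up for illustration.

```python
import io

import pandas as pd

# Hypothetical CSV text with an unnamed leading index column (illustrative rows only)
csv_text = """,Description,URL,Price
0,Example player A,https://example.com/a,10.0
1,Example case B,https://example.com/b,2.5
"""

# Default read: the unnamed first column becomes a regular column called "Unnamed: 0"
default_read = pd.read_csv(io.StringIO(csv_text))

# Reading with index_col=0 uses that first column as the row index instead
indexed_read = pd.read_csv(io.StringIO(csv_text), index_col=0)

print(list(default_read.columns))
print(list(indexed_read.columns))
```

Either form works with the filtering that follows, since the filters only look at the `Description` column.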


Next, we filter the comparable item list down to music players only, excluding cases/covers, and then keep only the players that are FiiO branded.

In [12]:
# Keep rows whose description mentions "Player" but not "cover" or "case" (case-insensitive)
is_accessory = item_list.Description.str.contains("cover|case", case=False)
is_player = item_list.Description.str.contains("Player", case=False)
player_list = item_list[~is_accessory & is_player]
# Keep only the players that are from FiiO
fiio_items = player_list[player_list.Description.str.contains("FiiO", case=False)]
# View the output to confirm only FiiO music players are in the list
fiio_items

Unnamed: 0,Description,URL,Price
0,FiiO M3 Pro Full Touchscreen Lossless DSD HiFi...,https://www.aliexpress.com/item/4000972188106....,79.99
2,FiiO M15 Flagship Android Dual AK4499 Hi-Res P...,https://www.aliexpress.com/item/4000995400006....,1349.99
3,FiiO M3Pro/M3 Pro Hi-Res MP3 Player ES9218 HiF...,https://www.aliexpress.com/item/4000960723826....,79.99
8,FiiO M11 Pro Android Hi-Res Music Player Bluet...,https://www.aliexpress.com/item/4000465710581....,645.05
11,FiiO M11 Pro Stainless Steel limited edition H...,https://www.aliexpress.com/item/4000622421489....,783.99
14,FiiO M11 HIFI Audio Android-based Bluetooth Lo...,https://www.aliexpress.com/item/33001451741.html,427.49
16,FiiO M9 HIFI Balanced Output Portable High-Res...,https://www.aliexpress.com/item/32966863423.html,247.99
17,FiiO M6 Hi-Res Bluetooth HiFi Music Portable M...,https://www.aliexpress.com/item/32961737369.html,149.9
18,FIIO M11 PRO Samsung Exynos 7872 Android 7.0 B...,https://www.aliexpress.com/item/4000478804030....,658.63
23,FiiO M11 Pro Hi-Res Music Player AK4497EQ*2/TH...,https://www.aliexpress.com/item/4000521166245....,666.39
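
The filtering step can be tried without `output.csv` on a small in-memory table. The following is a minimal sketch, not the notebook's own code: the sample rows and the helper name `filter_brand_players` are hypothetical, and it assumes a `Description` column as in the output above. It also fills missing descriptions with an empty string, which the cell above does not do.

```python
import pandas as pd

# Hypothetical sample rows, made up for illustration (not taken from output.csv)
sample = pd.DataFrame({
    "Description": [
        "FiiO M6 Hi-Res Music Player",
        "Leather Case for FiiO M6 Music Player",
        "Hidizs AP80 PRO Music Player",
        "FIIO M11 PRO Android Player",
        "FiiO FH1s Earphones",
    ],
    "Price": [149.90, 12.50, 169.00, 658.63, 69.99],
})


def filter_brand_players(items, brand):
    """Return rows that describe a player of the given brand, excluding cases/covers."""
    # Treat missing descriptions as empty text so the masks stay boolean
    desc = items["Description"].fillna("")
    is_accessory = desc.str.contains("cover|case", case=False)
    is_player = desc.str.contains("player", case=False)
    # regex=False matches the brand as literal text rather than as a pattern
    is_brand = desc.str.contains(brand, case=False, regex=False)
    return items[~is_accessory & is_player & is_brand]


filtered = filter_brand_players(sample, "FiiO")
print(filtered)
```

Because the match is case-insensitive, both the `FiiO` and `FIIO` spellings are kept, while the case, the earphones, and the Hidizs player are dropped. The original row labels are preserved, which is why the filtered output above shows gaps in the index.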


Save the cleaned items list into `fiio_listing.csv`.

In [15]:
fiio_items.to_csv("fiio_listing.csv", index=False)
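
To check what `index=False` does when saving, the sketch below writes a small table to a temporary file and reads it back. The rows and the file name `listing_check.csv` are hypothetical and used only for illustration.

```python
import os
import tempfile

import pandas as pd

# Hypothetical filtered rows with non-consecutive row labels, as left behind by filtering
items = pd.DataFrame(
    {"Description": ["Example player A", "Example player B"], "Price": [80.0, 149.5]},
    index=[0, 3],
)

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "listing_check.csv")
    # index=False leaves the row labels (0 and 3) out of the file
    items.to_csv(path, index=False)
    reloaded = pd.read_csv(path)

print(reloaded)
```

The reloaded table gets a fresh 0-based index and only the original columns. Note that `index=False` drops the row labels only; a regular column such as `Unnamed: 0` would still be written to the file.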