How can we identify malicious hackers participating in different online platforms using their usernames only?
Hackers create brand around their online names. GeekMAN, a systematic human-inspired approach to
identify similar usernames across online platforms focusing on
technogeek platforms.
Two technogeek usernames
Similarity score
The list of username pairs to be comapared is placed in a csv file located at data/test.csv. Username pairs are assumed to be comma-separated (username_1, username_2).
Run the jupyter notebook geekman_matching.ipynb
A csv file is generated in output/sim_score.csv (username_1, username_2, sim_score)