Skip to content

Find out the duplicate links in the column of of csv file.

Notifications You must be signed in to change notification settings

bikcrum/Duplicate-Link-Finder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Duplicate link finder in csv

Find out the duplicate links in the column of of csv file.

How to use it?

Prerequisites

  1. Must have python version 3
  2. Install these python package (if not installed).
    1. pandas
    2. tldextract

Steps:

  1. Clone and go to the project directory
  2. Use this command python script.py <source_file_name> <column_index> Note: Index starts from 0
  3. Example 1: python script.py src.csv 2 Note: This mean you have src.csv file which contains link in index 2
  4. Additionally create filter.txt file which contains list of links to compare only those links

About

Find out the duplicate links in the column of of csv file.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages