
A data scraper using Python 3, scraping phone numbers from online selling websites


agcashdaum/numCollect


A Phone Number Scraper

DESCRIPTION

I had the idea to collect phone numbers and maybe sell them online, but hey, it was no use.
Anyway, I scraped over 180K Algerian phone numbers and over 16K French phone numbers.
The numbers were scraped from online selling websites (like eBay): https://ouedkniss.com for Algeria and https://vivastreet.com for France. On the Algerian website the phone numbers are served as images, so I had to download the images and extract the numbers later using image to text (OCR). The French website was very simple to deal with.
The phone numbers are stored in CSV, DB and TXT files. You'll find them in the data folder: dataDz for Algeria and dataFr for France. boutiqueDZ.json is only a later idea; it contains a list extracted from https://ouedkniss.com.
The scraping code is in the WebSite directory, and some notes and extra code are in the use directory.
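As a rough illustration of the storage step described above, here is a minimal sketch that writes one list of numbers to TXT, CSV and SQLite files using only the standard library. The function name `save_numbers`, the file stem `numbers`, and the table and column names are assumptions made for this example; they are not taken from the repository's code.

```python
import csv
import sqlite3
from pathlib import Path


def save_numbers(numbers, out_dir, stem="numbers"):
    """Write unique phone numbers to <stem>.txt, <stem>.csv and <stem>.db.

    Hypothetical helper: names and layout are illustrative only.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    unique = sorted(set(numbers))  # drop duplicates, keep a stable order

    # Plain text: one number per line.
    (out / f"{stem}.txt").write_text("\n".join(unique) + "\n", encoding="utf-8")

    # CSV with a single header column.
    with open(out / f"{stem}.csv", "w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow(["phone"])
        writer.writerows([n] for n in unique)

    # SQLite: PRIMARY KEY plus INSERT OR IGNORE makes reruns idempotent.
    conn = sqlite3.connect(str(out / f"{stem}.db"))
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "CREATE TABLE IF NOT EXISTS phones (phone TEXT PRIMARY KEY)"
            )
            conn.executemany(
                "INSERT OR IGNORE INTO phones (phone) VALUES (?)",
                [(n,) for n in unique],
            )
    finally:
        conn.close()
    return len(unique)
```

Calling `save_numbers(["0550000001", "0550000002"], "data/dataDz")` with placeholder values would create the three files under that folder. Keeping the numbers as text rather than integers preserves the leading zero.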

TOOLS

  • Language
    • Python 3.6
  • Libraries
    • BeautifulSoup (Scraping the websites)
    • Json (For storing the data)
    • Sqlite3
    • Pytesseract (For text recognition, image to text)
    • Pillow (PIL)
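
To show how Pytesseract and Pillow can fit together for the image to text step, here is a minimal sketch. It is not the repository's actual code: the function names, the character substitution table, the `--psm 7` setting (treat the image as a single text line) and the 10-digit length check are all assumptions chosen for illustration. Running `image_to_number` also requires the Tesseract binary to be installed.

```python
import re

# Letters that OCR may confuse with digits (illustrative, not exhaustive).
_OCR_FIXES = str.maketrans(
    {"O": "0", "o": "0", "l": "1", "I": "1", "S": "5", "B": "8"}
)


def clean_ocr_digits(raw, expected_len=10):
    """Return the digits found in an OCR string, or None if the count is off.

    expected_len=10 is an assumption about the number format.
    """
    digits = re.sub(r"\D", "", raw.translate(_OCR_FIXES))
    return digits if len(digits) == expected_len else None


def image_to_number(path):
    """OCR one image that shows a phone number (hypothetical helper)."""
    # Imported lazily so clean_ocr_digits works without the OCR stack.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        # Grayscale conversion often helps on small rendered text.
        text = pytesseract.image_to_string(img.convert("L"), config="--psm 7")
    return clean_ocr_digits(text)
```

Returning `None` for results with the wrong digit count makes it easy to set unreadable images aside for a second pass instead of storing a bad value.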

PS

The generated data was deleted for reasons (I uploaded it at first, but had to delete it afterwards... sorry :unamused: :unamused: :unamused:). This is for educational purposes only and is not meant for harm or any kind of trouble.

The data is public; I only used what was already available.
