Skip to content

Scraped housing rental ads on Craigslist and developed a classification model in Python to identify fake ads on the website. Built a notification system to send spam free ads to the users based on their search results

Notifications You must be signed in to change notification settings

vijayaiitkanpur/Notification-system-to-show-spam-free-ads-for-housing-rentals-on-Craigslist-

Repository files navigation

Notification-system-to-show-spam-free-ads-for-housing-rentals-on-Craigslist-

Notification system to show spam free ads for housing rentals on Craigslist

In our modelling solution, we have 3 steps:

  1. Identification of Spam Ads

a. In this first we scrape the data from craigslist. We create 2 files i. urls_city.csv - This file contain the url of the listings captured from the craigslist ii. city_data_complete.csv - This file contains the features extracted from all the listings as captured in the urls file.

b. Then, we needed to create a training data-set containing the labels of whether the ad was SPAM or not. For that, we re-checked the captured urls (after 7 days, 14 & 30 days) to validate whether the listing has been marked as spam or not. Based on that we created a final file which contained the features of the listing and added a label of SPAM with values as True or False.

c. This data was used as a training dataset for training the various machine learning models.

  1. Getting Search Criteria Data

a. In this we scraped the data from craigslist based on the keyword specified by the user.

b. Then we passed the listings to our trained model which detects the spam-ads.

c. Based on detection, it removes all the spam-ads and creates a list of spam-free ads.

  1. Notification System a. After identification of spam-free ads, we identify the most relevant ads from the legitimate listings data which has been curated.

b. We send a notification email to the user which contains the relevant spam-free ads.

c. This process is automated and the user receives an e-mail on daily basis containing the new listings of relevant spam-free ads.

About

Scraped housing rental ads on Craigslist and developed a classification model in Python to identify fake ads on the website. Built a notification system to send spam free ads to the users based on their search results

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages