manoharpavuluri/mercari-NLP

About the Dataset

The dataset comes from the Mercari Price Suggestion Challenge on Kaggle.

There are ~1.5 million observations in the dataset, each with multiple features.

About the challenge

Mercari is an online marketplace similar to eBay, based in Japan. Mercari wants a process that predicts a suitable price for a seller listing an item on its e-commerce platform.

Main features

  • item_description : free-text description of the item sold.
  • brand_name : brand of the item sold.
  • category_name : category of the item sold. Each value holds multiple subcategories under the same feature name, separated by "/".
  • shipping : shipping-fee indicator for the item sold (in the Kaggle data, 1 if the seller pays shipping and 0 if the buyer pays).
  • price : the label (target) feature, the price of the item sold.

Approach taken for analysis

The category_name values were split into their subcategories.
NLP was applied to the item description to find how its text correlates with the price.
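
The category split can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the helper name `split_category`, the three-level depth, and the `"missing"` placeholder are all assumptions made for the example.

```python
from typing import Any, Tuple

MISSING = "missing"  # hypothetical placeholder for absent or empty levels


def split_category(category_name: Any, levels: int = 3) -> Tuple[str, ...]:
    """Split a '/'-separated category string into a fixed number of levels."""
    # Missing values (None, NaN, empty string) become all-placeholder tuples.
    if not isinstance(category_name, str) or not category_name:
        return (MISSING,) * levels
    # maxsplit keeps any extra '/' characters inside the last level.
    parts = [p.strip() for p in category_name.split("/", levels - 1)]
    # Pad short values so every row yields the same number of levels.
    parts += [MISSING] * (levels - len(parts))
    return tuple(parts)


examples = ["Men/Tops/T-shirts", "Handmade", None]
for raw in examples:
    print(raw, "->", split_category(raw))
```

With pandas, the same helper could be applied per row, for example via `df["category_name"].apply(split_category)`, and the resulting tuples expanded into separate subcategory columns.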

Model used

NLP based
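
The README does not say which NLP model was used. As a purely illustrative sketch of relating description text to price, and not the repository's method, the following scores each description token by the Pearson correlation between its presence and `log1p(price)`. The function names, the log transform, and the toy rows are assumptions made for the example.

```python
import math
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def token_price_correlation(descriptions, prices):
    """Map each token to the correlation of its presence with log1p(price)."""
    log_prices = [math.log1p(p) for p in prices]
    token_sets = [tokenize(d) for d in descriptions]
    vocab = set().union(*token_sets)
    return {
        tok: pearson([1.0 if tok in s else 0.0 for s in token_sets], log_prices)
        for tok in vocab
    }


# Made-up toy rows, not taken from the Mercari data.
descriptions = [
    "brand new leather bag with tags",
    "used t-shirt small stain",
    "new leather wallet",
    "used phone case",
]
prices = [80.0, 5.0, 40.0, 4.0]

scores = token_price_correlation(descriptions, prices)
for tok in sorted(scores, key=scores.get, reverse=True)[:3]:
    print(tok, round(scores[tok], 2))
```

Tokens with strongly positive scores tend to appear in higher-priced listings and tokens with negative scores in cheaper ones. On the full dataset a sparse vectorizer and a regression model would be a more practical choice than this per-token loop.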

Visualizations
