Skip to content

A lot of new products arrive in marketplace every day and each of them must be assigned to a specific category in category tree. It takes a lot of effort and time, so we want to learn how to predict a category based on the names and parameters of products.

Notifications You must be signed in to change notification settings

KKroliKK/test-task-Kazan-Express

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Test-task KazanExpress

The specification of test task from KazanExpress is available online

My classificator is implemented as a tree of 245 small FCNNs.
Dataset is represented as a tree structure which keeps products in leaf nodes and is able to submit correctly labeld data for each of 245 branch nodes categories FCNNs.
I used two big dictionaries with word embeddings for Russian language: fasttext and wiki2vec. Each token is converted to concatenation of two embedding vectors (overall size 600).
Solution with representation of product title as averege embeddings gave hierarchical f1-score equal to 0.933.

About

A lot of new products arrive in marketplace every day and each of them must be assigned to a specific category in category tree. It takes a lot of effort and time, so we want to learn how to predict a category based on the names and parameters of products.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages