Skip to content

Tsinghua project - Machine Learning course - Paper clustering

License

Notifications You must be signed in to change notification settings

AEnguerrand/thu-ml-hw3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

thu-ml-hw3

Usage

Run all jupyter notebook cell and put all dataset on a directory "data". Result file create is "result.json"

Usage (script version)

python3 hw3.py

Description model

For doing this competition, I have use a cluster method based on sklearn tools. Cluster is based on name of co author with a pre-processing for merge firstname and lastname.

And the cluster is build based on metrics "cosine" and others parameters for have a better score (0.594).

About

Tsinghua project - Machine Learning course - Paper clustering

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published