Algorithmic Methods of Data Mining (Sc.M. in Data Science) Homework - 4 completed by group 1!

Names

Eleonora Barocco
Hafiz Muhammad Hassan
Daniele Figoli

Description

There were two mandatory tasks.

For the first one, we implemented two clusterings and compared the results. We created two datasets and filled each with the data we collected. We used KMeans++ with the Elbow Method, and then used Jaccard similarity to find the top 3 pairs of clusters. After that, we created a word cloud for each of the top 3 pairs of clusters.
The second task was to find the duplicates in the password2.txt file, which is 2.2 GB. Because of machine limitations we were not able to process the whole file, but we completed the task on a sample of the passwords.
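
As a rough illustration of the cluster comparison step, the sketch below scores every pair of clusters from two clusterings with Jaccard similarity and keeps the three best pairs. The helper names (`jaccard`, `top_cluster_pairs`) and the toy labels are assumptions made for this example only; the notebook's actual code may be organised differently.

```python
from itertools import product


def jaccard(a, b):
    """Jaccard similarity |a & b| / |a | b| of two sets (0.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def top_cluster_pairs(labels_a, labels_b, k=3):
    """Return the k most similar (cluster_a, cluster_b, score) triples.

    labels_a and labels_b hold cluster labels for the same items, in the same order.
    """
    members_a, members_b = {}, {}
    for idx, (la, lb) in enumerate(zip(labels_a, labels_b)):
        members_a.setdefault(la, set()).add(idx)
        members_b.setdefault(lb, set()).add(idx)
    # Score every cluster from the first clustering against every cluster from the second.
    scored = [
        (ca, cb, jaccard(sa, sb))
        for (ca, sa), (cb, sb) in product(members_a.items(), members_b.items())
    ]
    return sorted(scored, key=lambda t: t[2], reverse=True)[:k]


# Hypothetical labels for six items under two different clusterings.
labels_a = [0, 0, 0, 1, 1, 2]
labels_b = [1, 1, 0, 0, 0, 2]
pairs = top_cluster_pairs(labels_a, labels_b)
```

Here each cluster is represented by the set of item indices it contains, so the score only depends on which items two clusters share, not on the label numbers themselves.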
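
One possible way to count duplicates without holding every password string in memory is to stream the input and keep only a short digest per distinct line. The sketch below is an assumption about how this could be done, not a description of the method used in the notebook; `count_duplicates` and the sample data are hypothetical.

```python
import hashlib


def count_duplicates(lines):
    """Count lines whose content already appeared earlier in the stream."""
    seen = set()
    duplicates = 0
    for line in lines:
        password = line.rstrip("\n")
        # Keep a fixed-size 8-byte digest instead of the full string to save memory.
        digest = hashlib.blake2b(password.encode("utf-8"), digest_size=8).digest()
        if digest in seen:
            duplicates += 1
        else:
            seen.add(digest)
    return duplicates


# Small in-memory sample standing in for lines read from the password file.
sample = ["abc123\n", "hunter2\n", "abc123\n", "qwerty\n", "hunter2\n", "abc123\n"]
n_dupes = count_duplicates(sample)
```

Because the function accepts any iterable of strings, an open file handle can be passed in place of the list so that lines are read one at a time. With 8-byte digests, two different passwords could in rare cases collide and be counted as a duplicate, so the result should be treated as approximate on very large inputs.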

Files

All related descriptions are in the file below.

Homework 4

Note

We created some sample files to store the data after scraping or after completing each task, but anyone running the code should be able to do so using just the Homework_4.ipynb file.

Let us know what we need to improve. Thanks.
