Internship

General info

This project clusters data based on the tokens produced by preprocessing with the NLTK library. A Word2Vec model is used for word embeddings, and several clustering algorithms are then applied to find suitable clusters.
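The flow described above (tokens, then word vectors, then one vector per record) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the notebook's actual code: the regex tokenizer stands in for NLTK, the `TOY_VECTORS` table stands in for a trained Word2Vec model, and averaging word vectors into a record vector is an assumed (though common) choice that the project may or may not use.

```python
import re
from typing import Dict, List

# Hypothetical 3-dimensional embeddings; a trained Word2Vec model would supply these.
TOY_VECTORS: Dict[str, List[float]] = {
    "payment": [0.9, 0.1, 0.0],
    "transfer": [0.8, 0.2, 0.1],
    "salary": [0.1, 0.9, 0.2],
    "bonus": [0.2, 0.8, 0.1],
}


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep alphabetic runs (a stand-in for NLTK tokenization)."""
    return re.findall(r"[a-z]+", text.lower())


def document_vector(tokens: List[str], vectors: Dict[str, List[float]], dim: int = 3) -> List[float]:
    """Average the vectors of known tokens; return a zero vector if none are known."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


tokens = tokenize("Payment transfer, ref 123")
print(tokens)
print(document_vector(tokens, TOY_VECTORS))
```

The resulting fixed-length vectors are what a clustering algorithm would receive as input.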

Technologies

The whole project is developed in the notebook transac-nar-new.ipynb.

Requirements

The packages required to run this project are listed in [requirements.txt](https://github.com/Asif-droid/Internship/blob/main/requirements.txt).

Used models

K-Means
Mini-batch K-Means
Bisecting K-Means
DBSCAN
Hierarchical clustering

How to run

Download or clone the repo.
Open it on your local machine.
Install the requirements: pip install -r requirements.txt
Open the test_script.py file.
Set the locations of the dataset and the trained model.
Optionally adjust the values for hierarchical clustering and DBSCAN (defaults are mx_d=1.5 for hierarchical clustering and eps=.55, min_samples=1 for DBSCAN).
DBSCAN and hierarchical clustering do not need a pretrained model to cluster the data; they generate clusters directly from the given dataset.
Run the file.
For more clarification, see transac-nar-new.ipynb.
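To give a feel for the default values mentioned in the steps above, here is a small sketch on toy data. It assumes scikit-learn's `DBSCAN` and `AgglomerativeClustering`; mapping `mx_d` to `distance_threshold` is a guess for illustration, and test_script.py may implement hierarchical clustering differently.

```python
import numpy as np
from sklearn.cluster import DBSCAN, AgglomerativeClustering

# Toy points: two tight groups plus one isolated point (stand-ins for record vectors).
X = np.array([
    [0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
    [5.0, 5.0], [5.1, 5.2], [5.2, 5.1],
    [10.0, 0.0],
])

# DBSCAN with the defaults quoted above. With min_samples=1 every point is a core
# point, so the isolated point forms its own cluster instead of being noise (-1).
db_labels = DBSCAN(eps=0.55, min_samples=1).fit_predict(X)

# Hierarchical clustering cut at a distance of 1.5 (assumed meaning of mx_d).
# n_clusters must be None when distance_threshold is set.
hc_labels = AgglomerativeClustering(n_clusters=None, distance_threshold=1.5).fit_predict(X)

print("DBSCAN:", db_labels)
print("Hierarchical:", hc_labels)
```

Neither call takes a cluster count or a saved model: the number of clusters follows from the distance settings and the data, which matches the note that these two methods work directly on the given dataset.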
