OpenRefine is a free, open source power tool for working with messy data and improving it
-
Updated
Jun 18, 2024 - Java
OpenRefine is a free, open source power tool for working with messy data and improving it
Given transaction data, interesting buying patterns are found out using Apriori and FP-Growth algorithm.
Clustering is a data analysis technique that groups similar objects together. It identifies patterns and hidden structures in the data, enabling the discovery of relationships and segmentation of data into homogeneous clusters.
This section covers (some of the important) lessons and grades I have taken at the university and educational programs.
Code for the paper "SPEck: Mining Statistically-significant Sequential Patterns Efficiently with Exact Sampling", by Steedman Jenkins, Stefan Walzer-Goldfeld, and Matteo Riondato, appearing in the Data Mining and Knowledge Discovery Special Issue for ECML PKDD'22.
An android application to clusters employees according to their level of corruption.
Linearly estimates the time left when locally queuing to join a New World (Game) server.
k-means clustering algorithm
A Java Implementation of Latent Dirichlet Allocation (LDA) using Gibbs Sampling for Parameter Estimation and Inference
Demo of how to use data mining (in this case, the algorithm) knn to predict and classify situations
This was the DataMining course project which was held at Shahid Beheshti University. Special thanks to Mr. Ashkan Zare for his contribution to this project.
comparative study of data mining techniques in health care for heart disease
Implements the DMI imputation algorithm for imputing missing values in a dataset from Rahman, M. G., and Islam, M. Z. (2013): Missing Value Imputation Using Decision Trees and Decision Forests by Splitting and Merging Records: Two Novel Techniques
Class implementing GenClust++ clustering algorithm.
A Java program to check Plagiarisms between multiple documents using the method of Shingling, MinHashing and Locality Sensitive Hashing.
Event-Radar: Real-time Local Event Detection System for Geo-Tagged Tweet Streams
Source code for BrandFeeling back end project, using parallel programming, text mining and sentimental analsys of Social Network Data.
Add a description, image, and links to the datamining topic page so that developers can more easily learn about it.
To associate your repository with the datamining topic, visit your repo's landing page and select "manage topics."