preprocessing

Implements the DMI imputation algorithm for imputing missing values in a dataset from Rahman, M. G., and Islam, M. Z. (2013): Missing Value Imputation Using Decision Trees and Decision Forests by Splitting and Merging Records: Two Novel Techniques

java data data-mining analysis mining weka imputation data-analysis preprocessing data-cleaning datamining data-cleansing missing-values missing-value-imputation

Updated Aug 22, 2020
Java

Krzmbrzl / ArmaFiles

Star

A collection of utlities for dealing with Arma files in Java

arma arma3 preprocessing pbo

Updated Jan 2, 2019
Java

grahman20 / LFD

Star

LFD is a data-driven discretization technique that does not require any user input. LFD uses low frequency values as cut points and thus reduces the information loss due to discretization. It uses all other categorical attributes and any numerical attribute that has already been categorized.

java data-science machine-learning data-mining analytics attributes classification data-analysis preprocessing features variables discretization data-cleaning numerical categorization discretization-algorithm

Updated Mar 25, 2023
Java

fracpete / dataset-weights-weka-package

Star

Weka package with filters that allow modifying attribute/instance weights.

plugin java machine-learning weka filters preprocessing

Updated Oct 12, 2020
Java

kariminf / langpi

Star

Language processing interface: some tools to process different natural languages

natural-language-processing wordnet segmentation preprocessing tokenization stemming text-segmentation word-tokenizing

Updated Jul 28, 2017
Java

fracpete / mxexpression-weka-package

Star

Weka package using the mXparser library.

java weka preprocessing

Updated Oct 12, 2020
Java

supertom01 / BabyANTLR

Star

A parser written for the BabyCobol language, using the ANTLR framework. This repository is part of my bachelor thesis.

parsing antlr preprocessing antlr4-grammar babycobol

Updated Feb 20, 2023
Java

fracpete / data-dumper-weka-package

Star

Weka package that allows listening in on data as it passes through filter pipelines.

java package weka preprocessing

Updated Oct 12, 2020
Java

grahman20 / SiMI

Star

SiMI imputes numerical and categorical missing values by making an educated guess based on records that are similar to the record having a missing value. Using the similarity and correlations, missing values are then imputed. To achieve a higher quality of imputation some segments are merged together using a novel approach.

data-science linear-regression dataset missing-data preprocessing data-cleaning decision-tree decision-tree-classifier missing-values decision-forest decision-forest-algorithm missing-value-handling missing-data-imputation missing-value-imputation numerical-missing-value categorical-missing-value

Updated Mar 24, 2023
Java

Improve this page

Add a description, image, and links to the preprocessing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the preprocessing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

preprocessing

Here are 32 public repositories matching this topic...

igoortc / tweets-preprocessor

IrisShaders / glsl-preprocessor

ikegami-yukino / neologdn-java

fracpete / missing-values-imputation-weka-package

raydac / jcp-ai

zjxeditor / AndroidCamera

zenwor / table_editor

fracpete / snowball-stemmers-weka-package

grahman20 / CAIRAD

fracpete / ptstemmer-weka-package

fracpete / nlp-weka-package

zislam / DMI

Krzmbrzl / ArmaFiles

grahman20 / LFD

fracpete / dataset-weights-weka-package

kariminf / langpi

fracpete / mxexpression-weka-package

supertom01 / BabyANTLR

fracpete / data-dumper-weka-package

grahman20 / SiMI

Improve this page

Add this topic to your repo