OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
Updated Jul 15, 2024 - C++
Desbordante is a high-performance data profiler that discovers many different patterns in data using a variety of algorithms. It can also run data-cleaning scenarios built on these algorithms. Desbordante has a console version and an easy-to-use web application.
Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators that can be used in both C++ and Java runtimes.
C++ implementation of oral cancer detection on CT images
EBIC - AI-based parallel biclustering algorithm
Generates a balanced uint64 hash for a string. Widely used to generate feature IDs in machine learning.
FEATure HashER
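For readers unfamiliar with the idea behind the two hashing entries above, the following is a minimal sketch of mapping a feature string to a uint64 ID. It uses the standard 64-bit FNV-1a hash purely as a stand-in; it is not taken from either repository, and the names `fnv1a64`, `feature_id`, and `num_buckets` are hypothetical.

```cpp
#include <cstdint>
#include <string>

// 64-bit FNV-1a: XOR each byte into the state, then multiply by the FNV prime.
// Assumption: used here only as an illustrative hash, not the listed repos' algorithm.
inline std::uint64_t fnv1a64(const std::string& s) {
    std::uint64_t h = 0xcbf29ce484222325ULL;  // FNV-1a 64-bit offset basis
    for (unsigned char c : s) {
        h ^= c;
        h *= 0x100000001b3ULL;                // FNV 64-bit prime
    }
    return h;
}

// Hypothetical helper: fold the hash into a fixed number of feature slots.
// num_buckets must be non-zero.
inline std::uint64_t feature_id(const std::string& name, std::uint64_t num_buckets) {
    return fnv1a64(name) % num_buckets;
}
```

A call such as `feature_id("user_id=42", 1024)` always returns the same slot in the range [0, 1024) for the same input string, which is the property that lets training and inference agree on feature IDs without a shared dictionary.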
fast and comprehensive k-mer counting package
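As a rough illustration of what k-mer counting means, the sketch below tallies every length-k substring of a sequence with a hash map. It is a naive version for exposition only and makes no claim about how the package above is implemented; `count_kmers` is a hypothetical name.

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>

// Count each overlapping substring of length k in seq.
// Returns an empty map when k is zero or longer than the sequence.
inline std::unordered_map<std::string, std::size_t>
count_kmers(const std::string& seq, std::size_t k) {
    std::unordered_map<std::string, std::size_t> counts;
    if (k == 0 || seq.size() < k) {
        return counts;
    }
    // Slide a window of width k across the sequence, one position at a time.
    for (std::size_t i = 0; i + k <= seq.size(); ++i) {
        ++counts[seq.substr(i, k)];
    }
    return counts;
}
```

For the sequence `ACGTACGT` with k = 4, the windows are ACGT, CGTA, GTAC, TACG, and ACGT, so ACGT is counted twice and the others once. Production tools typically avoid per-window string copies, for example by packing bases into integers, which this sketch does not attempt.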
An App To Design Your Next State Of The Art Machine Learning Pipeline In A Single Place.
minimal workflow engine for data processing (POC)
Contains the code for Extended Histogram of Gradients for object recognition, which I developed during my PhD studies.