ikodaCsvLibsvmCreator collects data into a simple dataframe.
It saves the output to file in CSV or LIBSVM format. It can also divide the data into a CSV and a LIBSVM component.
In addition, ikodaCsvLibsvmCreator streams the data to a Spark cluster.
- Collects data dynamically. New columns are created on the fly.
- Maintains a UID for each column.
- Maintains a category or label for each row.
- Maintains human-readable text names for columns and labels in sparse (LIBSVM) data.
- Runs thread-safe instances for the synchronous collection of distinct datasets.
- Saves data as CSV or LIBSVM (either appending or overwriting).
- Divides the data for saving into LIBSVM and CSV components.
- Opens data in CSV or LIBSVM format.
- Merges CSV files.
- Merges LIBSVM files.
- Streams data to a Spark cluster.
- Emails the data in CSV format.
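
To make the relationship between dynamic columns, row labels, and the two output formats concrete, here is a minimal Python sketch. It is not the ikodaCsvLibsvmCreator API: the class `SparseCollector` and its methods are hypothetical names invented for illustration, and the sketch assumes that columns are indexed in order of first appearance and that labels are numbered from 0.0. It shows how a column-name map lets the same collected rows be written either as dense CSV or as sparse LIBSVM lines.

```python
import csv
import io


class SparseCollector:
    """Hypothetical illustration only; not the ikodaCsvLibsvmCreator API."""

    def __init__(self):
        self.columns = {}  # column name -> 1-based LIBSVM feature index
        self.labels = {}   # label name -> numeric label
        self.rows = []     # list of (label name, {feature index: value})

    def add_row(self, label, cells):
        # Unseen labels and columns are registered on the fly.
        self.labels.setdefault(label, float(len(self.labels)))
        row = {}
        for name, value in cells.items():
            idx = self.columns.setdefault(name, len(self.columns) + 1)
            if value != 0:  # sparse storage: zero values are not kept
                row[idx] = value
        self.rows.append((label, row))

    def to_libsvm(self):
        # One line per row: "<numeric label> <index>:<value> ...", indices ascending.
        lines = []
        for label, row in self.rows:
            feats = " ".join(f"{i}:{v}" for i, v in sorted(row.items()))
            lines.append(f"{self.labels[label]} {feats}".rstrip())
        return "\n".join(lines)

    def to_csv(self):
        # Dense output with human-readable header; missing cells become 0.
        out = io.StringIO()
        names = list(self.columns)  # insertion order matches index order
        writer = csv.writer(out, lineterminator="\n")
        writer.writerow(["label"] + names)
        for label, row in self.rows:
            writer.writerow([label] + [row.get(self.columns[n], 0) for n in names])
        return out.getvalue()


collector = SparseCollector()
collector.add_row("spam", {"free": 2, "offer": 1})
collector.add_row("ham", {"meeting": 1, "free": 1})  # "meeting" is a new column

print(collector.to_libsvm())
print(collector.to_csv())
```

Because LIBSVM lines carry only numeric indices and labels, the `columns` and `labels` maps are what preserve the human-readable names; a real tool would need to persist them alongside the LIBSVM file.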