Skip to content
FinBins project
Scala
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
project
src
target/scala-2.10
README.md
build.sbt

README.md

finbins- the FinBins project's repo.

This repo contains a number of files that read data from tree datasets provided in csv format, i.e. FSS, IDBR and FCA register. Then it links and cleans the data to do an exhaustive feature selection search by training/evaluating a Random Forest (RF) classifier on each feature combination. Results are saved to the log file. Also a confusion matrix for each is created in each combination.

Main files are:

  1. Ingestion - read the data from csv and enforces the schema
  2. LinkFCA_IDBR, LinkFSS_IDBR - links the FCA and FSS datasets to IDBR
  3. SIC_RF, SIC_RF_NORM, SIC_RF_RATIOS - trains and evaluates a RF classifier using a combination of features
You can’t perform that action at this time.