Is your feature request related to a problem? Please describe.
The frequencies function has its numerical fields as integers. This means we don't see accurate percentages.
ghci> df <- D.readCsv "./data/housing.csv"
ghci> D.frequencies "ocean_proximity" df
----------------------------------------------------------------------------
index | Statistic | <1H OCEAN | INLAND | ISLAND | NEAR BAY | NEAR OCEAN
------|----------------|-----------|--------|--------|----------|-----------
Int | Text | Int | Int | Int | Int | Int
------|----------------|-----------|--------|--------|----------|-----------
0 | Count | 9136 | 6551 | 5 | 2290 | 2658
1 | Percentage (%) | 44 | 31 | 0 | 11 | 12
In this example, island is clearly not 0%.
Describe the solution you'd like
We should change these to doubles.
Is your feature request related to a problem? Please describe.
The frequencies function has its numerical fields as integers. This means we don't see accurate percentages.
In this example, island is clearly not 0%.
Describe the solution you'd like
We should change these to doubles.