Returns a DataFrame with general statistics for all ValueColumns.

describe [ columns ]

ColumnGroup and FrameColumns are traversed recursively down to ValueColumns.
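The recursive traversal described above can be pictured with a small standalone sketch. The types `Col`, `ValueCol`, and `GroupCol` and the function `leafPaths` are hypothetical names made up for illustration. They are not the library's classes, and the sketch models only nested groups, assuming frame columns would be descended into the same way.

```kotlin
// Hypothetical model of a column hierarchy; these are NOT the library's types.
sealed interface Col { val name: String }
data class ValueCol(override val name: String) : Col
data class GroupCol(override val name: String, val children: List<Col>) : Col

// Walk the hierarchy depth first and collect the path of every leaf value column.
fun leafPaths(cols: List<Col>, prefix: List<String> = emptyList()): List<List<String>> =
    cols.flatMap { col ->
        when (col) {
            is ValueCol -> listOf(prefix + col.name)
            is GroupCol -> leafPaths(col.children, prefix + col.name)
        }
    }

fun main() {
    val columns = listOf(
        ValueCol("age"),
        GroupCol("name", listOf(ValueCol("firstName"), ValueCol("lastName"))),
    )
    // Each printed line corresponds to one row that a describe-like summary would produce.
    leafPaths(columns).forEach { println(it.joinToString("/")) }
}
```

In this sketch each leaf yields one path, which corresponds to the `path` statistic reported for hierarchical DataFrames.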

Collected statistics:

  • name — column name
  • path — path to the column (for hierarchical DataFrame)
  • type — type of values
  • count — number of rows
  • unique — number of unique values
  • nulls — number of null values
  • top — the most common non-null value
  • freq — top value frequency
  • mean — mean value (for numeric columns)
  • std — standard deviation (for numeric columns)
  • min — minimum value (for comparable columns)
  • median — median value (for comparable columns)
  • max — maximum value (for comparable columns)
df.describe()
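To make the listed statistics concrete, here is a minimal standalone sketch that computes most of them for a single nullable `Int` column using only the Kotlin standard library. `ColumnSummary` and `summarize` are hypothetical names, not part of the library. The sketch also makes assumptions the text above does not confirm: `unique` counts null as a distinct value, `std` uses the sample (n - 1) formula, and the median of an even number of values is the average of the two middle ones.

```kotlin
import kotlin.math.sqrt

// Hypothetical holder for per-column statistics; not a library type.
data class ColumnSummary(
    val count: Int,
    val unique: Int,
    val nulls: Int,
    val top: Int?,
    val freq: Int,
    val mean: Double,
    val std: Double,
    val min: Int?,
    val median: Double?,
    val max: Int?,
)

fun summarize(values: List<Int?>): ColumnSummary {
    val present = values.filterNotNull()
    val sorted = present.sorted()
    // Most common non-null value together with its number of occurrences.
    val topEntry = present.groupingBy { it }.eachCount().maxByOrNull { it.value }
    val mean = present.average() // NaN when there are no non-null values
    // Sample standard deviation (n - 1 denominator); an assumption of this sketch.
    val std = if (present.size < 2) Double.NaN
    else sqrt(present.sumOf { (it - mean) * (it - mean) } / (present.size - 1))
    // Median: middle value, or the average of the two middle values (assumption).
    val median = when {
        sorted.isEmpty() -> null
        sorted.size % 2 == 1 -> sorted[sorted.size / 2].toDouble()
        else -> (sorted[sorted.size / 2 - 1] + sorted[sorted.size / 2]) / 2.0
    }
    return ColumnSummary(
        count = values.size,
        unique = values.distinct().size, // null counted as a distinct value (assumption)
        nulls = values.size - present.size,
        top = topEntry?.key,
        freq = topEntry?.value ?: 0,
        mean = mean,
        std = std,
        min = sorted.firstOrNull(),
        median = median,
        max = sorted.lastOrNull(),
    )
}

fun main() {
    val ages: List<Int?> = listOf(15, 20, 20, null, 45)
    println(summarize(ages))
}
```

The library's own results may differ where these assumptions do not hold, so treat the sketch as a reading aid for the column meanings, not as a reference for exact values.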

To describe only specific columns, pass them as an argument:

// Extension properties API
df.describe { age and name.allCols() }

// Column accessors API
val age by column<Int>()
val name by columnGroup()

df.describe { age and name.allCols() }

// String API
df.describe { "age" and "name".allCols() }