Skip to content

Latest commit

 

History

History
183 lines (175 loc) · 11.4 KB

how-to-questions-survey.md

File metadata and controls

183 lines (175 loc) · 11.4 KB

How-to questions survey

What are the most-asked about topics?

Frequency Rough categories
20 Pandas
10 persistence, including HDF5 (4) and Parquet (2)
8 Lorentz vectors and TVector3
8 reducing
7 jagged arrays
6 ak.fillna and ak.rpad (regularizing)
6 ak.concatenate and stack
5 strings
5 getitem
4 Numba
4 Histogramming (flattening before)
3 lazy arrays
3 ak.cross
2 Arrow
2 ak.with_field
2 ak.where and np.choose
2 ak.num (including axis=1)
1 SQL-like join
1 reducer-like
1 np.digitize
1 non-ufuncs
1 nested records
1 named axis
1 masking
1 Functional interface
1 Content vs flatten
1 Changing leaf (numeric) types
1 Caching
1 Broadcasting
1 ak.zip
1 ak.sort
1 ak.isnan

Categories of categories

  • Data into other systems: Pandas (20), NumPy/regularizing (6), Histogramming (4), Arrow (2)
  • Persistence (10), laziness (3)
  • Physics objects/vectors (8)
  • Restructuring: reducing (8), concatenate (6), getitem (5), cross (3), num (2), with_field (2), where (2), zip/sort (2), digitize (1)
  • Types: Jagged arrays (7), strings (5)
  • Fast code: Numba (4)
  • Lazy arrays (3)
  • SQL-like join (1)
  • All the rest

References

Bugs in Awkward structures