Polars is an alternative to pandas that is designed to have better performance.
patient_data.ipynb
: Jupyter notebook that explores functional differences between pandas and polars. It replicates the notebook in thepandas
directory with the same name.polars_large_data_benchmark.ipynb
: Jupyter notebook that compares the performance of polars and pandas on large data sets.create_csv_data.py
: Python script to generate one or more large CSV files for benchmarking.create_csv_data.slurm
: Slurm script to runcreate_csv_data.py
on a cluster.polars_performance.ipynb
: Jupyter notebook that compares the performance of polars and pandas on a variety of operations.data
: Directory containing the data used in the notebook.