Skip to content

Nikhil1912/CSC510-HW_37

Repository files navigation

CSC510 Software Engineering Group 37 Presents: Column Summaries

DOI GitHub issues Repo Size Unit tests codecov

Objective

The objective of this repository is to replicate, in python, this code, which was originally written in Lua.

This code reads in a .csv file and generates summaries of the columns. For numeric (Num) columns, this is median (the middle value of a sorted list of numbers thus far seen) and standard deviation (a measure of the spread of numbers); note that Num is a reservoir sampler which keeps only a finite quantity of numbers. For symbolic (Sym) columns, this is mode (the most common symbol) and entropy (the effort required to recreate a signal).

For an example .csv in the required format, check out \data\auto93.csv

Running

  • To install necessary packages, run pip install -r requirements.txt
  • To run the program, navigate to \CSC510-HW_37 and run python code\Csv.py
  • Tests are contained in \CSC510-HW_37\code\Tests.py

Example output is shown below:

hw_output_csv

hw_output_csv_2

Testing

The output of running \CSC510-HW_37\code\Tests.py is shown below:

hw_test

Contributors

  • M M Abid Naziri
  • Nikhil Mehra
  • Bella Samuelsson
  • Parth Katlana
  • Heidi Reichert

About

No description, website, or topics provided.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published