GitHub - stevekochscience/DataMungingExample: An example of short python script for real-purpose data munging

CircStatsYear.py is an example of a data munging script I've used recently at work. Previously, someone was processing exported text files from an old library circulation system (Millennium) and creating an Excel spreadsheet summary by manually searching the text files and then copying by hand. This was tedious and subject to error. I was unable to connect to the Millennium database programmatically. The end product also needed to be viewable in Excel. So, the script I created reads the exported text files and creates a new CSV file that will open in Excel and look the same as the previous Excel summaries.

The exported millennium files have been renamed to "2013_01.txt" etc. (01 = January)
These files should be in the directory named in fnambase
The year should be specified in the YEAR string
The output file will be based on year, e.g. "2013.csv"

I have included the input files for 12 months of 2013. I have also included what should be the output CSV file as 2013_ex.csv. If the code runs correctly, the output file, 2013.csv should be identical.

To run the code, type python CircStatsYear.py

I have included this code as an example for the 2014 April "data munging" session for the ABQ Python Meetup. I didn't add any extra commenting or try to make the code efficient or anything--so it should have lots of deficiencies and room for criticism! But it's a real example of something that works (for now!) and saves people time at the Library. :)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
2013.csv.w.windows		2013.csv.w.windows
2013.csv.wb.windows		2013.csv.wb.windows
2013.csv.wb.windows.nocrlf		2013.csv.wb.windows.nocrlf
2013.csv.wplus.windows		2013.csv.wplus.windows
2013_01.txt		2013_01.txt
2013_02.txt		2013_02.txt
2013_03.txt		2013_03.txt
2013_04.txt		2013_04.txt
2013_05.txt		2013_05.txt
2013_06.txt		2013_06.txt
2013_07.txt		2013_07.txt
2013_08.txt		2013_08.txt
2013_09.txt		2013_09.txt
2013_10.txt		2013_10.txt
2013_11.txt		2013_11.txt
2013_12.txt		2013_12.txt
2013_ex.csv		2013_ex.csv
CircStatsYear.py		CircStatsYear.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages