Dataport Pecan Street dataset

The provided scripts download the Dataport Pecan Street dataset and run an exploratory analysis on it. The data is provided by Dataport and Pecan Street Inc., and a key to access the data must be obtained from https://dataport.cloud/.

Python setup

To run the scripts successfully, it is recommended to use Anaconda to recreate the environment, as there have been issues with different versions of the DB drivers.

See here for more information about Anaconda environments.

Run the following command after installing Anaconda:

conda env create -f environment.yml

Then use either of the following commands to activate the environment:

On Windows:

activate pecanstreet

On Linux:

source activate pecanstreet

Then, run the following scripts within the environment.

download_pecanstreet.py

The script is currently set up to download 2 years of data (2015-2016) for Boulder and Austin (the largest cities with available data), along with the associated weather data. It fills gaps of up to 3 hours by linear interpolation, and gaps longer than 3 hours and up to a day with the values from 24 hours earlier. If N/A values still remain, the ID is dropped from the final results (resulting in 200 households for Austin and 20 households for Boulder).
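The gap-filling rules described above can be sketched with pandas as follows. This is a minimal illustration, not the code from download_pecanstreet.py: the function names (`gap_lengths`, `fill_gaps`, `fill_and_drop`), the 15-minute sampling assumption, and the layout (one column per household, regular datetime index) are assumptions made for the example.

```python
import numpy as np
import pandas as pd


def gap_lengths(series: pd.Series) -> pd.Series:
    """For each sample, length of the NaN run it belongs to (0 for valid samples)."""
    is_na = series.isna()
    # A new run starts whenever the NaN flag changes.
    run_id = (is_na != is_na.shift()).cumsum()
    return is_na.astype(int).groupby(run_id).transform("sum")


def fill_gaps(series: pd.Series, steps_per_hour: int = 4) -> pd.Series:
    """Fill short gaps linearly and day-scale gaps with the value 24 h earlier.

    Assumes a regularly spaced index (steps_per_hour samples per hour).
    """
    short = 3 * steps_per_hour   # gaps of up to 3 hours
    day = 24 * steps_per_hour    # gaps of up to 1 day
    lengths = gap_lengths(series)
    out = series.copy()

    # Interpolate everything, then keep the result only inside short gaps.
    interpolated = series.interpolate(method="linear", limit_area="inside")
    short_gap = (lengths > 0) & (lengths <= short)
    out[short_gap] = interpolated[short_gap]

    # Remaining gaps of up to a day take the value from 24 hours before.
    day_gap = out.isna() & (lengths <= day)
    out[day_gap] = out.shift(day)[day_gap]
    return out


def fill_and_drop(load: pd.DataFrame) -> pd.DataFrame:
    """Fill gaps per household and drop households that still contain NaN."""
    filled = load.apply(fill_gaps)
    return filled.dropna(axis="columns", how="any")
```

Computing the run length of each gap first avoids a common pitfall: `interpolate(limit=...)` on its own would also partially fill the first samples of longer gaps.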

The resulting files are written in *.csv format.

The datetime format is YYYY-MM-DD hh:mm:ss±hh:mm, where the part "±hh:mm" is the offset from UTC; the timestamp itself is therefore in local time.
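As an illustration of this timestamp format, the following sketch parses one value with the Python standard library and converts it to UTC. The sample timestamp is made up for the example and is not taken from the dataset.

```python
from datetime import datetime, timezone

# Hypothetical sample value in the YYYY-MM-DD hh:mm:ss±hh:mm format.
raw = "2015-07-01 14:30:00-05:00"

# fromisoformat keeps the UTC offset, so the result is timezone-aware local time.
local = datetime.fromisoformat(raw)

# Convert to UTC when timestamps from different cities need to be compared.
utc = local.astimezone(timezone.utc)

print(local.isoformat(sep=" "))
print(utc.isoformat(sep=" "))
```

When loading a whole file with pandas, `pd.to_datetime(column, utc=True)` is one way to handle offsets that change with daylight saving time.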

Weather

Weather data is output per city and contains the following fields:

temperature (in °C)
apparent_temperature (in °C)
dew_point
humidity
visibility
pressure
wind_speed
cloud_cover
wind_bearing
precip_intensity
precip_probability

Load

One load file is output for each configured aggregation level. For single households, one column is created per household, and the id is the one used by the Pecan Street database (see table university.metadata for the properties of the households). Aggregations are created pseudo-randomly (with a seed) and are given an increasing id. Load is given as average kW per interval. The files are created for 15-min and hourly intervals.
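A seeded aggregation and the conversion from 15-min to hourly averages could look roughly like the sketch below. The grouping scheme (shuffle the household ids, cut them into equally sized groups, and sum each group) and the names `aggregate_households` and `to_hourly` are assumptions for illustration; the script may build its aggregations differently.

```python
import numpy as np
import pandas as pd


def aggregate_households(load: pd.DataFrame, group_size: int, seed: int = 0) -> pd.DataFrame:
    """Sum pseudo-randomly chosen groups of households (hypothetical scheme).

    load: one column per household, average kW per interval.
    Returns one column per group, labelled with an increasing id.
    """
    rng = np.random.default_rng(seed)          # fixed seed gives reproducible groups
    ids = rng.permutation(load.columns.to_numpy())
    n_groups = len(ids) // group_size          # leftover households are ignored
    groups = {
        i: load[list(ids[i * group_size:(i + 1) * group_size])].sum(axis=1)
        for i in range(n_groups)
    }
    return pd.DataFrame(groups, index=load.index)


def to_hourly(load_15min: pd.DataFrame) -> pd.DataFrame:
    """Average kW per hour is the mean of the four 15-min averages."""
    return load_15min.resample("60min").mean()
```

Because load is stored as average power rather than energy, the hourly value is a mean of the 15-min values, not a sum.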

analyze_pecan_data.py

For each created load file, the script creates the following plots:

Autocorrelation plot (ACF)
Scatter plot showing the correlation of temperature and load
Trace of a week
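The quantities behind the first two plots can be computed with pandas alone, as in the sketch below. The helper names `acf` and `temperature_load_correlation` are hypothetical and are not taken from analyze_pecan_data.py, which may use a different library for its plots.

```python
import numpy as np
import pandas as pd


def acf(series: pd.Series, max_lag: int) -> pd.Series:
    """Autocorrelation of a load series for lags 1..max_lag (in samples)."""
    return pd.Series({lag: series.autocorr(lag) for lag in range(1, max_lag + 1)})


def temperature_load_correlation(load: pd.Series, temperature: pd.Series) -> float:
    """Pearson correlation between load and temperature on their shared index."""
    return load.corr(temperature)


# Synthetic hourly example with a daily cycle, used only to show the calls.
hours = np.arange(240)
demo_load = pd.Series(np.sin(2 * np.pi * hours / 24))
demo_acf = acf(demo_load, max_lag=24)
print(demo_acf.loc[[12, 24]])
```

For hourly load data, a peak in the autocorrelation at lag 24 indicates a daily pattern.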
