This repository has been archived by the owner on Aug 21, 2020. It is now read-only.

jakeYeager/public-quakeworks-data


The Flux Data

Earthquake event data & random number data

A verification resource created for the research described in Quake Works The Flux. If you're looking for the GitHub data repo for this project, welcome! If you're not sure what this is, go here first.

Repository Layout

It is pretty intuitive but here is a little map so you don't have to click around too much 👍

\
---- \ quakes
|    |- quakes_1940-2018.csv
|    \- readme.md
|
---- \randoms
|    |----\ excel
|    |    |- erb-set1.csv
|    |    |- erb-set2.csv
|    |    |- erb-set3.csv
|    |    |- erb-set4.csv
|    |    |- erb-set5.csv
|    |    \- readme.md
|    |  
|    |----\ php_mtrand
|    |    |- mtr-set1.csv
|    |    |- mtr-set2.csv
|    |    |- mtr-set3.csv
|    |    |- mtr-set4.csv
|    |    |- mtr-set5.csv
|    |    \- readme.md
|    |  
|    |----\ php_rand
|    |    |- r-set1.csv
|    |    |- r-set2.csv
|    |    |- r-set3.csv
|    |    |- r-set4.csv
|    |    |- r-set5.csv
|    |    \- readme.md
|    |
|    |----\ random_org
|    |    |---\txt
|    |    |    |- randomOrg-10k-mrk-20190219_01.txt
|    |    |    |- randomOrg-10k-mrk-20190219_02.txt
...  ...  ...  \- LOTS OF TXT FILES...
|    |    |- randOrg-mrk-check.csv
|    |    \- readme.md
|- .gitignore
|- LICENCE
\- README.md

Generation Schema

Characteristics:

  • Three random number generators were used to provide five "sets" of 416,102 entities to match a corresponding seismic event record.
  • Each set contains six sample population "batches" whose sizes match the six seismic event sample populations, which are grouped by magnitude range.
  • Batches were generated independently, and are not subdivisions of a mass population.
  • Each entity has three associated, randomly generated "time value" numbers:
    • a "month" value (integer from 1 to 12)
    • a "marker" value (integer from 1 to 16)
    • and an "hour" value (integer from 0 to 23)

A tree-view structure visualization would be as follows:

Generator "X"
 \
 - - - Set #1 (416,102 total entities)
 |      \
 |      | - - Batch 1 (333,857 entities): [month val] [marker val] [hour val]
 |      | - - Batch 2 (72,257 entities): [month val] [marker val] [hour val]
 |      | - - Batch 3 (8,950 entities): [month val] [marker val] [hour val]
 |      | - - Batch 4 (970 entities): [month val] [marker val] [hour val]
 |      | - - Batch 5 (63 entities): [month val] [marker val] [hour val]
 |      \ - - Batch 6 (5 entities): [month val] [marker val] [hour val]
 |      
 - - - Set #2 (416,102 total entities)
 |      \
 |      | - - Batch 1 (333,857 entities): [month val] [marker val] [hour val]
 |      | - - Batch 2 (72,257 entities): [month val] [marker val] [hour val]
 |      | - - Batch 3 (8,950 entities): [month val] [marker val] [hour val]
 |      | - - Batch 4 (970 entities): [month val] [marker val] [hour val]
 |      | - - Batch 5 (63 entities): [month val] [marker val] [hour val]
 |      \ - - Batch 6 (5 entities): [month val] [marker val] [hour val]
 |
 - - - Set #3 (416,102 total entities)
 |      \
 |        etc...
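
The schema above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the script used to produce the files in this repo: Python's `random` module stands in for the PHP and Excel generators, and the names `BATCH_SIZES`, `generate_batch`, and `generate_set` as well as the seed are hypothetical. The batch sizes are the ones shown in the tree view.

```python
import random

# Batch sizes taken from the tree view above; they sum to 416,102.
BATCH_SIZES = [333_857, 72_257, 8_950, 970, 63, 5]


def generate_batch(size, rng):
    """Return `size` entities, each a (month, marker, hour) tuple."""
    return [
        (rng.randint(1, 12), rng.randint(1, 16), rng.randint(0, 23))
        for _ in range(size)
    ]


def generate_set(rng, batch_sizes=BATCH_SIZES):
    """Generate every batch independently instead of slicing one large pool."""
    return [generate_batch(size, rng) for size in batch_sizes]


if __name__ == "__main__":
    rng = random.Random(2019)  # hypothetical seed, only for repeatability
    one_set = generate_set(rng)
    print([len(batch) for batch in one_set])
    print(sum(len(batch) for batch in one_set))
```

Calling `generate_set` five times would give the five sets for one generator.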

Copyright

Random Data Copyright

The data obtained from the RANDOM.ORG website is used and posted here with kind permission. 💯

It should be understood that the number sets generated by PHP and MS Excel are technically pseudo-random. True random data comes from a non-deterministic source (RANDOM.ORG derives its numbers from atmospheric noise) and cannot be reverse engineered. Because PHP and Excel generate numbers with deterministic algorithms, their output is predictable in principle and not suitable for security purposes. Pseudo-random numbers are still chaotic in output, and a one-to-one comparison between the true random numbers obtained from RANDOM.ORG and the pseudo-random data shows no statistical difference as it pertains to this project.
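
If you want to run a similar sanity check yourself, one common approach is a chi-square test of uniformity on one of the value columns. The sketch below is an assumption about how such a check could look, not necessarily the comparison performed for this project; the function name `chi_square_uniform` and the seed are hypothetical, and the sample is generated on the fly rather than read from the CSV files.

```python
import random
from collections import Counter


def chi_square_uniform(values, categories):
    """Pearson chi-square statistic of `values` against a uniform distribution."""
    counts = Counter(values)
    expected = len(values) / len(categories)
    return sum((counts.get(c, 0) - expected) ** 2 / expected for c in categories)


if __name__ == "__main__":
    rng = random.Random(42)  # hypothetical seed
    # A stand-in sample the size of Batch 2, "month" values only.
    months = [rng.randint(1, 12) for _ in range(72_257)]
    stat = chi_square_uniform(months, range(1, 13))
    # 19.675 is the 5% critical value for 11 degrees of freedom (12 months - 1).
    print(f"chi-square = {stat:.2f}; consistent with uniform at 5%: {stat < 19.675}")
```

A statistic below the critical value means the sample shows no significant departure from a uniform spread across the twelve months.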

General Copyright

The data in this repository that I generated is released under the MIT license, which roughly means: free to reuse with attribution in your work, but the data is provided "as is". Peruse. 👈

Seismic Event Data Copyright

The information downloaded from the USGS API service was obtained freely under their Public Release of Information policy. This information is in the public domain, and no warranty is expressed or implied on their part or mine.

What Next...