Joseph Vantassel, The University of Texas at Austin
- Prefer easily readable formats (i.e.,
.txtand.csv) over proprietary formats (i.e.,.xlsxand.docx). - Prefer multiple simple files (i.e., multiple
.csv) over one large complex file (i.e.,.xlsxworkbook with multiple sheets). - If something is unusual about the work add a
meta.mdwith narrative information regarding what you did and why it was necessary. - Avoid saving many output figures to disk, instead organize them in a notebook or better yet create a proper summary document describing your work.
- Avoid directory and file name capitalization.
- Do not use spaces or special characters in your file and directory names.
- Prefer the use of
_(i.e.,snake_case) to delineate words instead of capitalization (i.e.,camelCase). - Keep file names short, descriptive, and easy-to-read.
- coordinates
- Includes relative and absolute coordinates for the project. It contains
three subdirectories as follows:
- construction
- Should contain the coordinates in the most raw form.
- Should provide a clear and logical progression from raw to final coordinates.
- final
- Include final coordinates, specifically:
master.csvwith the final coordinates of all passive-stations.c60.txtShould contain a separate geopsy-style coordinate file for each passive array.
- Include final coordinates, specifically:
- kml
- Includes kml files for viewing site information in Google Earth, specifically:
planning.kmlwhich shows the site characterization as planned.as-built.kmlwhich shows the acquisition as it was performed.
- Arrays should be divided into groups.
- Each sensor should be named using its number and its associated array.
- Includes kml files for viewing site information in Google Earth, specifically:
- construction
- Includes relative and absolute coordinates for the project. It contains
three subdirectories as follows:
- fk
- Includes results for frequency-wavenumber processing, specifically:
.maxand.logfiles for each 2D array,mam.ipynbfor post-processing, and.jsonwith post-processed dispersion peaks.
- Includes results for frequency-wavenumber processing, specifically:
- geopsy_db
- Includes geopsy database(s). A database should:
- Exist for each array.
- Include the relative coordinate for each station.
- Include at a minimum all three components (XYZ) together, though you may produce subdivided groups (i.e., Z alone) if desired.
- Includes geopsy database(s). A database should:
- hvsr
- Includes results of horizontal-to-vertical spectral ratio processing, specifically:
hvsr.ipynbfor processing.f0.csvort0.csvwith the mean and standard deviation of frequency or period respectively.hvsr.kmlwhich shows the distribution of frequency/period.
- Includes results of horizontal-to-vertical spectral ratio processing, specifically:
- inversion
- Includes information necessary for inversion. It includes, subdirectories
as follows:
- 0_targets
- Includes dinver-style
.targetfiles.
- Includes dinver-style
- 1_parameters
- Includes dinver-style
.paramfiles.
- Includes dinver-style
- 2_reports
- Includes dinver-style
.reportfiles.
- Includes dinver-style
- 3_text
- Includes dinver-style
_GM.txtand_DC.txtfiles.
- Includes dinver-style
- Additional files for pre- and post-processing will be kept in the inversion directory. Do not place processing files in any of the directories listed above.
- 0_targets
- Includes information necessary for inversion. It includes, subdirectories
as follows:
- masw
- Includes multi-channel analysis of surface waves data, specifically:
masw.ipynbfor processing and post-processing,.jsonfile from processing,.jsonfile from post-processing.
- Includes multi-channel analysis of surface waves data, specifically:
- raw_data
- Includes the data in its most raw form. It includes, two subdirectories
activeandpassivefor each type of testing performed. - Each directory should include a
metadirectory will any relevant meta information.
- Includes the data in its most raw form. It includes, two subdirectories
- reports
- Includes incoming and out-going summary reports. Should contain a
subdirectory indicating where it is going or who it is from (e.g.,
from_jdto indicate a report fromJane Doe). The directory should include the report and any supporting information and figures. Avoid performing analyses in these directories.
- Includes incoming and out-going summary reports. Should contain a
subdirectory indicating where it is going or who it is from (e.g.,
- site_information
- Includes site information such as general geologic information, site specific references, invasive testing data/results, and other useful references for performing surface wave inversion.
- spac
- Includes results of the spatial auto-correlation processing, specifically:
.maxand.logfiles for each 2D array.
- Includes results of the spatial auto-correlation processing, specifically:
- unprocessed_data
- Includes data in the most ready-to-be-use form, specifically:
- MASW and MAM array data should be separated into their own directories
where each array is named by its testing type and its name/id (e.g.
masw_array1ormam_c150). - MASW data should consist of unstacked,
seg2data files with correct header information. Do not rely on the data sheets to note incorrect header information, instead fix the error before transferring it. - MAM data should consist of trimmed
.mseedfiles with the header information entered correctly, this at a minimum should include thegroup id,station number, and the associatedarray. - Typed summaries of the meta-data in picture-ready
.pdfformat.
- MASW and MAM array data should be separated into their own directories
where each array is named by its testing type and its name/id (e.g.
- Includes data in the most ready-to-be-use form, specifically: