Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
This page is intended to document all aspects of data management, from the day-to-day, formal NGS and proteomics plans, and general archiving options.
Daily Data on Owl
Data, including intermediate analysis, needs to have a url. This most often means it will live on a Network Attached Storage Device (NAS; aka a server).
Using the Owl NAS to store your data:
Ask Steven or Sam to generate a user account for you. A folder will be created for you in:
owl/web/Ask Steven/Sam for the name of the folder, as well as your username and password.
Upload data to your Owl web folder:
Navigate to http://owl.fish.washington.edu/
Web Browser login. 1. If it's your first time visiting this page, your browser will present you with a warning about an insecure site or bad certificate. That's OK. Click on the option to add an exception for this site.
Enter username and password. (NOTE: If it's your first time accessing your account, please change your password by clicking on the silhouette in the upper right corner, then "Personal" in the dropdown menu).
Navigate to File Station > web > your_folder (If you don't see the File Station icon, click on the icon of four squares in the upper left corner and select File Station from the subsequent menu).
Click-and-drag files from your computer to your
Files that you have uploaded to your_folder are publicly viewable: http://owl.fish.washington.edu/your_folder
You can use the URLs for your files for linking in your notebook.
All folders need to contain a readme file.
The readme files should be plain text (i.e. do not create/edit the file with a word processor like Microsoft Word or LibreOffice Writer) and should describe the contents of the folder. If there are directories in the same folder as your readme file, the directory names should be listed and a brief description of their contents should be provided.
Please refrain from using any non alpha-numeric (including spaces) in file and folder names.
NGS Data Management Plan
As sequencing facility provdes data, files are downloaded to our local NAS (owl), in the correct species subdirectory within
MD5 checksums are generated and compared to those supplied by the sequencing facility.
Append the generated MD5 checksums to the
checksums.md5file. If that file does not yet exist, create it, and add the generated checksums to the new
The Nightingales Google Spreadsheet is updated.
Each library (i.e. each sample with a unique sequencing barcode) is entered in its own row.
SeqIDis the base name of the sequencing file (i.e. no file extensions like ".fq.gz" or ".gz")
Each library receives a unique, incremented
Each library receives a
Library_name; this may or may not be unique.
Update the Nightingales Google Fusion Table with new information from the Nightingales Google Spreadsheet. This is accomplished by:
Deleting all rows in the Nightingales Google Fusion Table (Edit > Delete all rows)
Importing data from the Nightingales Google Spreadsheet (File > Import more rows...)
The Google Docs spreadsheet Nightingales Google Spreadsheet is backed up on a regular basis? by downloading tab-delimited file and pushing to LabDocs Repository, with the file name
The nightingales directory on owl is backed up to Amazon Glacier. This is accessible by .......?
SRA Upload ...
Proteomics Data Management Plan
- As sequencing facility provides data, files are downloaded to our local NAS (owl), in the root
As opposed to the NGS data, these data will be organized based on date.
- The Spreadsheet is updated.