TraitDB is a Ruby on Rails web application for storing and searching trait data. It is in development at NESCent to support working groups.
TraitDB is a Rails 4 application. It requires ruby and rubygems to run. Other dependencies are specified in the Gemfile. To get up and running with the development environment, you will need Postgres installed. TraitDB can also be configured to work with MySQL if you wish, but Postgres is preferred.
Clone the repository
git clone email@example.com:NESCent/TraitDB.git
Install dependencies with
Set your database credentials as environment variables.
config/database.ymlwill read these values out of the environment. If your database server is on a different host, set the host/port as well:
export TRAITDB_PG_DEV_USER="traitdb_dev_user" export TRAITDB_PG_DEV_PASS="your-password-here"
rake db:setup. This Instructs Rails to connect to your database and create the required users and databases. If your database requires you to authenticate before creating users/databases, you will be prompted for credentials.
rake db:setupis successful, it will also run a
rake db:migrateto create database tables. If not successful, you can create the databases and users manually, then run
- If you wish to enable Google Sign-in (recommended), you will need to
- Register an application for Google OAuth 2.0
- Enable the Google+ API
- Set the Client ID and Client Secret credentials in your environment:
export TRAITDB_GOOGLE_APP_ID="your-google-app-id" export TRAITDB_GOOGLE_APP_SECRET="your-google-app-secret"
- Start the server with
- Visit http://localhost:3000 to access the application. You will be shown the about page. If you click Upload, you will be redirected to the sign-in screen. From here, you can sign in with OpenID or a Google Account
- Start a delayed_job worker. Delayed job is used to execute dataset imports as a background process. It includes a rake task to start a worker. You can run
rake jobs:workin an additional terminal process, or run a worker as a daemon with
Getting Started - Projects and Users
Data in TraitDB is publicly searchable and organized into projects. Initially there are no projects, and only administrators can create projects. Authentication is handled by OpenID, so in order to get started, you must:
- After signing in, there will be an entry in the users table with your email address.
- Upgrade this user to an Administrator with the following rake command:
$ rake traitdb:upgrade_admin[firstname.lastname@example.org] Upgrading email@example.com
- Reload your web browser, you will have an Admin menu option.
- Click Admin->Projects, and the New Project button.
- Fill out the project details and save the new project
Any authenticated user can upload data to any project, but only administrators can create projects and upload Import Configs.
TraitDB accepts data uploads in CSV format, with a specific focus on data validation and organization. In order to upload data into a project, you must write at least one import configuration file in YAML format. This configuration file will contain the project-specific data for your spreadsheets, as well as allowable values and rules for data relationships and which columns to import, ignore, or convert.
For detailed information on writing import configs, see the documentation on the wiki.
Examples for the configuration files are in the lib/traitdb_import directory.
Generally, the CSV files are required to have the following general characteristics
- The first row contains column header names The column names include Taxonomic ranks (e.g. Order, Genus, Species), names of traits, and column names for metadata.
- Each data row includes trait data and metadata for one Operational Taxonomic Unit (OTU)
- Data for a single trait (column) may be either categorical (One or more string tokens separated by a delimeter) or continuous (floating point values)
- Source / Reference information for a trait may be in an associated column
As an admin user, you can upload and manage Import Configs for a project. Authenticated users will be able to choose an Import Config when they upload data to the project.
At the upload stage, the user can get information about the Import Config, or download a template CSV file that conforms to it.