Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow custom transformers to be used in the dataset DSL #22

Open
wstrinz opened this issue Oct 25, 2013 · 0 comments
Open

Allow custom transformers to be used in the dataset DSL #22

wstrinz opened this issue Oct 25, 2013 · 0 comments

Comments

@wstrinz
Copy link
Owner

wstrinz commented Oct 25, 2013

PubliSci's RDFization classes all use the Data Cube vocabulary to represent its input. The DSL and server extension don't necessarily require this however, and could serve as a useful interface for accessing other conversion tools.

Currently, the data function of the dsl is used by specifying the input source, and various Data Cube or parser specific parameters, for example

data do
  source 'https://github.com/wstrinz/publisci/raw/master/spec/csv/bacon.csv'

  dimension 'producer', 'pricerange'
  measure 'chunkiness'

  option 'label_column', 'producer'
end

Resolving this issue would, for example, involve modifying dataset_dsl.rb so that something like

data do
  engine :bio-interchange
  source 'https://gist.github.com/wstrinz/7165201/raw/0f688ba7041e828d3336bb530aa7495d94022af1/example.gff3'
end

to convert a GFF3 file using BioInterchange would be possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant