Split up input-forms.xml and explore controlled vocabularies #34

Closed
alanorth opened this issue Dec 5, 2014 · 5 comments
@alanorth

alanorth commented Dec 5, 2014

input-forms.xml is one big, ugly file right now. It has many templates for many institutes, which makes it hard to manage, and always conflicts with mainline DSpace during upgrades.

We need to be able to split the institution-specific controlled vocabularies into separate files.

@alanorth

@swebsheet did some more work on this for ILRI subjects. He modified input-forms.xml so that the field takes its values from an external vocabulary, as follows:

        <field>
          <dc-schema>dc</dc-schema>
          <dc-element>isubject</dc-element>
          <dc-qualifier>ilrisubject</dc-qualifier>
          <repeatable>true</repeatable>
          <label>ILRI Subject Focus</label>
          <input-type>twobox</input-type>
          <hint></hint>
          <required></required>
          <vocabulary>ilri-subjects</vocabulary>
        </field>

The ILRI subjects that were previously listed in input-forms.xml now live in dspace/config/controlled-vocabularies/ilri-subjects.xml, in the following format:

<?xml version="1.0" encoding="utf-8"?>
<node id="ILRISubjectCategories" label="ILRI subjects">
  <isComposedBy>
    <node>
      <isComposedBy>
        <node label="VALUE CHAINS" id="I.1">
          <hasNote>VALUE CHAINS</hasNote>
        </node>
        <node label="AGRICULTURE" id="I.2">
          <hasNote>AGRICULTURE</hasNote>
        </node>
        <node label="AGRI-HEALTH" id="I.3">
          <hasNote>AGRI-HEALTH</hasNote>
        </node>
...

See dspace/config/controlled-vocabularies/srsc.xml for a complete example.
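
Since the snippet above is cut off, here is a minimal sketch of what a complete, well-formed vocabulary file could look like. It uses only the three terms quoted above. It also drops the unlabeled intermediate node so that the terms sit directly under the root, which is an assumption made for brevity, not the layout of the real ilri-subjects.xml.

```xml
<?xml version="1.0" encoding="utf-8"?>
<!-- Hypothetical, shortened vocabulary: one root node with three leaf terms -->
<node id="ILRISubjectCategories" label="ILRI subjects">
  <isComposedBy>
    <!-- Each term is a node; the label is what the submitter sees -->
    <node id="I.1" label="VALUE CHAINS">
      <hasNote>VALUE CHAINS</hasNote>
    </node>
    <node id="I.2" label="AGRICULTURE">
      <hasNote>AGRICULTURE</hasNote>
    </node>
    <node id="I.3" label="AGRI-HEALTH">
      <hasNote>AGRI-HEALTH</hasNote>
    </node>
  </isComposedBy>
</node>
```

Every opening `node` and `isComposedBy` element needs its matching close, and the file name (without `.xml`) is what goes in the `<vocabulary>` element of the field definition.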

@alanorth

Works great!

... but resulting metadata is hierarchical, like: ILRI vocabularies::ILRI subjects::VALUE CHAINS.

This creates problems with our existing workflows (search, browse, data cleaning in Excel, etc.). Maybe we can adapt our workflows to use this, but I'm not sure.

@alanorth

Looks like there are two solutions to these two separate problems:

The second option doesn't cover all the UI use cases, but definitely lets us use the power of hierarchies in Discovery facets. A good example of that is on the World Bank's repository: https://openknowledge.worldbank.org/handle/10986/21552?show=full (see the okr.topic etc)

@alanorth

Option one works! From input-forms.xml:

<!DOCTYPE input-forms SYSTEM "input-forms.dtd" [
<!ENTITY cg-maps SYSTEM "cg-maps.xml">
]>
...
&cg-maps;

And then in cg-maps.xml we just paste the content we want to include.
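
To make the include mechanism above concrete, here is a rough sketch of how the two files might fit together. The placement of `&cg-maps;` inside `<form-map>` and the contents of cg-maps.xml (a `name-map` entry with a made-up handle and form name) are assumptions for illustration only; the thread does not show what was actually moved into the included file.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- input-forms.xml (sketch): declare the external entity in the internal DTD subset -->
<!DOCTYPE input-forms SYSTEM "input-forms.dtd" [
  <!-- A relative system identifier is resolved relative to this file -->
  <!ENTITY cg-maps SYSTEM "cg-maps.xml">
]>
<input-forms>
  <form-map>
    <name-map collection-handle="default" form-name="traditional" />
    <!-- The parser replaces this reference with the contents of cg-maps.xml -->
    &cg-maps;
  </form-map>
  <!-- form-definitions and form-value-pairs omitted from this sketch -->
</input-forms>
```

```xml
<!-- cg-maps.xml (sketch): a bare fragment, with no DOCTYPE and no single root element -->
<!-- The handle and form name below are hypothetical placeholders -->
<name-map collection-handle="123456789/1" form-name="example-institute" />
```

The included file is an external parsed entity, so it must not carry its own DOCTYPE declaration, and whatever it contains has to be valid at the point where the reference appears. This approach also depends on the XML parser being allowed to load external entities, which evidently was the case here.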

@alanorth alanorth changed the title Split up input-forms.xml and controlled vocabularies Split up input-forms.xml Jun 17, 2015
@alanorth alanorth changed the title Split up input-forms.xml Split up input-forms.xml and explore controlled vocabularies Jun 17, 2015
@alanorth

alanorth commented Jul 6, 2016

Closing in favor of #69.

@alanorth alanorth closed this as completed Jul 6, 2016