Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lina - run the categorisation script #53

Closed
katilp opened this issue Mar 17, 2021 · 0 comments
Closed

Lina - run the categorisation script #53

katilp opened this issue Mar 17, 2021 · 0 comments

Comments

@katilp
Copy link
Contributor

katilp commented Mar 17, 2021

As explained in the main "interface" script, the categorisation algorithm to define the search categories can be run separately from the rest of the steps.

Clone the data-curation repository in your local area with

git clone git@github.com:cernopendata/data-curation.git

then go to the directory where the scripts for metadata processing are located:

cd data-curation/cms-YYYY-simulated-datasets

and run the script: to run it on the list of datasets that we now prepare for the release, do

python3 ./code/interface.py --print-categorisation ./inputs/CMS-2015-mc-datasets.txt > categorisation-2015.md

This produces a list in markdown format. This is the same format that we use for pages in the getting-started guide. You can read more on it at https://www.markdownguide.org/basic-syntax/

You can view how it renders using VS Code. Start VS Code with

code .

Open the newly created categorisation-2015.md file and open the preview by (1) right-clicking on the editor tab and (2) selecting "Open preview" from the drop-down menu

image

Note that these are the search categories that are displayed on the open data portal search menu and they are necessary for the open data users to find their dataset of interest.

image

@katilp katilp added this to To do in Lina-getting-started Mar 17, 2021
@linaJazar linaJazar moved this from To do to Done in Lina-getting-started Mar 22, 2021
@linaJazar linaJazar moved this from Done to To do in Lina-getting-started Mar 22, 2021
@linaJazar linaJazar moved this from To do to In progress in Lina-getting-started Mar 22, 2021
@linaJazar linaJazar moved this from In progress to Done in Lina-getting-started Mar 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants