Split crosswalk.csv into crosswalks/*.csv #205

progval · 2018-11-12T14:30:21Z

Creates one table per platform, and add a script (scripts/aggregate.py) to generate crosswalk.csv.

I wrote scripts/aggregate.py so that its output is exactly the content of crosswalk.csv before this PR; hence the hard-coded OLD_ORDER variable.
I can remove this if you wish.

This is a proposal to solve #204. Feel free to comment on it!

cboettig · 2018-11-12T22:05:07Z

Yay, nice work! 👏 🍾

Can you also add some information to README.md documenting this? Maybe also a CONTRIBUTING.md issue template noting this structure, to remind anyone filing a PR how to do it as a separate file?

I think this will make the process much easier. We have a few stale PRs and other issues that have been caused by all the collisions of using one master file.

Thoughs @mbjones ?

progval · 2018-11-13T09:40:11Z

Yay, nice work!

Thanks!

Can you also add some information to README.md documenting this? Maybe also a CONTRIBUTING.md issue template noting this structure, to remind anyone filing a PR how to do it as a separate file?

Done: 65cf67b

progval · 2018-11-13T09:41:27Z

I just had a thought: when a mapping in crosswalks/ does not provide a term matching a CodeMeta property, should the row still be present?

On the one hand, it makes it obvious in each mapping which properties are missing, but on the other hand it makes many useless lines.

cboettig · 2018-11-15T08:10:23Z

probably best for the row to be present

but hopefully the table-join script is robust enough (e.g do you use a join on schema term as the key column?) to work either way?

progval · 2018-11-15T13:28:23Z

but hopefully the table-join script is robust enough (e.g do you use a join on schema term as the key column?) to work either way?

No; it checks all rows are present, to avoid mistakes. I can change it to make a proper join, though.

cboettig · 2018-11-15T16:05:50Z

Hmm, maybe your way is actually better to be more strict in keeping alll rows.

On Thu, Nov 15, 2018 at 5:28 AM Valentin Lorentz ***@***.***> wrote: but hopefully the table-join script is robust enough (e.g do you use a join on schema term as the key column?) to work either way? No; it checks all rows are present, to avoid mistakes. I can change it to make a proper join, though. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#205 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AANleqK-OXlK6T0zkxf0sasZLCLirGHrks5uvWv3gaJpZM4YZwyW> .

-- --- Carl Boettiger http://carlboettiger.info/

cboettig · 2018-11-19T19:47:59Z

I think this is a great step forward, so I've gone and merged this in now. Thanks @progval !

* Add scripts/split.py and scripts/aggregate.py. * Add properties_description.csv (extracted from crosswalk.csv). * Add one crosswalk table per platform (extracted from crosswalk.csv). * Make scripts executable. * Run aggregate.py on Travis to check integrity of crosswalks/*.csv files. * Write CONTRIBUTING.md, and update README.md to refer to it.

progval added 3 commits November 12, 2018 15:31

Add scripts/split.py and scripts/aggregate.py.

260e7df

Add properties_description.csv (extracted from crosswalk.csv).

032ed43

Add one crosswalk table per platform (extracted from crosswalk.csv).

ae39a3b

progval force-pushed the split-tables branch from b0833a6 to ae39a3b Compare November 12, 2018 14:31

progval added 2 commits November 12, 2018 15:34

Make scripts executable.

8a6d226

Run aggregate.py on Travis to check integrity of crosswalks/*.csv files.

3084a3b

progval force-pushed the split-tables branch from 10c005c to 3084a3b Compare November 12, 2018 14:41

Write CONTRIBUTING.md, and update README.md to refer to it.

65cf67b

cboettig merged commit 2eaa31b into codemeta:master Nov 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split crosswalk.csv into crosswalks/*.csv #205

Split crosswalk.csv into crosswalks/*.csv #205

progval commented Nov 12, 2018

cboettig commented Nov 12, 2018

progval commented Nov 13, 2018 •

edited

progval commented Nov 13, 2018

cboettig commented Nov 15, 2018

progval commented Nov 15, 2018

cboettig commented Nov 15, 2018 via email

cboettig commented Nov 19, 2018

Split crosswalk.csv into crosswalks/*.csv #205

Split crosswalk.csv into crosswalks/*.csv #205

Conversation

progval commented Nov 12, 2018

cboettig commented Nov 12, 2018

progval commented Nov 13, 2018 • edited

progval commented Nov 13, 2018

cboettig commented Nov 15, 2018

progval commented Nov 15, 2018

cboettig commented Nov 15, 2018 via email

cboettig commented Nov 19, 2018

progval commented Nov 13, 2018 •

edited