Cannot read data files with same basename but different extensions #8300

ShaneCurcuru · 2020-07-11T23:19:01Z

My Environment

| Software | Version(s) |
| --- | --- |
| Operating System | OSX Catalina 10.15.5 |
| `jekyll` | Latest |
| `github-pages` | Latest |

Expected Behaviour

Given:
_data/samename.csv
_data/samename.json

I would expect to be able to access both data sources via site.data.samename.csv and site.data.samename.json (or some equivalent hash lookup such as site.data['samename.json']).

Current Behavior

site.data.samename (or variants) only returns the contents of the .csv file. The .json file is not accessible at all.

Code Sample

The behavior is presumably because data_reader.rb reads all data files but indexes them by File.basename, so when several files share a basename, only one file's data survives: presumably the one whose extension happens to come last in the Dir glob here:

https://github.com/jekyll/jekyll/blob/master/lib/jekyll/readers/data_reader.rb#L36
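To illustrate the suspected mechanism, here is a minimal sketch of what happens when files are keyed by basename with the extension stripped. It is not Jekyll's actual code: `index_by_basename` and the placeholder "contents of ..." strings are hypothetical, and the order of the input list stands in for whatever order the glob returns.

```ruby
# Hypothetical simplification of an extension-stripping index; not Jekyll's code.
def index_by_basename(filenames)
  filenames.each_with_object({}) do |name, data|
    # "samename.json" and "samename.csv" both reduce to the key "samename".
    key = File.basename(name, ".*")
    # A later file with the same key silently overwrites the earlier one.
    data[key] = "contents of #{name}"
  end
end

index = index_by_basename(["samename.json", "samename.csv"])
puts index.inspect
```

With this input order the single `samename` key ends up holding the .csv entry, which matches the behavior described above.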

Question: Is this an intended feature, or merely something no-one has considered as a use case yet? My use case is an open data portal, where we host various .csv files (of government budgets, etc.), with a corresponding .json file that is the dcat:Dataset for the .csv file. There are valid reasons to have similarly named files in varying formats, so this seems like it would be useful (although I suppose fixing this would mean some data reading behaves subtly differently).

Thanks, Jekyll peeps!


ashmaroli · 2020-07-12T06:03:58Z

> Is this an intended feature, or merely something no-one has considered as a use case yet?

Irrespective of the intention / design, this issue cannot be addressed without breaking existing workflows, especially since the current behavior has remained the same from the time the feature was introduced.

Perhaps this may get considered for a v5.0.0 (which won't be happening in the near future).


ShaneCurcuru · 2020-07-12T13:59:25Z

Understood (and expected). I'll work around it, but please do consider this a plea for updating the docs to make it clear what order the files will be read in, so others aren't surprised.
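For anyone else working around this, a small pre-build check can flag same-basename data files before they silently shadow each other. This is only a sketch: `colliding_data_files` is a hypothetical helper, the extension list is an assumption about which formats are read as data, and it only inspects the top level of the directory. The temporary directory stands in for a real `_data` folder.

```ruby
require "tmpdir"
require "fileutils"

# Assumed set of data-file extensions; adjust to match your setup.
DATA_EXTENSIONS = %w[.yaml .yml .json .csv .tsv].freeze

# Hypothetical helper: maps each shared basename to the files that collide on it.
def colliding_data_files(data_dir)
  Dir.children(data_dir)
     .select { |name| DATA_EXTENSIONS.include?(File.extname(name).downcase) }
     .group_by { |name| File.basename(name, ".*") }
     .select { |_key, names| names.size > 1 }
     .transform_values(&:sort)
end

# Demo against a throwaway directory instead of a real _data folder.
collisions = Dir.mktmpdir do |dir|
  %w[samename.csv samename.json members.yml].each do |name|
    FileUtils.touch(File.join(dir, name))
  end
  colliding_data_files(dir)
end

collisions.each do |key, names|
  puts "site.data.#{key} is ambiguous: #{names.join(', ')}"
end
```

Renaming one of the flagged files, or moving the two formats into separate subdirectories of `_data`, avoids the clash.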

ashmaroli · 2020-07-12T15:55:51Z

> consider this a plea for updating the docs

The docs already contain a fleeting reference to this behavior:

> This data can be accessed via site.data.members (notice that the filename
> determines the variable name).

That said, ~~I'll update~~ I've updated that line to reduce assumptions.

ashmaroli added the pinned label Jul 12, 2020

ShaneCurcuru added a commit to ArlingtonMA/arlingtonma.info that referenced this issue Jul 12, 2020

Don't use same-named data files

d9abddb

See jekyll/jekyll#8300 for why same names won't work
