docs: initial user-guide pass for string functions #2635
Conversation
Nice!
I'd like to review the web-preview, but it's not getting built yet because of an error in the tests.
We should just write some demo data to disk, rather than loading a huge remote resource.
Yeah. Even though it's Zenodo and intended to be a long-lived resource, if we tie ourselves to too many of these things, we'll have to spend time repairing them if they ever do go offline. For datasets in git repos, like the bike paths, I made a fork of that repo and pointed to my fork. This one, though, isn't hosted on GitHub; it's on Zenodo because it's very large.
But another problem with basing tutorials on very large datasets is that it introduces a large computation cost every time we build documentation, as well as a likely source of failure. (For example, the machine running the test to generate the output is overwhelmed and the test sometimes fails to build.)
A tutorial doesn't need a large dataset to be instructive; large datasets are for demos to be convincing. Could these inputs be cut down to something on the order of kB and put in the tests/samples directory? Then we'd be hosting them, and they'd be small.
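A minimal sketch of the "write some demo data to disk" idea suggested above: generate a kB-scale sample file that could live in a `tests/samples`-style directory. The file name, line format, and helper are hypothetical, not taken from the actual PR.

```python
from pathlib import Path

def write_sample_log(path, n_lines=50):
    """Write a tiny, deterministic log-file sample (order of kB) so that
    tutorials don't need to download the full remote dataset.
    The log-line format here is made up for illustration."""
    lines = [
        f"2023-08-0{i % 9 + 1} 12:00:{i:02d} INFO event-{i}"
        for i in range(n_lines)
    ]
    p = Path(path)
    p.write_text("\n".join(lines) + "\n")
    return p.stat().st_size  # size in bytes, to confirm it stays small

size = write_sample_log("sample.log")
print(f"wrote sample.log ({size} bytes)")
```

Because the data is generated rather than downloaded, the documentation build has no network dependency and the sample can be regenerated or edited freely.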
(A large dataset is good for demonstrating scaling, but I'm wary of demonstrating scaling in auto-run tutorials. When I did that with the bike routes example, the top-voted comment on Hacker News was that the demo broke, due to a version issue between Numba and NumPy and the fact that our "tests are broken" error didn't stop the build.)
Yes, that's the main reason that I'm in favour of vendoring (a subset of) these data.
Should this one be included in today's release? I approved it because I think it's the right documentation, though I'm not sure if the switch to vendoring a subset of the data is to happen in this PR or another one. The tests aren't all running because it's a documentation-only PR, which I can force-merge if it's ready to go today.
Yes, we should include it! It contains the vendored log files as discussed :)
Okay! I'll merge it now.
This PR adds three user-guide entries under a new "strings" ToC heading. These guides cover loading strings from an array, splitting and joining, and component extraction.
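The three operations the guides cover can be sketched with plain Python string methods. The actual user-guide API isn't shown in this conversation, so the data and calls below are illustrative only:

```python
import re

# A plain list stands in for whatever array type the user guide loads
# strings from; the record format is invented for illustration.
records = ["alice:42", "bob:17", "carol:99"]

# Splitting and joining.
parts = [r.split(":") for r in records]   # each record -> [name, value]
joined = "; ".join(records)               # one delimited string

# Component extraction with a regular expression.
names = [re.match(r"([a-z]+):(\d+)", r).group(1) for r in records]

print(parts, joined, names)
```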