Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BEAM-1665] Add Pipeline I/O section to website - outline + move some existing txt #173

Closed
wants to merge 3 commits into from

Conversation

ssisk
Copy link

@ssisk ssisk commented Mar 9, 2017

This PR has a fair number of TODOs in the website content, but I have PRs queued up with more content from the IO authoring guide and none of the TODOs are for content already on the website.

It has very little actual new content - mostly, it created structure and moves existing content around.

  • I did not to go with a single page for all this content b/c both java and python have enough unique content that they deserve their own separate sections (ie, just tabs on the code isn't enough), and the "click to the next page" model currently implemented allows the user to pick java vs python, but then after reading those pages, the next page for both points at the same place - the users mostly follow the same path, but for java vs python specific content, they will diverge then converge again.
  • I moved the "list of built-in I/O" content over to it's own separate page since it'd be nice to have more content there - e.g. capabilities matrix, and it felt special enough to pull out of the programming guide.
  • We decided not to put all of this content in the contribute section of the site since the expectation is we don't think all users will contribute their IO transforms, so we want most of the docs to just be about writing an IO transforms, and they lay out the expectations in the contribute part of the IO section.

R @melap

@asfbot
Copy link

asfbot commented Mar 9, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Test/285/
--none--

@asfbot
Copy link

asfbot commented Mar 9, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Stage/327/

Jenkins built the site at commit id b0e0862 with Jekyll and staged it here. Happy reviewing.

Note that any previous site has been deleted. This staged site will be automatically deleted after its TTL expires. Push any commit to the pull request branch or re-trigger the build to get it staged again.

Copy link

@melap melap left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for the delay, thanks for setting up the skeleton for a new I/O section! I've added some comments.

---

[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this page is missing a title

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done, thanks!


[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

This is the set of I/O transforms currently available in Beam.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm, what about something like "This table contains the currently available I/O transforms."

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


This is the set of I/O transforms currently available in Beam.

Consult the [Programming Guide I/O section]({{site.baseurl }}/documentation/programming-guide#io) for general usage instructions and the javadoc/pydoc for the particular I/O transforms.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I initially read it quickly as the instructions and javadoc/pydoc are both available in the programming guide. perhaps add a comma and "see" --> "usage instructions, and see the javadoc ..."

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done!


# Contributing I/O Transforms to Beam

* If you are planning to contribute your I/O transform to the beam community, you'll be going through the normal beam contribution life cycle - see the Apache Beam Contribution Guide for more details.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • in general we've been trying to always have first mention of Beam be prefixed with Apache. maybe add Apache to the H1 title?
  • beam should be capitalized (x2)
  • link to the contribution guide?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done. I ended up from the "to Beam" part, but I moved the Apache to the first instance of beam in the doc


# Next steps

If you've got a well tested I/O transform, why not contribute it to Beam? Read all about it:
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"If you have a"

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

</tr>
</table>
### Beam-provided I/O Transforms
There is a page devoted to [Beam-provided I/O Transforms]({{site.baseurl }}/documentation/io/built-in/).
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm wondering if it might be good to specify what's roughly on the page, so users can easily decide if they want/need to jump there or not. though, this would need to be updated again later, after/if more is added to the page. and admittedly, the sentence below is a bit redundant right now while it's just a list of the provided transforms.

See the Beam-provided I/O Transforms page for a list of the currently available I/O transforms.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah, I struggled with the wording here, this feels better.

done.



## Pipeline I/O
There is a page devoted to [Beam-provided I/O Transforms]({{site.baseurl }}/documentation/io/built-in/).
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same as above

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

title: "Authoring I/O - Overview"
permalink: /documentation/io/authoring-overview/
---

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: a couple extra blank lines

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

# Authoring I/O - Overview
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the two titles (metadata at top and this H1) are missing the "Transforms", "Authoring I/O Transforms"

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


# Authoring I/O - Overview

_A guide for those who need to connect to a data store not covered by a [Built-in I/O Transforms]({{site.baseurl }}/documentation/io/built-in/)_
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

slight tweaks, maybe:
A guide for users who need to connect to a data store that isn't supported by the [Built-in I/O Transforms].

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

…ntent

* I did not to go with a single page for all this content b/c both java and python have enough unique content that they deserve their own separate sections (ie, just tabs on the code isn't enough), and the "click to the next page" model currently implemented allows the user to pick java vs python, but then after reading those pages, the next page for both points at the same place - the users mostly follow the same path, but for java vs python specific content, they will diverge then converge again.
* I moved the "list of built-in I/O" content over to it's own separate page since it'd be nice to have more content there - e.g. capabilities matrix, and it felt special enough to pull out of the programming guide.
* We decided not to put all of this content in the contribute section of the site since the expectation is we don't think all users will contribute their IO transforms, so we want most of the docs to just be about writing an IO transforms, and they lay out the expectations in the contribute part of the IO section.
@asfbot
Copy link

asfbot commented Mar 14, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Test/300/
--none--

Copy link
Author

@ssisk ssisk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

okay! thanks so much for all your comments.

---

[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done, thanks!

title: "Authoring I/O - Overview"
permalink: /documentation/io/authoring-overview/
---

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

# Authoring I/O - Overview
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


# Authoring I/O - Overview

_A guide for those who need to connect to a data store not covered by a [Built-in I/O Transforms]({{site.baseurl }}/documentation/io/built-in/)_
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


[Pipeline I/O Table of Contents]({{site.baseurl}}/documentation/io/io-toc/)

This is the set of I/O transforms currently available in Beam.
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


This is the set of I/O transforms currently available in Beam.

Consult the [Programming Guide I/O section]({{site.baseurl }}/documentation/programming-guide#io) for general usage instructions and the javadoc/pydoc for the particular I/O transforms.
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done!


# Contributing I/O Transforms to Beam

* If you are planning to contribute your I/O transform to the beam community, you'll be going through the normal beam contribution life cycle - see the Apache Beam Contribution Guide for more details.
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done. I ended up from the "to Beam" part, but I moved the Apache to the first instance of beam in the doc


# Next steps

If you've got a well tested I/O transform, why not contribute it to Beam? Read all about it:
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

</tr>
</table>
### Beam-provided I/O Transforms
There is a page devoted to [Beam-provided I/O Transforms]({{site.baseurl }}/documentation/io/built-in/).
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah, I struggled with the wording here, this feels better.

done.



## Pipeline I/O
There is a page devoted to [Beam-provided I/O Transforms]({{site.baseurl }}/documentation/io/built-in/).
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

@asfbot
Copy link

asfbot commented Mar 14, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Stage/342/

Jenkins built the site at commit id a6f367a with Jekyll and staged it here. Happy reviewing.

Note that any previous site has been deleted. This staged site will be automatically deleted after its TTL expires. Push any commit to the pull request branch or re-trigger the build to get it staged again.

@melap
Copy link

melap commented Mar 14, 2017

Thanks, LGTM as a skeleton starting point. Looking forward to seeing it full of good stuff.

@ssisk
Copy link
Author

ssisk commented Mar 15, 2017

Thanks for the review Melissa!
cc @davorbonaci @francesperry for merge (and review if you'd like)

@davorbonaci
Copy link
Member

I'm worried about publishing this as-is. We could do it, but users will just see a ton of TODOs, probably resulting in worse experience than without this content.

@ssisk, can we perhaps merge it, but comment it out, so users don't see so many TODOs?

@ssisk
Copy link
Author

ssisk commented Mar 15, 2017

sure, I'll take a pass and see what I can comment out so that it doesn't present a ton of TODOs to the users.

That'll probably look like: initially the TOC page will be there and the "using" section will be visible, but not the authoring section.

@asfbot
Copy link

asfbot commented Mar 17, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Test/308/
--none--

@ssisk
Copy link
Author

ssisk commented Mar 17, 2017

okay! I've:

  1. Updated the TOC so that it only displays one "this is a work in progress" note for the authoring content, and users won't see the authoring content yet/the other set of TODOs.
  2. Looked around to see if there's any pointers directly into the "authoring transforms" content, and changed the one reference (in the programming guide) to point at the IO TOC instead of directly at the content.

PTAL

@asfbot
Copy link

asfbot commented Mar 17, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Stage/350/

Jenkins built the site at commit id 1f4d996 with Jekyll and staged it here. Happy reviewing.

Note that any previous site has been deleted. This staged site will be automatically deleted after its TTL expires. Push any commit to the pull request branch or re-trigger the build to get it staged again.

@asfbot
Copy link

asfbot commented Mar 18, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Test/314/
--none--

@asfbot
Copy link

asfbot commented Mar 18, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Stage/356/

Jenkins built the site at commit id be4463f with Jekyll and staged it here. Happy reviewing.

Note that any previous site has been deleted. This staged site will be automatically deleted after its TTL expires. Push any commit to the pull request branch or re-trigger the build to get it staged again.

@asfbot
Copy link

asfbot commented Mar 18, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Test/315/
--none--

@asfbot
Copy link

asfbot commented Mar 18, 2017

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/beam_PreCommit_Website_Stage/357/

Jenkins built the site at commit id 76bf199 with Jekyll and staged it here. Happy reviewing.

Note that any previous site has been deleted. This staged site will be automatically deleted after its TTL expires. Push any commit to the pull request branch or re-trigger the build to get it staged again.

Copy link
Member

@davorbonaci davorbonaci left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Test breakage seems unrelated GitHub outage. Merging.

@asfgit asfgit closed this in b574876 Mar 18, 2017
@ssisk ssisk deleted the io-guide branch March 28, 2017 23:13
robertwb pushed a commit to robertwb/incubator-beam that referenced this pull request Jun 5, 2018
robertwb pushed a commit to robertwb/incubator-beam that referenced this pull request Jun 5, 2018
melap pushed a commit to apache/beam that referenced this pull request Jun 20, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants