Fix docstrings for DataFrame, and DataFrameReader #41

sfc-gh-okostakis · 2021-08-06T17:27:07Z

This PR aims to fix ALL details related to docstrings (for the purpose of autodocs) for the files dataframe.py and dataframe_reader.py

sfc-gh-jdu

Very good start! Do we have a plan to fix docstrings for other files?

One thing I'm a little bit concerned about is that the long docstring is not formatted (separated to multiple lines). Should we also enforced our auto-formatter to control the length of a docstring in a line? (we can add this flag psf/black#1802)

src/snowflake/snowpark/dataframe.py

sfc-gh-jdu · 2021-08-06T17:44:54Z

src/snowflake/snowpark/dataframe.py

+    Example 4
+        Create a new DataFrame by applying transformations to other existing DataFrames::
+
+            df_merged_data = df_catalog.join(df_prices, df_catalog["itemId"] == df_prices["ID"])


Is getting a column from a dataframe using []-style accessors worth documenting?

In Scala it was documented in the user-guide.
@sfc-gh-mabrennan FYI

src/snowflake/snowpark/dataframe.py

src/snowflake/snowpark/dataframe_reader.py

src/snowflake/snowpark/dataframe.py

sfc-gh-mabrennan

This looks great! I made a few small suggestions.

src/snowflake/snowpark/dataframe.py

sfc-gh-mabrennan · 2021-08-08T16:12:58Z

src/snowflake/snowpark/dataframe_reader.py

-        In addition, if you want to load only a subset of files from the stage, you can use the
-        `pattern <https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html#loading-using-pattern-matching>`_
-        option to specify a regular expression that matches the files that you want to load.
+        In addition, if you want to load only a subset of files from the stage, you can


on line 308 above this, the sentence says " Loading the first two columns of a colon-delimited CSV file ..." but the example code seems to use a semicolon as a delimiter and not a colon. (This error also exists in the Scaladoc)

Switched the delimiter to a colon in the example code (I copied the examples from there).

sfc-gh-okostakis · 2021-08-09T14:44:04Z

Very good start! Do we have a plan to fix docstrings for other files?

Each one of us should gradually fix the files for which they are the "code owners". Counting on @sfc-gh-mabrennan to step in and carry us the extra mile by assisting with examples and such.

One thing I'm a little bit concerned about is that the long docstring is not formatted (separated to multiple lines). Should we also enforced our auto-formatter to control the length of a docstring in a line? (we can add this flag psf/black#1802)

In some cases creating a new line is not working well with sphinx. I would like to avoid having the auto-formatter messing up the strings and getting silly autodocs. If, however, our auto-formatter can be 100% compatible with sphinx's format, then we should utilize it.

sfc-gh-okostakis added 2 commits August 6, 2021 17:19

Fix docstrings for DataFrame, and DataFrameReader

f7214c0

autoformatter

238fbe8

sfc-gh-okostakis requested review from sfc-gh-kwagner, sfc-gh-jdu and sfc-gh-mabrennan August 6, 2021 17:27

sfc-gh-jdu reviewed Aug 6, 2021

View reviewed changes

address comments for fixes

a40d91d