Skip to content

Update the site for the 0.9.0 release#1205

Merged
rdblue merged 2 commits intoapache:masterfrom
rdblue:site-updates-0.9.0
Jul 15, 2020
Merged

Update the site for the 0.9.0 release#1205
rdblue merged 2 commits intoapache:masterfrom
rdblue:site-updates-0.9.0

Conversation

@rdblue
Copy link
Contributor

@rdblue rdblue commented Jul 15, 2020

This updates the site for the 0.9.0 release:

  • Add 0.9.0 to the releases page
  • Point Javadoc redirect to 0.9.0
  • Add warnings to Spark documentation
  • Update the Getting Started page to use Spark 3 SQL instead of the Java API and DataFrames
  • Add Spark catalogs to Configuration
  • Remove duplicate Spark API Quickstart with the same content as Java API Quickstart
  • Minor updates to CSS to reduce density

@rdblue rdblue requested review from aokolnychyi and rdsr July 15, 2020 00:31

!!! Warning
When reading with DataFrames in Spark 3, use `table` to load a table by name from a catalog.
Using `format("iceberg")` loads an isolated table reference that is not refreshed when other queries update the table.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@dongjoon-hyun, this is the warning I've added to make people aware of the issues with the DataFrameReader. Please take a look if you have time. There is also one below for the v1 DataFrameWriter API.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @rdblue . It looks good to me.


!!! Warning
Spark 3.0.0 has a correctness bug that affects dynamic `INSERT OVERWRITE` with hidden partitioning, [SPARK-32168][spark-32168].
For tables with [hidden partitions](../partitioning), wait for Spark 3.0.1.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ya. We should release Apache Spark 3.0.1 soon for the users.


!!! Warning
When writing with the v1 DataFrame API in Spark 3, use `saveAsTable` or `insertInto` to load tables with a catalog.
Using `format("iceberg")` loads an isolated table reference that will not automatically refresh tables used by queries.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @rdblue !

Copy link
Contributor

@rdsr rdsr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 Looks great!

@rdblue
Copy link
Contributor Author

rdblue commented Jul 15, 2020

Thanks for reviewing, @dongjoon-hyun and @rdsr!

@rdblue
Copy link
Contributor Author

rdblue commented Jul 15, 2020

Merging since this doesn't affect the build and CI was flaky.

@rdblue rdblue merged commit 39348fe into apache:master Jul 15, 2020
HotSushi pushed a commit to HotSushi/iceberg that referenced this pull request Jul 23, 2020
cmathiesen pushed a commit to ExpediaGroup/iceberg that referenced this pull request Aug 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants