Spark, Presto, incremental strategy updates #77

Merged · 8 commits · Apr 3, 2020

Changes from 3 commits
@@ -353,7 +353,7 @@ select * from {{ ref('another_model') }}

## Merge behavior (incremental models)

The `incremental_strategy` config controls how dbt builds incremental models. dbt uses a [merge statement](https://cloud.google.com/bigquery/docs/reference/standard-sql/dml-syntax) on BigQuery to refresh incremental tables.
The [`incremental_strategy` config](configuring-incremental-models#what-is-an-incremental_strategy) controls how dbt builds incremental models. dbt uses a [merge statement](https://cloud.google.com/bigquery/docs/reference/standard-sql/dml-syntax) on BigQuery to refresh incremental tables.

The `incremental_strategy` config can be set to one of two values:
- `merge` (default)
- `insert_overwrite` (optional)
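
For example, a model opting into `insert_overwrite` might be configured as follows — a minimal sketch with illustrative names, assuming the v0.16-style dictionary syntax for `partition_by`:

```sql
{{
  config(
    materialized='incremental',
    partition_by={'field': 'date_day', 'data_type': 'date'},
    incremental_strategy='insert_overwrite'
  )
}}

select ...
```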
@@ -391,6 +391,12 @@ strategy is selected.

### The `insert_overwrite` strategy

<Callout type="info" title="New in dbt v0.16.0">

This functionality is new in dbt v0.16.0. For upgrading instructions, check out [the docs](installation).

</Callout>

The `insert_overwrite` strategy generates a merge statement that replaces entire partitions
in the destination table. **Note:** this configuration requires that the model is configured
with a [Partition clause](#partition-clause). The `merge` statement that dbt generates
@@ -523,37 +529,3 @@ with events as (

... rest of model ...
```

### Configuring incremental strategy

The `incremental_strategy` config can either be specified in specific models, or
for all models in your `dbt_project.yml` file:

<File name='dbt_project.yml'>

```yaml
# Your dbt_project.yml file

models:
  incremental_strategy: "insert_overwrite"
```

</File>

or:

<File name='models/my_model.sql'>

```sql
{{
  config(
    materialized='incremental',
    incremental_strategy='insert_overwrite',
    ...
  )
}}

select ...
```

</File>
@@ -147,3 +147,47 @@

If you add a column to your incremental model, and execute a `dbt run`, this column will _not_ appear in your target table.
Similarly, if you remove a column from your incremental model, and execute a `dbt run`, this column will _not_ be removed from your target table.

Instead, whenever the logic of your incremental model changes, execute a full-refresh run of both your incremental model and any downstream models.
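
For example, assuming a model named `my_incremental_model` (the trailing `+` also selects everything downstream of it):

```shell
dbt run --full-refresh --models my_incremental_model+
```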

## What is an incremental_strategy?

On some adapters, an optional `incremental_strategy` config controls the code that dbt uses
to build incremental models. Strategies vary in effectiveness depending on the volume of data,
the reliability of your `unique_key`, and the availability of certain platform features.

* [Snowflake](snowflake-configs/#merge-behavior-incremental-models): `merge` (default), `delete+insert` (optional)
* [BigQuery](bigquery-configs/#merge-behavior-incremental-models): `merge` (default), `insert_overwrite` (optional)
* [Spark](spark-configs#incremental-models): `insert_overwrite` (default), `merge` (optional, Delta-only)

### Configuring incremental strategy

The `incremental_strategy` config can be specified for individual models, or
for all models in your `dbt_project.yml` file:

<File name='dbt_project.yml'>

```yaml
# Your dbt_project.yml file

models:
  incremental_strategy: "insert_overwrite"
```

</File>

or:

<File name='models/my_model.sql'>

```sql
{{
  config(
    materialized='incremental',
    incremental_strategy='insert_overwrite',
    ...
  )
}}

select ...
```

</File>
@@ -42,41 +42,10 @@ select * from ...

## Merge behavior (incremental models)

The `incremental_strategy` config controls how dbt builds incremental models. By default, dbt will use a [merge statement](https://docs.snowflake.net/manuals/sql-reference/sql/merge.html) on Snowflake to refresh incremental tables.
The [`incremental_strategy` config](configuring-incremental-models#what-is-an-incremental_strategy) controls how dbt builds incremental models. By default, dbt will use a [merge statement](https://docs.snowflake.net/manuals/sql-reference/sql/merge.html) on Snowflake to refresh incremental tables.

Snowflake's `merge` statement fails with a "nondeterministic merge" error if the `unique_key` specified in your model config is not actually unique. If you encounter this error, you can instruct dbt to use a two-step incremental approach by setting the `incremental_strategy` config for your model to `delete+insert`.

This config can either be specified in specific models, or for all models in your `dbt_project.yml` file:

<File name='dbt_project.yml'>

```yaml
# Your dbt_project.yml file

models:
  incremental_strategy: "delete+insert"
```

</File>

or:

<File name='models/my_model.sql'>

```sql
{{
  config(
    materialized='incremental',
    unique_key='id',
    incremental_strategy='delete+insert'
  )
}}

select ...
```

</File>

## Configuring table clustering

dbt supports [table clustering](https://docs.snowflake.net/manuals/user-guide/tables-clustering-keys.html) on Snowflake. To control clustering for a table or incremental model, use the `cluster_by` config. When this configuration is applied, dbt will do two things:
@@ -0,0 +1,123 @@
---
title: "Spark specific configurations"
id: "spark-configs"
---

import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';

## Properties of Spark tables

When materializing a model as `table`, you may include several optional configs:

| Option | Description | Required? | Example |
|---------|----------------------------------------------------|-------------------------|--------------------------|
| file_format | The file format to use when creating tables (`parquet`, `delta`, `csv`, `json`, `text`, `jdbc`, `orc`, `hive` or `libsvm`). | Optional | `parquet`|
| location_root | The created table uses the specified directory to store its data. The table alias is appended to it. | Optional | `/mnt/root` |
| partition_by | Partition the created table by the specified columns. A directory is created for each partition. | Optional | `partition_1` |
| clustered_by | Each partition in the created table will be split into a fixed number of buckets by the specified columns. | Optional | `cluster_1` |
| buckets | The number of buckets to create while clustering. | Required if `clustered_by` is specified | `8` |
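
As a minimal sketch combining several of these options (the model body and column names are illustrative):

```sql
{{ config(
    materialized='table',
    file_format='parquet',
    location_root='/mnt/root',
    partition_by=['date_day'],
    clustered_by=['user_id'],
    buckets=8
) }}

select ...
```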

## Incremental Models

The [`incremental_strategy` config](configuring-incremental-models#what-is-an-incremental_strategy) controls how dbt builds incremental models, and it can be set to one of two values:
- `insert_overwrite` (default)
- `merge` (Delta Lake only)

### The `insert_overwrite` strategy

Apache Spark does not natively support `delete`, `update`, or `merge` statements. As such, [incremental models](configuring-incremental-models) are implemented differently in this plugin than on other adapters. To use incremental models, specify a `partition_by` clause in your model config. dbt will run an `insert overwrite` statement to dynamically overwrite the partitions included in your query. Be sure to re-select _all_ of the relevant data for a partition when using incremental models.

<File name='spark_incremental.sql'>

```sql
{{ config(
    materialized='incremental',
    partition_by=['date_day'],
    file_format='parquet'
) }}

/*
Every partition returned by this query will be overwritten
when this model runs
*/

select
    date_day,
    count(*) as users

from {{ ref('events') }}
where cast(date_day as date) >= '2019-01-01'
group by 1
```

</File>
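
For the model above, the statement dbt runs looks roughly like the following — an approximate sketch (identifiers are illustrative; the exact SQL dbt generates may differ):

```sql
-- Only the partitions returned by the query are replaced. This relies on
-- dynamic partition overwrite (spark.sql.sources.partitionOverwriteMode=DYNAMIC).
insert overwrite table my_schema.spark_incremental
partition (date_day)
select
    date_day,
    count(*) as users
from my_schema.events
where cast(date_day as date) >= '2019-01-01'
group by 1
```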

### The `merge` strategy

<Callout type="info" title="New in dbt-spark v0.15.3"></Callout>

There are three prerequisites for the `merge` incremental strategy:
- the Delta file format
- Databricks Runtime 5.1 or above
- a `unique_key` specified in the model config

dbt will run an atomic `merge` statement which looks nearly identical to the default merge behavior on Snowflake and BigQuery.

<File name='delta_incremental.sql'>

```sql
{{ config(
    materialized='incremental',
    file_format='delta',
    unique_key='user_id',
    incremental_strategy='merge'
) }}

select
    user_id,
    max(date_day) as last_seen

from {{ ref('events') }}
where cast(date_day as date) >= '2019-01-01'
group by 1
```

</File>
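
For reference, the statement dbt generates resembles a standard Delta Lake merge — an approximate sketch with illustrative identifiers:

```sql
-- Rows are matched on the configured unique_key; matched rows are updated,
-- new rows are inserted.
merge into my_schema.delta_incremental as DBT_INTERNAL_DEST
using delta_incremental__dbt_tmp as DBT_INTERNAL_SOURCE
on DBT_INTERNAL_SOURCE.user_id = DBT_INTERNAL_DEST.user_id
when matched then update set *
when not matched then insert *
```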

## Persisting model descriptions

<Callout type="info" title="New in dbt-spark v0.15.3"></Callout>

The `persist_docs` config persists the dbt `description` supplied for a model to the resulting Spark table or view. It is not yet supported for objects other than tables and views.

This config can be specified in the `dbt_project.yml` file, or for a specific model.

<File name='dbt_project.yml'>

```yaml
models:
  # enable docs persistence for all models
  persist_docs:
    relation: true
```

</File>

or:

<File name='models/my_model.sql'>

```sql
{{
  config(persist_docs={"relation": true})
}}

select ...
```

</File>

When the `persist_docs` option is configured appropriately, you'll be able to see your model descriptions
in the `Comment` field of `describe [table] extended` or `show table extended in [database] like '*'`.
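
For example (the schema and model names are illustrative):

```sql
-- the persisted description appears in the Comment field of the output
describe extended my_schema.my_model;
```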
website/docs/docs/supported-databases.md (2 changes: 1 addition & 1 deletion)

@@ -16,7 +16,7 @@ These database plugins are supported by the core dbt maintainers.
| BigQuery | [Profile Setup](profile-bigquery), [Configuration](bigquery-configs) | ✅ Full Support |
| Snowflake | [Profile Setup](profile-snowflake), [Configuration](snowflake-configs) | ✅ Full Support |
| Presto | [Profile Setup](profile-presto) | Partial Support |
| Spark | [Profile Setup](profile-spark) | Partial Support |
| Spark | [Profile Setup](profile-spark), [Configuration](spark-configs) | Partial Support |

## Community Supported dbt Plugins

website/docs/docs/supported-databases/profile-presto.md (4 changes: 3 additions & 1 deletion)

@@ -15,7 +15,9 @@

my-presto-db:
  outputs:
    dev:
      type: presto
      method: none # One of {none | kerberos}
      method: none # optional, one of {none | ldap | kerberos}
      user: [user]
      password: [password] # required if method is ldap or kerberos
      database: [database name]
      host: [hostname]
      port: [port number]