[SPARK-20937][DOCS] Describe spark.sql.parquet.writeLegacyFormat property in Spark SQL, DataFrames and Datasets Guide #22453
@@ -1002,6 +1002,21 @@ Configuration of Parquet can be done using the `setConf` method on `SparkSession`
    </p>
  </td>
</tr>
<tr>
  <td><code>spark.sql.parquet.writeLegacyFormat</code></td>
  <td>false</td>
  <td>
    This configuration indicates whether to use the legacy Parquet format adopted by Spark 1.4
    and prior versions, or the standard format defined in the parquet-format specification, when
    writing Parquet files. This matters not only for compatibility with old Spark versions but
    also for other systems such as Hive, Impala, and Presto. It is especially important for
    decimals: if this configuration is not enabled, decimals are written in an int-based format
    by Spark 1.5 and above, and systems that only support the legacy decimal format (fixed-length
    byte array) will not be able to read what Spark has written. Note that other systems may have
    added support for the standard format in more recent versions, which would make this
    configuration unnecessary. Please
Yeah, I think Hive and Impala also use newer Parquet versions/formats. Isn't it sufficient to say that older versions of Spark (<= 1.4) and older versions of Hive and Impala (do we know which?) use older Parquet formats, and that this enables writing files that way?

I haven't checked closely, but I think Hive still uses binary for decimals (https://github.com/apache/hive/blob/ae008b79b5d52ed6a38875b73025a505725828eb/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/write/DataWritableWriter.java#L503-L541). Given my past investigation, the thing is, Parquet supports both ways of writing decimals (https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#decimal), IIRC. They deprecated the int96-based timestamp (https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift#L782) but not the decimal encodings.

I think it somewhat leads to confusion that we call the option "legacy" when it isn't actually legacy on Parquet's decimal side.

Hive and Impala do NOT support the new Parquet format yet. Presto began to support the new Parquet format in 0.182.
It sounds like it isn't quite a legacy format, but one still used by Hive, and even considered valid, if not current, by Parquet? This part I am not sure of, but I'm basing it on Hyukjin's comment above. I suggest a somewhat shorter text like this; what do you think? Its length would be more suitable as a config doc below. If …

If we must call it "legacy", I'd think of it as a legacy implementation on Spark's side, rather than a legacy format on Parquet's side.
Anyway, it really leads to confusion. I really appreciate your suggestion @srowen to make the doc shorter; the text you suggested is more concise and to the point. One more thing I want to discuss: after investigating the usage of this option, I found that it is not only related to decimals but also to complex types (Array, Map); see the source code referenced below. Should we mention this in the doc?

Lines 450 to 458 in 473d0d8
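To illustrate the complex-type point, here is a minimal sketch (assuming an active `SparkSession` named `spark`, e.g. in spark-shell; the output paths are hypothetical). The same array column is laid out differently in the resulting Parquet files depending on the flag:

```scala
import spark.implicits._

val df = Seq((1, Seq(1, 2, 3)), (2, Seq(4, 5))).toDF("id", "values")

// Standard mode writes arrays with the three-level LIST structure from the
// parquet-format spec (a repeated group "list" containing an "element" field).
spark.conf.set("spark.sql.parquet.writeLegacyFormat", "false")
df.write.mode("overwrite").parquet("/tmp/arrays_standard")

// Legacy mode falls back to the list layout that Spark 1.4 and earlier produced.
spark.conf.set("spark.sql.parquet.writeLegacyFormat", "true")
df.write.mode("overwrite").parquet("/tmp/arrays_legacy")
```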
Let's make it short and get rid of everything orthogonal to the issue itself (I think the issue is specific to decimals). For instance, we could say (based upon Sean's comment): If … Please feel free to change the wording as you think is right.

BTW, let's match the doc in `SQLConf`.

Thanks for your suggestion. I have updated the doc in SQLConf.
    consult the documentation of related systems for details.
  </td>
</tr>
</table>
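For illustration, a minimal sketch of the property in action (assuming an active `SparkSession` named `spark`; the output paths are hypothetical):

```scala
// Default: decimals are written with the int-based physical types of the
// standard parquet-format encoding.
spark.conf.set("spark.sql.parquet.writeLegacyFormat", "false")
spark.range(10)
  .selectExpr("CAST(id AS DECIMAL(10, 2)) AS amount")
  .write.mode("overwrite").parquet("/tmp/decimals_standard")

// Legacy mode: the same column is written as a fixed-length byte array,
// which readers that only understand the older encoding (e.g. some Hive
// and Impala versions, per the discussion below) can consume.
spark.conf.set("spark.sql.parquet.writeLegacyFormat", "true")
spark.range(10)
  .selectExpr("CAST(id AS DECIMAL(10, 2)) AS amount")
  .write.mode("overwrite").parquet("/tmp/decimals_legacy")
```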

## ORC Files
This should go with the other Parquet properties if anything, but this one is so old I don't think it's worth documenting. It shouldn't be used today.
@srowen, actually, this configuration is specifically related to compatibility with other systems like Impala (not only old Spark versions), where decimals are written in a fixed-length binary format (nowadays Spark writes them int-based). If this configuration is not enabled, those systems are unable to read what Spark wrote.

Given https://stackoverflow.com/questions/44279870/why-cant-impala-read-parquet-files-after-spark-sqls-write and JIRAs like SPARK-20297, I think this configuration is kind of important. I even expected more documentation about this configuration specifically in the first place.

Personally, I have been thinking it would be better to keep this configuration after 3.0 as well, for better compatibility.
This is, of course, something we should remove in the long term, but my impression is that it's better to expose it, explicitly mention later that we deprecate it, and then remove it.

I have already argued a bit (for instance in SPARK-20297) to explain how to work around this and why it happens. I was thinking that documenting it would at least reduce that overhead.
I'd like to add my 2 cents. We use both Spark and Hive in our Hadoop/Spark clusters, and we have two types of tables: working tables, which are only used by Spark jobs, and target tables, which are populated by Spark and exposed to downstream jobs, including Hive jobs. Our data engineers frequently run into this issue when they use Hive to read target tables. In the end, we decided to set spark.sql.parquet.writeLegacyFormat=true as the default for target tables and to describe this explicitly in our internal developer guide.
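A sketch of that kind of setup (the app name and Hive support are assumptions, not from this thread):

```scala
import org.apache.spark.sql.SparkSession

// Jobs that populate Hive-facing "target" tables pin the flag at session
// creation time; Spark-only "working" tables keep the default (false).
val spark = SparkSession.builder()
  .appName("populate-target-tables")  // hypothetical job name
  .config("spark.sql.parquet.writeLegacyFormat", "true")
  .enableHiveSupport()
  .getOrCreate()
```

The same effect can be achieved per job with `--conf spark.sql.parquet.writeLegacyFormat=true` on `spark-submit`.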
OK, that sounds important to document. But the reasoning in this thread is also more useful information, I think. Instead of describing it as a legacy format (implying it's not valid Parquet or something) and saying that it's required for Hive and Impala, can we mention or point to the specific reason that would cause you to need this? The value of the documentation here is in whether it helps the user know when to set it one way or the other.
++1 for more information actually.
OK, I will update the doc and describe scenarios and reasons why we need this flag.