Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged master branch from apache #1

Merged
merged 2,831 commits into from
May 15, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
2831 commits
Select commit Hold shift + click to select a range
767ea86
[SPARK-34225][CORE][FOLLOWUP] Replace Hadoop's Path with Utils.resolv…
sarutak Apr 15, 2021
71133e1
[SPARK-35070][SQL] TRANSFORM not support alias in inputs
AngersZhuuuu Apr 15, 2021
2cb962b
[MINOR][CORE] Correct the number of started fetch requests in log
Ngone51 Apr 15, 2021
9689c44
[SPARK-34995] Port/integrate Koalas remaining codes into PySpark
itholic Apr 15, 2021
637f593
Revert "[SPARK-34995] Port/integrate Koalas remaining codes into PySp…
HyukjinKwon Apr 15, 2021
ba92de0
[SPARK-34843][SQL][FOLLOWUP] Fix a test failure in OracleIntegrationS…
sarutak Apr 15, 2021
4aee19e
[SPARK-35032][PYTHON] Port Koalas Index unit tests into PySpark
xinrong-meng Apr 15, 2021
3f4c32b
[SPARK-35099][SQL] Convert ANSI interval literals to SQL string in AN…
MaxGekk Apr 16, 2021
345c380
[SPARK-35083][CORE] Support remote scheduler pool files
ulysses-you Apr 16, 2021
95db7e6
[SPARK-35104][SQL] Fix ugly indentation of multiple JSON records in a…
sarutak Apr 16, 2021
91bd384
[SPARK-34995] Port/integrate Koalas remaining codes into PySpark
itholic Apr 16, 2021
2e1e1f8
[MINOR][DOCS] Soften security warning and keep it in cluster manageme…
srowen Apr 17, 2021
94849af
[SPARK-34787][CORE] Option variable in Spark historyServer log should…
echohlne Apr 17, 2021
2bdb26b
[SPARK-35101][INFRA] Add GitHub status check in PR instead of a comment
HyukjinKwon Apr 18, 2021
7f6dee8
[MINOR][INFRA] Upgrade Jira client to 2.0.0
dongjoon-hyun Apr 18, 2021
03191e8
[SPARK-35116][SQL][TESTS] The generated data fits the precision of Da…
beliefer Apr 18, 2021
d04b467
[SPARK-35114][SQL][TESTS] Add checks for ANSI intervals to `LiteralEx…
MaxGekk Apr 18, 2021
12abfe7
[SPARK-34716][SQL] Support ANSI SQL intervals by the aggregate functi…
beliefer Apr 18, 2021
074f770
[SPARK-35115][SQL][TESTS] Check ANSI intervals in `MutableProjectionS…
MaxGekk Apr 18, 2021
978cd0b
[SPARK-35092][UI] the auto-generated rdd's name in the storage tab sh…
echohlne Apr 19, 2021
fd08c93
[SPARK-35109][SQL] Fix minor exception messages of HashedRelation and…
c21 Apr 19, 2021
c8d78a7
[SPARK-34581][SQL] Don't optimize out grouping expressions from aggre…
peter-toth Apr 19, 2021
7a06cdd
[SPARK-35122][SQL] Migrate CACHE/UNCACHE TABLE to use AnalysisOnlyCom…
imback82 Apr 19, 2021
a74f601
[SPARK-31937][SQL] Support processing ArrayType/MapType/StructType da…
AngersZhuuuu Apr 19, 2021
70b606f
[SPARK-35045][SQL][FOLLOW-UP] Add a configuration for CSV input buffe…
HyukjinKwon Apr 19, 2021
8dc455b
[SPARK-34837][SQL] Support ANSI SQL intervals by the aggregate functi…
beliefer Apr 19, 2021
1d1ed3e
[SPARK-35107][SQL] Parse unit-to-unit interval literals to ANSI inter…
MaxGekk Apr 19, 2021
7f34035
[SPARK-34715][SQL][TESTS] Add round trip tests for period <-> month a…
beliefer Apr 19, 2021
425dc58
[SPARK-35125][K8S] Upgrade K8s client to 5.3.0 to support K8s 1.20
dongjoon-hyun Apr 19, 2021
2d161cb
[SPARK-35102][SQL] Make spark.sql.hive.version read-only, not depreca…
yaooqinn Apr 19, 2021
d37d18d
[SPARK-35136] Remove initial null value of LiveStage.info
sander-goos Apr 19, 2021
0c2e9b9
[SPARK-35138][SQL] Remove Antlr4 workaround
pan3793 Apr 19, 2021
dc7d41e
[SPARK-35120][INFRA] Guide users to sync branch and enable GitHub Act…
HyukjinKwon Apr 19, 2021
00f06dd
[SPARK-35131][K8S] Support early driver service clean-up during app t…
dongjoon-hyun Apr 19, 2021
9a6d773
[SPARK-35103][SQL] Make TypeCoercion rules more efficient
sigmod Apr 19, 2021
e55ff83
[SPARK-35117][UI] Change progress bar back to highlight ratio of task…
Kimahriman Apr 20, 2021
bad4b6f
[SPARK-35080][SQL] Only allow a subset of correlated equality predica…
allisonwang-db Apr 20, 2021
f4926d1
[SPARK-35052][SQL] Use static bits for AttributeReference and Literal
sigmod Apr 20, 2021
670c365
[SPARK-35134][BUILD][TESTS] Manually exclude redundant netty jars in …
LuciferYang Apr 20, 2021
aa0d00d
[SPARK-35018][SQL][TESTS] Check transferring of year-month intervals …
MaxGekk Apr 20, 2021
b6bb24c
[SPARK-34974][SQL] Improve subquery decorrelation framework
allisonwang-db Apr 20, 2021
b219e37
[SPARK-35068][SQL] Add tests for ANSI intervals to HiveThriftBinarySe…
AngersZhuuuu Apr 20, 2021
9c956ab
[SPARK-33976][SQL][DOCS] Add a SQL doc page for a TRANSFORM clause
AngersZhuuuu Apr 20, 2021
1e64b4f
[SPARK-34877][CORE][YARN] Add the code change for adding the Spark AM…
SaurabhChawla100 Apr 20, 2021
3614448
[SPARK-34035][SQL] Refactor ScriptTransformation to remove input para…
AngersZhuuuu Apr 20, 2021
eb9a439
[SPARK-34338][SQL] Report metrics from Datasource v2 scan
viirya Apr 20, 2021
e08c40f
[SPARK-35145][SQL] CurrentOrigin should support nested invoking
cloud-fan Apr 20, 2021
83f753e
[SPARK-34472][YARN] Ship ivySettings file to driver in cluster mode
shardulm94 Apr 20, 2021
e8d6992
[SPARK-35153][SQL] Make textual representation of ANSI interval opera…
MaxGekk Apr 20, 2021
c7e18ad
[SPARK-35132][BUILD][CORE] Upgrade netty-all to 4.1.63.Final
LuciferYang Apr 20, 2021
81c3cc2
[SPARK-35044][SQL][FOLLOWUP][TEST-HADOOP2.7] Fix hadoop 2.7 test due …
yaooqinn Apr 21, 2021
d259f93
[SPARK-35113][SQL] Support ANSI intervals in the Hash expression
AngersZhuuuu Apr 21, 2021
97ec57e
[SPARK-35120][INFRA][FOLLOW-UP] Try catch an error to show the correc…
HyukjinKwon Apr 21, 2021
4f309ce
[SPARK-35096][SQL] SchemaPruning should adhere spark.sql.caseSensitiv…
sandeep-katta Apr 21, 2021
43ad939
[SPARK-35152][SQL] ANSI mode: IntegralDivide throws exception on over…
gengliangwang Apr 21, 2021
b6350f5
[SPARK-35142][PYTHON][ML] Fix incorrect return type for `rawPredictio…
harupy Apr 21, 2021
8e9e700
[SPARK-35171][R] Declare the markdown package as a dependency of the …
xuanyuanking Apr 21, 2021
355c399
[SPARK-35140][INFRA] Add error message guidelines to PR template
karenfeng Apr 21, 2021
81dbaed
[SPARK-34692][SQL] Support Not(Int) and Not(InSet) propagate null in …
ulysses-you Apr 21, 2021
e609395
[SPARK-34897][SQL] Support reconcile schemas based on index after nes…
wangyum Apr 21, 2021
6860efe
[SPARK-35178][BUILD] Use new Apache 'closer.lua' syntax to obtain Maven
srowen Apr 22, 2021
548e66c
[SPARK-34692][SQL][FOLLOWUP] Add INSET to ReplaceNullWithFalseInPredi…
viirya Apr 22, 2021
b17a0e6
[SPARK-34674][CORE][K8S] Close SparkContext after the Main method has…
kotlovs Apr 22, 2021
bb5459f
[SPARK-35177][SQL] Fix arithmetic overflow in parsing the minimal int…
AngersZhuuuu Apr 22, 2021
c0972de
[SPARK-35180][BUILD] Allow to build SparkR with SBT
sarutak Apr 22, 2021
7242d7f
[SPARK-35127][UI] When we switch between different stage-detail pages…
echohlne Apr 22, 2021
b22d54a
[SPARK-35026][SQL] Support nested CUBE/ROLLUP/GROUPING SETS in GROUPI…
AngersZhuuuu Apr 22, 2021
7f7a3d8
[SPARK-35183][SQL] Use transformAllExpressions in CombineConcats
sigmod Apr 22, 2021
6c587d2
[SPARK-35110][SQL] Handle ANSI intervals in WindowExecBase
beliefer Apr 22, 2021
04e2305
[SPARK-35187][SQL] Fix failure on the minimal interval literal
AngersZhuuuu Apr 22, 2021
4d2b559
[SPARK-34999][PYTHON] Consolidate PySpark testing utils
xinrong-meng Apr 22, 2021
6ab0048
[SPARK-35182][K8S] Support driver-owned on-demand PVC
dongjoon-hyun Apr 23, 2021
4fcbf59
[SPARK-35040][PYTHON] Remove Spark-version related codes from test codes
xinrong-meng Apr 23, 2021
47f8687
[SPARK-35075][SQL] Add traversal pruning for subquery related rules
sigmod Apr 23, 2021
86238d0
[SPARK-35195][SQL][TEST] Move InMemoryTable etc to org.apache.spark.s…
sunchao Apr 23, 2021
cab205e
[SPARK-35141][SQL] Support two level of hash maps for final hash aggr…
c21 Apr 23, 2021
7582dc8
[SPARK-35143][SQL][SHELL] Add default log level config for spark-sql
hddong Apr 23, 2021
20d68dc
[SPARK-35159][SQL][DOCS] Extract hive format doc
AngersZhuuuu Apr 23, 2021
fdccd88
Revert "[SPARK-34581][SQL] Don't optimize out grouping expressions fr…
cloud-fan Apr 23, 2021
9af338c
[SPARK-35078][SQL] Add tree traversal pruning in expression rules
sigmod Apr 23, 2021
e503b9c
[SPARK-35201][SQL] Format empty grouping set exception in CUBE/ROLLUP
AngersZhuuuu Apr 23, 2021
a9345a0
[SPARK-35204][SQL] CatalystTypeConverters of date/timestamp should ac…
cloud-fan Apr 23, 2021
b2a2b5d
[SPARK-34297][SQL][SS] Add metrics for data loss and offset out range…
viirya Apr 23, 2021
44c1387
[SPARK-35210][BUILD] Upgrade Jetty to 9.4.40 to fix ERR_CONNECTION_RE…
sarutak Apr 24, 2021
166cc62
[SPARK-34990][SQL][TESTS] Add ParquetEncryptionSuite
andersonm-ibm Apr 24, 2021
bcac733
[SPARK-35200][CORE] Avoid to recompute the pending speculative tasks …
weixiuli Apr 24, 2021
1f150b9
[SPARK-35024][ML] Refactor LinearSVC - support virtual centering
zhengruifeng Apr 25, 2021
b108e7f
[SPARK-33913][SS] Upgrade Kafka to 2.8.0
dongjoon-hyun Apr 25, 2021
5b1353f
[SPARK-35168][SQL] mapred.reduce.tasks should be shuffle.partitions n…
yaooqinn Apr 25, 2021
6f782ef
[SPARK-35220][SQL] DayTimeIntervalType/YearMonthIntervalType show dif…
AngersZhuuuu Apr 26, 2021
2d6467d
[SPARK-35087][UI] Some columns in table Aggregated Metrics by Executo…
echohlne Apr 26, 2021
38ef477
[SPARK-32921][SHUFFLE] MapOutputTracker extensions to support push-ba…
venkata91 Apr 26, 2021
d572a85
[SPARK-35224][SQL][TESTS] Fix buffer overflow in `MutableProjectionSu…
MaxGekk Apr 26, 2021
74afc68
[SPARK-35213][SQL] Keep the correct ordering of nested structs in cha…
Kimahriman Apr 26, 2021
c0a3c0c
[SPARK-35088][SQL] Accept ANSI intervals by the Sequence expression
beliefer Apr 26, 2021
84026d7
[SPARK-35223] Add IssueNavigationLink
pan3793 Apr 26, 2021
bdac191
[SPARK-35230][SQL] Move custom metric classes to proper package
viirya Apr 26, 2021
1db031f
[SPARK-35220][DOCS][FOLLOWUP] DayTimeIntervalType/YearMonthIntervalTy…
AngersZhuuuu Apr 26, 2021
c59988a
[SPARK-34638][SQL] Single field nested column prune on generator output
viirya Apr 26, 2021
f009046
[SPARK-33985][SQL][TESTS] Add query test of combine usage of TRANSFOR…
AngersZhuuuu Apr 26, 2021
1b609c7
[SPARK-35060][SQL] Group exception messages in sql/types
beliefer Apr 26, 2021
0df3b50
[SPARK-28247][SS][TEST] Fix flaky test "query without test harness" o…
zsxwing Apr 26, 2021
f738fe0
[SPARK-35227][BUILD] Update the resolver for spark-packages in SparkS…
bozhang2820 Apr 27, 2021
7779fce
[SPARK-35225][SQL] EXPLAIN command should handle empty output of anal…
imback82 Apr 27, 2021
7f51106
[SPARK-26164][SQL] Allow concurrent writers for writing dynamic parti…
c21 Apr 27, 2021
eb08b90
[SPARK-35139][SQL] Support ANSI intervals as Arrow Column vectors
Peng-Lei Apr 27, 2021
c4ad86f
[SPARK-35235][SQL][TEST] Add row-based hash map into aggregate benchmark
c21 Apr 27, 2021
2d2f467
[SPARK-35169][SQL] Fix wrong result of min ANSI interval division by -1
AngersZhuuuu Apr 27, 2021
55dea2d
[SPARK-34837][SQL][FOLLOWUP] Fix division by zero in the avg function…
beliefer Apr 27, 2021
4ff9f1f
[SPARK-35239][SQL] Coalesce shuffle partition should handle empty inp…
ulysses-you Apr 27, 2021
16d223e
[SPARK-35091][SPARK-35090][SQL] Support extract from ANSI Intervals
yaooqinn Apr 27, 2021
592230e
[MINOR][DOCS][ML] Explicit return type of array_to_vector utility fun…
jlafaye Apr 27, 2021
26a8d2f
[SPARK-35238][DOC] Add JindoFS SDK in cloud integration documents
adrian-wang Apr 27, 2021
5b77ebb
[SPARK-35150][ML] Accelerate fallback BLAS with dev.ludovic.netlib
luhenry Apr 27, 2021
0769049
[SPARK-34979][PYTHON][DOC] Add PyArrow installation note for PySpark …
Yikun Apr 28, 2021
abb1f0c
[SPARK-35236][SQL] Support archive files as resources for CREATE FUNC…
sarutak Apr 28, 2021
10c2b68
[SPARK-35244][SQL] Invoke should throw the original exception
cloud-fan Apr 28, 2021
253a1ae
[SPARK-35246][SS] Don't allow streaming-batch intersects
jose-torres Apr 28, 2021
046c8c3
[SPARK-34878][SQL][TESTS] Check actual sizes of year-month and day-ti…
Peng-Lei Apr 28, 2021
56bb815
[SPARK-35085][SQL] Get columns operation should handle ANSI interval …
beliefer Apr 28, 2021
26a5e33
[SPARK-33976][SQL][DOCS][FOLLOWUP] Fix syntax error in select doc page
AngersZhuuuu Apr 28, 2021
8b62c29
[SPARK-35214][SQL] OptimizeSkewedJoin support ShuffledHashJoinExec
ulysses-you Apr 28, 2021
0bcf348
[SPARK-34781][SQL][FOLLOWUP] Adjust the order of AQE optimizer rules
ulysses-you Apr 28, 2021
86d3bb5
[SPARK-34981][SQL] Implement V2 function resolution and evaluation
sunchao Apr 28, 2021
403e479
[SPARK-35244][SQL][FOLLOWUP] Add null check for the exception cause
cloud-fan Apr 29, 2021
74b9326
[SPARK-35135][CORE] Turn the `WritablePartitionedIterator` from a tra…
LuciferYang Apr 29, 2021
7713565
[SPARK-34786][SQL][FOLLOWUP] Explicitly declare DecimalType(20, 0) fo…
yaooqinn Apr 29, 2021
529b875
[SPARK-35226][SQL] Support refreshKrb5Config option in JDBC datasources
sarutak Apr 29, 2021
132cbf0
[SPARK-35105][SQL] Support multiple paths for ADD FILE/JAR/ARCHIVE co…
sarutak Apr 29, 2021
068b6c8
[SPARK-35234][CORE] Reserve the format of stage failureMessage
Ngone51 Apr 29, 2021
7b78e34
[SPARK-35269][BUILD] Upgrade commons-lang3 to 3.12.0
LuciferYang Apr 29, 2021
4e3daa5
[SPARK-35254][BUILD] Upgrade SBT to 1.5.1
Apr 29, 2021
738cf7f
[SPARK-35009][CORE] Avoid creating multiple python worker monitor thr…
attilapiros Apr 29, 2021
8a5af37
[SPARK-35268][BUILD] Upgrade GenJavadoc to 0.17
sarutak Apr 29, 2021
e8bf8fe
[SPARK-35047][SQL] Allow Json datasources to write non-ascii characte…
sarutak Apr 29, 2021
77e9152
[SPARK-35255][BUILD] Automated formatting for Scala Code for Blank Lines
Apr 30, 2021
ac8813e
[SPARK-35277][BUILD] Upgrade snappy to 1.1.8.4
williamhyun Apr 30, 2021
11ea255
[SPARK-35111][SQL] Support Cast string to year-month interval
AngersZhuuuu Apr 30, 2021
39889df
[SPARK-35264][SQL] Support AQE side broadcastJoin threshold
ulysses-you Apr 30, 2021
4e8701a
[SPARK-35280][K8S] Promote KubernetesUtils to DeveloperApi
dongjoon-hyun Apr 30, 2021
72e238a
[SPARK-35273][SQL] CombineFilters support non-deterministic expressions
wangyum May 1, 2021
6ce1b16
[SPARK-35278][SQL] Invoke should find the method with correct number …
viirya May 1, 2021
cfc0495
[SPARK-34581][SQL] Don't optimize out grouping expressions from aggre…
peter-toth May 2, 2021
caa46ce
[SPARK-35112][SQL] Support Cast string to day-second interval
AngersZhuuuu May 2, 2021
cd689c9
[SPARK-35192][SQL][TESTS] Port minimal TPC-DS datagen code from datab…
maropu May 3, 2021
335f00b
[SPARK-35285][SQL] Parse ANSI interval types in SQL schema
MaxGekk May 3, 2021
2a8d7ed
[SPARK-35281][SQL] StaticInvoke should not apply boxing if return typ…
sunchao May 3, 2021
44b7931
[SPARK-35176][PYTHON] Standardize input validation error type
Yikun May 3, 2021
be6ecb6
[SPARK-35266][TESTS] Fix error in BenchmarkBase.scala that occurs whe…
May 3, 2021
54e0aa1
[MINOR][SS][DOCS] Fix a typo in the documentation of GroupState
Dobiasd May 3, 2021
8aaa9e8
[SPARK-35250][SQL][DOCS] Fix duplicated STOP_AT_DELIMITER to SKIP_VAL…
HyukjinKwon May 3, 2021
176218b
[SPARK-35292][PYTHON] Delete redundant parameter in mypy configuration
garawalid May 4, 2021
120c389
[SPARK-34887][PYTHON] Port Koalas dependencies into PySpark
xinrong-meng May 4, 2021
5ecb112
[SPARK-35300][PYTHON][DOCS] Standardize module names in install.rst
xinrong-meng May 4, 2021
a2927cb
[SPARK-35302][INFRA] Benchmark workflow should create new files for n…
HyukjinKwon May 4, 2021
9b387a1
[SPARK-35308][TESTS] Fix bug in SPARK-35266 that creates benchmark fi…
May 4, 2021
7fd3f8f
[SPARK-35294][SQL] Add tree traversal pruning in rules with dedicated…
sigmod May 4, 2021
f550e03
[SPARK-34794][SQL] Fix lambda variable name issues in nested DataFram…
May 5, 2021
bbdbe0f
[SPARK-34854][SQL][SS] Expose source metrics via progress report and …
yijiacui-db May 5, 2021
4fe4b65
[SPARK-35315][TESTS] Keep benchmark result consistent between spark-s…
sunchao May 5, 2021
7970318
[SPARK-35155][SQL] Add rule id pruning to Analyzer rules
sigmod May 6, 2021
0126924
[SPARK-35323][BUILD] Remove unused libraries from LICENSE-binary
dongjoon-hyun May 6, 2021
a0c76a8
[SPARK-35319][K8S][BUILD] Upgrade K8s client to 5.3.1
dongjoon-hyun May 6, 2021
19661f6
[SPARK-35325][SQL][TESTS] Add nested column ORC encryption test case
dongjoon-hyun May 6, 2021
5c67d0c
[SPARK-35293][SQL][TESTS] Use the newer dsdgen for TPCDSQueryTestSuite
maropu May 6, 2021
3f5a209
[SPARK-35318][SQL] Hide internal view properties for describe table cmd
linhongliu-db May 6, 2021
c6d3f37
[SPARK-35240][SS] Use CheckpointFileManager for checkpoint file manip…
viirya May 6, 2021
6cd5cf5
[SPARK-35215][SQL] Update custom metric per certain rows and at the e…
viirya May 6, 2021
dfb3343
[SPARK-34526][SS] Ignore the error when checking the path in FileStre…
xuanyuanking May 6, 2021
bb93547
[SPARK-35326][BUILD] Upgrade Jersey to 2.34
sarutak May 6, 2021
482b43d
[SPARK-35326][BUILD][FOLLOWUP] Update dependency manifest files
dongjoon-hyun May 6, 2021
e834ef7
[SPARK-35293][SQL][TESTS][FOLLOWUP] Update the hash key to refresh TP…
maropu May 6, 2021
94bbca3
[SPARK-35306][MLLIB][TESTS] Add benchmark results for BLASBenchmark c…
May 7, 2021
42f59ca
[SPARK-35133][SQL] Explain codegen works with AQE
c21 May 7, 2021
33c1034
[SPARK-34701][SQL][FOLLOW-UP] Children/innerChildren should be mutual…
imback82 May 7, 2021
e83910f
[SPARK-26164][SQL][FOLLOWUP] WriteTaskStatsTracker should know which …
cloud-fan May 7, 2021
cf2c4ba
[SPARK-35020][SQL] Group exception messages in catalyst/util
beliefer May 7, 2021
9aa18df
[SPARK-35333][SQL] Skip object null check in Invoke if possible
cloud-fan May 7, 2021
72d3266
[SPARK-35144][SQL] Migrate to transformWithPruning for object rules
sigmod May 7, 2021
d3b92ee
[SPARK-35021][SQL] Group exception messages in connector/catalog
beliefer May 7, 2021
2634dba
[SPARK-35175][BUILD] Add linter for JavaScript source files
sarutak May 7, 2021
6f0ef93
[SPARK-35297][CORE][DOC][MINOR] Modify the comment about the executor
May 7, 2021
33fbf56
[SPARK-35288][SQL] StaticInvoke should find the method without exact …
viirya May 7, 2021
b4ec9e2
[SPARK-35321][SQL] Don't register Hive permanent functions when creat…
sunchao May 7, 2021
f47e0f8
[SPARK-35261][SQL] Support static magic method for stateless Java Sca…
sunchao May 8, 2021
323a6e8
[SPARK-35232][SQL] Nested column pruning should retain column metadata
sunchao May 8, 2021
b025780
[SPARK-35331][SQL] Support resolving missing attrs for distribute/clu…
yaooqinn May 8, 2021
06c4009
[SPARK-35327][SQL][TESTS] Filters out the TPC-DS queries that can cau…
maropu May 8, 2021
e31bef1
Revert "[SPARK-35321][SQL] Don't register Hive permanent functions wh…
dongjoon-hyun May 8, 2021
5b65d8a
[SPARK-35347][SQL] Use MethodUtils for looking up methods in Invoke a…
viirya May 8, 2021
620f072
[SPARK-35231][SQL] logical.Range override maxRowsPerPartition
zhengruifeng May 9, 2021
38eb5a6
[SPARK-35354][SQL] Replace BaseJoinExec with ShuffledJoin in Coalesce…
c21 May 10, 2021
2c8ced9
[SPARK-35111][SPARK-35112][SQL][FOLLOWUP] Rename ANSI interval patter…
AngersZhuuuu May 10, 2021
245dce1
[SPARK-35261][SQL][TESTS][FOLLOW-UP] Change failOnError to false for …
sunchao May 10, 2021
20d3224
[SPARK-35358][BUILD] Increase maximum Java heap used for release buil…
viirya May 10, 2021
d808956
[MINOR][INFRA] Add python/.idea into git ignore
HyukjinKwon May 10, 2021
7182f8c
[SPARK-35360][SQL] RepairTableCommand respects `spark.sql.addPartitio…
AngersZhuuuu May 10, 2021
d2a535f
[SPARK-34246][FOLLOWUP] Change the definition of `findTightestCommonT…
gengliangwang May 10, 2021
8b94eff
[SPARK-34736][K8S][TESTS] Kubernetes and Minikube version upgrade for…
attilapiros May 10, 2021
44bd0a8
[SPARK-35088][SQL][FOLLOWUP] Improve the error message for Sequence e…
beliefer May 11, 2021
c4ca232
[SPARK-35363][SQL] Refactor sort merge join code-gen be agnostic to j…
c21 May 11, 2021
7c9a9ec
[SPARK-35146][SQL] Migrate to transformWithPruning or resolveWithPrun…
sigmod May 11, 2021
2b6640a
[SPARK-35229][WEBUI] Limit the maximum number of items on the timelin…
sarutak May 11, 2021
b59d5ab
[SPARK-35372][BUILD] Increase stack size for Scala compilation in Mav…
HyukjinKwon May 11, 2021
af0d99c
[SPARK-35375][INFRA] Use Jinja2 < 3.0.0 for Python linter dependency …
sarutak May 12, 2021
78221bd
[SPARK-35361][SQL] Improve performance for ApplyFunctionExpression
sunchao May 12, 2021
a189be8
[MINOR][DOCS] Avoid some python docs where first sentence has "e.g." …
srowen May 12, 2021
7e3446a
[SPARK-35377][INFRA] Add JS linter to GA
sarutak May 12, 2021
ecb48cc
[SPARK-35381][R] Fix lambda variable name issues in nested higher ord…
HyukjinKwon May 12, 2021
82c520a
[SPARK-35243][SQL] Support columnar execution on ANSI interval types
Peng-Lei May 12, 2021
d92018e
[SPARK-35298][SQL] Migrate to transformWithPruning for rules in Optim…
sigmod May 12, 2021
ed05954
[SPARK-29145][SQL][FOLLOWUP] Clean up code about support sub-queries …
AngersZhuuuu May 12, 2021
402375b
[SPARK-35357][GRAPHX] Allow to turn off the normalization applied by …
ebonnal May 12, 2021
101b0cc
[SPARK-35253][SQL][BUILD] Bump up the janino version to v3.1.4
maropu May 12, 2021
b52d47a
[SPARK-35295][ML] Replace fully com.github.fommil.netlib by dev.ludov…
luhenry May 12, 2021
7bcaded
[SPARK-35349][SQL] Add code-gen for left/right outer sort merge join
c21 May 12, 2021
f156a95
[SPARK-35347][SQL][FOLLOWUP] Throw exception with an explicit excepti…
viirya May 12, 2021
dac6f17
[SPARK-35387][INFRA] Increase the JVM stack size for Java 11 build test
gengliangwang May 12, 2021
77b7fe1
[SPARK-35383][CORE] Improve s3a magic committer support by inferring …
dongjoon-hyun May 12, 2021
bc95c3a
[SPARK-35361][SQL][FOLLOWUP] Switch to use while loop
sunchao May 12, 2021
b3c916e
[SPARK-35013][CORE] Don't allow to set spark.driver.cores=0
shahidki31 May 12, 2021
ae0579a
[SPARK-35369][DOC] Document ExecutorAllocationManager metrics
LucaCanali May 12, 2021
3241aeb
[SPARK-35385][SQL][TESTS] Skip duplicate queries in the TPCDS-related…
maropu May 13, 2021
c0b52da
[SPARK-35388][INFRA] Allow the PR source branch to include slashes
ueshin May 13, 2021
0ab9bd7
[SPARK-35384][SQL] Improve performance for InvokeLike.invoke
sunchao May 13, 2021
17b59a9
[SPARK-35382][PYTHON] Fix lambda variable name issues in nested DataF…
ueshin May 13, 2021
dd54649
[SPARK-35394][K8S][BUILD] Move kubernetes-client.version to root pom …
dongjoon-hyun May 13, 2021
5181543
[SPARK-35380][SQL] Loading SparkSessionExtensions from ServiceLoader
yaooqinn May 13, 2021
c1e995a
[SPARK-35350][SQL] Add code-gen for left semi sort merge join
c21 May 13, 2021
d1b8bd7
[SPARK-34720][SQL] MERGE ... UPDATE/INSERT * should do by-name resolu…
cloud-fan May 13, 2021
b6d57b6
[SPARK-34637][SQL] Support DPP + AQE when the broadcast exchange can …
JkSelf May 13, 2021
f7704ec
[SPARK-35392][ML][PYTHON] Fix flaky tests in ml/clustering.py and ml/…
zhengruifeng May 13, 2021
6c5fcac
[SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn
srowen May 13, 2021
02c99f1
[SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE
gengliangwang May 13, 2021
6f63057
[SPARK-35332][SQL] Make cache plan disable configs configurable
ulysses-you May 13, 2021
c2e15cc
[SPARK-35062][SQL] Group exception messages in sql/streaming
beliefer May 13, 2021
6aa2594
[SPARK-35366][SQL] Avoid using deprecated `buildForBatch` and `buildF…
linhongliu-db May 13, 2021
7d371d2
[SPARK-35393][PYTHON][INFRA][TESTS] Recover pip packaging test in Git…
HyukjinKwon May 13, 2021
6a949d1
[SPARK-35397][SQL] Replace sys.err usage with explicit exception type
viirya May 13, 2021
160b3be
[SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI
holdenk May 13, 2021
8fa739f
[SPARK-35329][SQL] Split generated switch code into pieces in ExpandExec
maropu May 14, 2021
b6a0a7e
[SPARK-35311][SS][UI][DOCS] Structured Streaming Web UI state informa…
gaborgsomogyi May 14, 2021
f7af9ab
[SPARK-34764][UI][FOLLOW-UP] Fix indentation and missing arguments fo…
HyukjinKwon May 14, 2021
9ea55fe
[SPARK-35207][SQL] Normalize hash function behavior with negative zer…
planga82 May 14, 2021
d424771
[MINOR][DOC] ADD toc for monitoring page
yaooqinn May 14, 2021
6218bc5
[SPARK-35332][SQL][FOLLOWUP] Refine wrong comment
ulysses-you May 14, 2021
68239d1
[SPARK-35404][CORE] Name the timers in TaskSchedulerImpl
yaooqinn May 14, 2021
94bd480
[SPARK-35206][TESTS][SQL] Extract common used get project path into a…
Ngone51 May 14, 2021
d2fbf0d
[SPARK-35405][DOC] Submitting Applications documentation has outdated…
o-shevchenko May 14, 2021
a8032e7
[SPARK-35384][SQL][FOLLOWUP] Move `HashMap.get` out of `InvokeLike.in…
sunchao May 14, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
2 changes: 1 addition & 1 deletion .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@
# See the License for the specific language governing permissions and
# limitations under the License.

# https://cwiki.apache.org/confluence/display/INFRA/.asf.yaml+features+for+git+repositories
# https://cwiki.apache.org/confluence/display/INFRA/git+-+.asf.yaml+features
---
github:
description: "Apache Spark - A unified analytics engine for large-scale data processing"
Expand Down
2 changes: 2 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,8 @@ Thanks for sending a pull request! Here are some tips for you:
6. If possible, provide a concise example to reproduce the issue for a faster review.
7. If you want to add a new configuration, please read the guideline first for naming configurations in
'core/src/main/scala/org/apache/spark/internal/config/ConfigEntry.scala'.
8. If you want to add or modify an error message, please read the guideline first:
https://spark.apache.org/error-message-guidelines.html
-->

### What changes were proposed in this pull request?
Expand Down
133 changes: 0 additions & 133 deletions .github/autolabeler.yml

This file was deleted.

152 changes: 152 additions & 0 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,152 @@
#
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
#

#
# Pull Request Labeler Github Action Configuration: https://github.com/marketplace/actions/labeler
#
# Note that we currently cannot use the negatioon operator (i.e. `!`) for miniglob matches as they
# would match any file that doesn't touch them. What's needed is the concept of `any `, which takes a
# list of constraints / globs and then matches all of the constraints for either `any` of the files or
# `all` of the files in the change set.
#
# However, `any`/`all` are not supported in a released version and testing off of the `main` branch
# resulted in some other errors when testing.
#
# An issue has been opened upstream requesting that a release be cut that has support for all/any:
# - https://github.com/actions/labeler/issues/111
#
# While we wait for this issue to be handled upstream, we can remove
# the negated / `!` matches for now and at least have labels again.
#
INFRA:
- ".github/**/*"
- "appveyor.yml"
- "tools/**/*"
- "dev/create-release/**/*"
- ".asf.yaml"
- ".gitattributes"
- ".gitignore"
- "dev/github_jira_sync.py"
- "dev/merge_spark_pr.py"
- "dev/run-tests-jenkins*"
BUILD:
# Can be supported when a stable release with correct all/any is released
#- any: ['dev/**/*', '!dev/github_jira_sync.py', '!dev/merge_spark_pr.py', '!dev/.rat-excludes']
- "dev/**/*"
- "build/**/*"
- "project/**/*"
- "assembly/**/*"
- "**/*pom.xml"
- "bin/docker-image-tool.sh"
- "bin/find-spark-home*"
- "scalastyle-config.xml"
# These can be added in the above `any` clause (and the /dev/**/* glob removed) when
# `any`/`all` support is released
# - "!dev/github_jira_sync.py"
# - "!dev/merge_spark_pr.py"
# - "!dev/run-tests-jenkins*"
# - "!dev/.rat-excludes"
DOCS:
- "docs/**/*"
- "**/README.md"
- "**/CONTRIBUTING.md"
EXAMPLES:
- "examples/**/*"
- "bin/run-example*"
# CORE needs to be updated when all/any are released upstream.
CORE:
# - any: ["core/**/*", "!**/*UI.scala", "!**/ui/**/*"] # If any file matches all of the globs defined in the list started by `any`, label is applied.
- "core/**/*"
- "common/kvstore/**/*"
- "common/network-common/**/*"
- "common/network-shuffle/**/*"
- "python/pyspark/**/*.py"
- "python/pyspark/tests/**/*.py"
SPARK SUBMIT:
- "bin/spark-submit*"
SPARK SHELL:
- "repl/**/*"
- "bin/spark-shell*"
SQL:
#- any: ["**/sql/**/*", "!python/pyspark/sql/avro/**/*", "!python/pyspark/sql/streaming.py", "!python/pyspark/sql/tests/test_streaming.py"]
- "**/sql/**/*"
- "common/unsafe/**/*"
#- "!python/pyspark/sql/avro/**/*"
#- "!python/pyspark/sql/streaming.py"
#- "!python/pyspark/sql/tests/test_streaming.py"
- "bin/spark-sql*"
- "bin/beeline*"
- "sbin/*thriftserver*.sh"
- "**/*SQL*.R"
- "**/DataFrame.R"
- "**/*WindowSpec.R"
- "**/*catalog.R"
- "**/*column.R"
- "**/*functions.R"
- "**/*group.R"
- "**/*schema.R"
- "**/*types.R"
AVRO:
- "external/avro/**/*"
- "python/pyspark/sql/avro/**/*"
DSTREAM:
- "streaming/**/*"
- "data/streaming/**/*"
- "external/kinesis*"
- "external/kafka*"
- "python/pyspark/streaming/**/*"
GRAPHX:
- "graphx/**/*"
- "data/graphx/**/*"
ML:
- "**/ml/**/*"
- "**/*mllib_*.R"
MLLIB:
- "**/spark/mllib/**/*"
- "mllib-local/**/*"
- "python/pyspark/mllib/**/*"
STRUCTURED STREAMING:
- "**/sql/**/streaming/**/*"
- "external/kafka-0-10-sql/**/*"
- "python/pyspark/sql/streaming.py"
- "python/pyspark/sql/tests/test_streaming.py"
- "**/*streaming.R"
PYTHON:
- "bin/pyspark*"
- "**/python/**/*"
R:
- "**/r/**/*"
- "**/R/**/*"
- "bin/sparkR*"
YARN:
- "resource-managers/yarn/**/*"
MESOS:
- "resource-managers/mesos/**/*"
- "sbin/*mesos*.sh"
KUBERNETES:
- "resource-managers/kubernetes/**/*"
WINDOWS:
- "**/*.cmd"
- "R/pkg/tests/fulltests/test_Windows.R"
WEB UI:
- "**/ui/**/*"
- "**/*UI.scala"
DEPLOY:
- "sbin/**/*"

Loading