-
Notifications
You must be signed in to change notification settings - Fork 28.7k
Insights: apache/spark
Overview
-
0 Active issues
-
- 0 Merged pull requests
- 32 Open pull requests
- 0 Closed issues
- 0 New issues
Could not load contribution data
Please try again later
32 Pull requests opened by 24 people
-
[SPARK-50686][SQL] Hash to sort aggregation fallback - memory usage optimization
#51290 opened
Jun 26, 2025 -
[WIP][PYTHON] Arrow UDF for aggregation
#51292 opened
Jun 26, 2025 -
Fix-AQE-OOM
#51295 opened
Jun 26, 2025 -
[SPARK-52580][PS] Avoid CAST_INVALID_INPUT of `replace` in ANSI mode
#51297 opened
Jun 26, 2025 -
[SPARK-52407][SQL] Add support for Theta Sketch
#51298 opened
Jun 27, 2025 -
[SPARK-52592][PS] Prevent error when creating ps.Series from ps.Series
#51300 opened
Jun 27, 2025 -
[SPARK-52598][DOCS] Reorganize Spark Connect programming guide
#51305 opened
Jun 27, 2025 -
[SPARK-52588][SQL] Approx_top_k: accumulate, combine, estimate
#51308 opened
Jun 27, 2025 -
[SPARK-52593][PS] Avoid CAST_INVALID_INPUT of `Series.dot` and `DataFrame.dot` in ANSI mode
#51310 opened
Jun 27, 2025 -
[SPARK-52601][SQL] Support primitive types in TransformingEncoder
#51313 opened
Jun 28, 2025 -
[SPARK-46912][CORE] Using correct environment variables on workers of StandAlone cluster
#51314 opened
Jun 28, 2025 -
[MINOR][DOCS] Updated the docstring of DataStreamWriter.foreach() method
#51316 opened
Jun 29, 2025 -
[SPARK-52614][SQL] Support RowEncoder inside Product Encoder
#51319 opened
Jun 30, 2025 -
[SPARK-52615][CORE] Replace File.mkdirs with Utils.createDirectory
#51322 opened
Jun 30, 2025 -
[WIP][SPARK-52622][PS] Avoid CAST_INVALID_INPUT of `DataFrame.melt` in ANSI mode
#51326 opened
Jul 1, 2025 -
[SPARK-52632][SQL] Pretty display V2 write plan nodes
#51332 opened
Jul 1, 2025 -
[SPARK-52634][SQL][DOCS] Update the ANSI compliance page regarding the TIME type
#51333 opened
Jul 1, 2025 -
[WIP] Fix inconsistencies and refactor primitive types in parser
#51335 opened
Jul 1, 2025 -
[SPARK-52635][BUILD][3.5] Upgrade ORC to 1.9.7
#51336 opened
Jul 1, 2025 -
[WIP][SQL][TESTS] Disable stable column aliases in tests if assumed
#51337 opened
Jul 1, 2025 -
[SPARK-52649][SQL] Trim aliases before matching Sort/Having/Filter expressions in `buildAggExprList`
#51339 opened
Jul 1, 2025 -
[SS][SPARK-52637] Fix version ID mismatch issue for RocksDB compaction leading to incorrect file mapping
#51340 opened
Jul 1, 2025 -
[SPARK-52638][SQL] Allow preserving Hive-style column order to be configurable
#51342 opened
Jul 1, 2025 -
[SPARK-52640][SDP] Propagate Python Source Code Location
#51344 opened
Jul 1, 2025 -
[SPARK-52656][SQL] Fix current_time()
#51351 opened
Jul 2, 2025 -
[SPARK-52409][SDP] Only use PipelineRunEventBuffer in tests
#51352 opened
Jul 2, 2025 -
[SPARK-52663][SDP] Introduce name field to pipeline spec
#51353 opened
Jul 3, 2025 -
[SPARK-52660][SQL] Add time type to `CodeGenerator#javaClass`
#51354 opened
Jul 3, 2025 -
[SPARK-52665][BUILD] Fix make-distribution.sh [: missing `]'
#51355 opened
Jul 3, 2025 -
[SPARK-52666][SQL] Map User Defined Type to correct MutableValue in SpecificInternalRow
#51356 opened
Jul 3, 2025
30 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[SPARK-52575][SQL] Introduce contextIndependentFoldable attribute for Expressions
#51282 commented on
Jul 3, 2025 • 27 new comments -
[SPARK-52187][SQL] Introduce Join pushdown for DSv2
#50921 commented on
Jul 1, 2025 • 19 new comments -
[SDP] [SPARK-52576] Drop/recreate on full refresh and MV update
#51280 commented on
Jul 2, 2025 • 9 new comments -
[SPARK-52444][SQL][CONNECT] Add support for Variant/Char/Varchar Literal
#51215 commented on
Jul 1, 2025 • 4 new comments -
[SPARK-52082][PYTHON][DOCS] Improve ExtractPythonUDF docs
#50867 commented on
Jun 27, 2025 • 4 new comments -
[SPARK-52582][SQL] Improve the memory usage of XML parser
#51287 commented on
Jul 3, 2025 • 2 new comments -
[SPARK-49386][SPARK-27734][CORE][SQL] Add memory based thresholds for shuffle spill
#47856 commented on
Jul 3, 2025 • 1 new comment -
[SPARK-51069][SQL] Add big-endian support to UnsafeRowUtils.validateStructuralIntegrityWithReasonImpl
#49773 commented on
Jun 27, 2025 • 1 new comment -
[SPARK-42841][SQL]Assign a name to the error class _LEGACY_ERROR_TEMP_2003
#51111 commented on
Jun 27, 2025 • 1 new comment -
[SPARK-51955] Adding release() to ReadStateStore interface and reusing ReadStore for Streaming Aggregations
#50742 commented on
Jun 27, 2025 • 1 new comment -
[DRAFT][PYTHON] Improve Python UDF Arrow Serializer Performance
#51225 commented on
Jul 1, 2025 • 0 new comments -
[SPARK-52535][SQL] Improve code readability of rule ApplyColumnarRulesAndInsertTransitions
#51227 commented on
Jul 2, 2025 • 0 new comments -
[WIP][SPARK-51224][BUILD] Test Maven 4
#51230 commented on
Jul 3, 2025 • 0 new comments -
[SPARK-51035][BUILD] Upgrade Janino to 3.1.12
#51239 commented on
Jul 1, 2025 • 0 new comments -
[SPARK-52560][BUILD] Bump ap-loader 4.0(v10) to support for async-profiler 4.0
#51257 commented on
Jun 30, 2025 • 0 new comments -
[SPARK-52561][PYTHON][INFRA] Upgrade the minimum version of Python to 3.10
#51259 commented on
Jul 3, 2025 • 0 new comments -
SPARK-52564 configuration changes not require deleting the checkpoint
#51264 commented on
Jun 27, 2025 • 0 new comments -
[SPARK-51885][SQL] Change AnalysisContext.outerPlan from Option[LogicalPlan] to Seq[LogicalPlan]
#51274 commented on
Jun 27, 2025 • 0 new comments -
[CORE] Let LocalSparkContext clear active context in beforeAll
#51284 commented on
Jun 30, 2025 • 0 new comments -
[SPARK-52495][SQL] Allow including partition columns in the single variant column
#51206 commented on
Jul 2, 2025 • 0 new comments -
[SPARK-52486][SQL] Fix Spark Driver Planning OOM issue due to unworthwhile dpp expression before Execution when enabling AQE
#51184 commented on
Jun 30, 2025 • 0 new comments -
[SPARK-47547] BloomFilter fpp degradation
#50933 commented on
Jul 2, 2025 • 0 new comments -
Enable -Xsource:3 compiler flag
#50474 commented on
Jun 28, 2025 • 0 new comments -
[SPARK-46860][CORE]Enable basic authentication when downloading jars and other files from secured repos
#50377 commented on
Jul 3, 2025 • 0 new comments -
[WIP][SPARK-51439] Support SQL UDF with DEFAULT argument
#50373 commented on
Jul 3, 2025 • 0 new comments -
docs: update commet for `nullIntolerant`
#50370 commented on
Jul 3, 2025 • 0 new comments -
[SPARK-51583] [SQL] Improve error message when to_timestamp function has arguments of wrong type
#50347 commented on
Jul 3, 2025 • 0 new comments -
[SPARK-51332][SQL] DS V2 supports push down BIT_AND, BIT_OR, BIT_XOR, BIT_COUNT and BIT_GET
#50097 commented on
Jun 27, 2025 • 0 new comments -
[SPARK-50292] Add MapStatus RowCount optimize skewed job
#48825 commented on
Jun 30, 2025 • 0 new comments -
[SPARK-22876][YARN] Respect YARN AM failure validity interval
#42570 commented on
Jul 3, 2025 • 0 new comments