Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Master #8767

Closed
wants to merge 633 commits into from
Closed

Master #8767

Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
633 commits
Select commit Hold shift + click to select a range
7478c8b
[SPARK-9791] [PACKAGE] Change private class to private class to preve…
tdas Aug 24, 2015
9ce0c7a
[SPARK-7710] [SPARK-7998] [DOCS] Docs for DataFrameStatFunctions
brkyvz Aug 24, 2015
662bb96
[SPARK-10144] [UI] Actually show peak execution memory by default
Aug 24, 2015
a2f4cdc
[SPARK-8580] [SQL] Refactors ParquetHiveCompatibilitySuite and adds m…
liancheng Aug 24, 2015
cb2d2e1
[SPARK-9758] [TEST] [SQL] Compilation issue for hive test / wrong pac…
srowen Aug 24, 2015
13db11c
[SPARK-10061] [DOC] ML ensemble docs
jkbradley Aug 24, 2015
d7b4c09
[SPARK-10190] Fix NPE in CatalystTypeConverters Decimal toScala conve…
JoshRosen Aug 24, 2015
2bf338c
[SPARK-10165] [SQL] Await child resolution in ResolveFunctions
marmbrus Aug 25, 2015
6511bf5
[SPARK-10118] [SPARKR] [DOCS] Improve SparkR API docs for 1.5 release
yu-iskw Aug 25, 2015
642c43c
[SQL] [MINOR] [DOC] Clarify docs for inferring DataFrame from RDD of …
Aug 25, 2015
a0c0aae
[SPARK-10121] [SQL] Thrift server always use the latest class loader …
yhuai Aug 25, 2015
5175ca0
[SPARK-10178] [SQL] HiveComparisionTest should print out dependent ta…
marmbrus Aug 25, 2015
d9c25de
[SPARK-9786] [STREAMING] [KAFKA] fix backpressure so it works with defa…
koeninger Aug 25, 2015
f023aa2
[SPARK-10137] [STREAMING] Avoid to restart receivers if scheduleRecei…
zsxwing Aug 25, 2015
df7041d
[SPARK-10196] [SQL] Correctly saving decimals in internal rows to JSON.
yhuai Aug 25, 2015
bf03fe6
[SPARK-10136] [SQL] A more robust fix for SPARK-10136
liancheng Aug 25, 2015
82268f0
[SPARK-9293] [SPARK-9813] Analysis should check that set operations a…
JoshRosen Aug 25, 2015
d4549fe
[SPARK-10214] [SPARKR] [DOCS] Improve SparkR Column, DataFrame API docs
yu-iskw Aug 25, 2015
57b960b
[SPARK-6196] [BUILD] Remove MapR profiles in favor of hadoop-provided
srowen Aug 25, 2015
1fc3758
[SPARK-10210] [STREAMING] Filter out non-existent blocks before creat…
tdas Aug 25, 2015
2f493f7
[SPARK-10177] [SQL] fix reading Timestamp in parquet from Hive
Aug 25, 2015
7bc9a8c
[SPARK-10195] [SQL] Data sources Filter should not expose internal types
JoshRosen Aug 25, 2015
0e6368f
[SPARK-10197] [SQL] Add null check in wrapperFor (inside HiveInspecto…
yhuai Aug 25, 2015
5c14890
[DOC] add missing parameters in SparkContext.scala for scala doc
liyezhang556520 Aug 25, 2015
7f1e507
Fixed a typo in DAGScheduler.
zzvara Aug 25, 2015
69c9c17
[SPARK-9613] [CORE] Ban use of JavaConversions and migrate all existi…
srowen Aug 25, 2015
5c08c86
[SPARK-10198] [SQL] Turn off partition verification by default
marmbrus Aug 25, 2015
b37f0cc
[SPARK-8531] [ML] Update ML user guide for MinMaxScaler
hhbyyh Aug 25, 2015
881208a
[SPARK-10230] [MLLIB] Rename optimizeAlpha to optimizeDocConcentration
Aug 25, 2015
16a2be1
[SPARK-10231] [MLLIB] update @Since annotation for mllib.classification
mengxr Aug 25, 2015
71a138c
[SPARK-10048] [SPARKR] Support arbitrary nested Java array in serde.
Aug 25, 2015
c0e9ff1
[SPARK-9800] Adds docs for GradientDescent$.runMiniBatchSGD alias
Aug 25, 2015
c619c75
[SPARK-10237] [MLLIB] update since versions in mllib.fpm
mengxr Aug 25, 2015
9205907
[SPARK-9797] [MLLIB] [DOC] StreamingLinearRegressionWithSGD.setConver…
Aug 25, 2015
00ae4be
[SPARK-10239] [SPARK-10244] [MLLIB] update since versions in mllib.pm…
mengxr Aug 25, 2015
ec89bd8
[SPARK-10245] [SQL] Fix decimal literals with precision < scale
Aug 25, 2015
7467b52
[SPARK-10215] [SQL] Fix precision of division (follow the rule in Hive)
Aug 25, 2015
125205c
[SPARK-9888] [MLLIB] User guide for new LDA features
Aug 26, 2015
8668ead
[SPARK-10233] [MLLIB] update since version in mllib.evaluation
mengxr Aug 26, 2015
ab431f8
[SPARK-10238] [MLLIB] update since versions in mllib.linalg
mengxr Aug 26, 2015
c3a5484
[SPARK-10240] [SPARK-10242] [MLLIB] update since versions in mlilb.ra…
mengxr Aug 26, 2015
d703372
[SPARK-10234] [MLLIB] update since version in mllib.clustering
mengxr Aug 26, 2015
fb7e12f
[SPARK-10243] [MLLIB] update since versions in mllib.tree
mengxr Aug 26, 2015
4657fa1
[SPARK-10235] [MLLIB] update since versions in mllib.regression
mengxr Aug 26, 2015
321d775
[SPARK-10236] [MLLIB] update since versions in mllib.feature
mengxr Aug 26, 2015
75d4773
[SPARK-9316] [SPARKR] Add support for filtering using `[` (synonym fo…
felixcheung Aug 26, 2015
bb16405
Closes #8443
rxin Aug 26, 2015
6519fd0
[SPARK-9665] [MLLIB] audit MLlib API annotations
mengxr Aug 26, 2015
de7209c
HOTFIX: Increase PRB timeout
pwendell Aug 26, 2015
086d468
[SPARK-10241] [MLLIB] update since versions in mllib.recommendation
mengxr Aug 26, 2015
d41d6c4
[SPARK-10305] [SQL] fix create DataFrame from Python class
Aug 26, 2015
ad7f0f1
[SPARK-10308] [SPARKR] Add %in% to the exported namespace
shivaram Aug 27, 2015
773ca03
[MINOR] [SPARKR] Fix some validation problems in SparkR
yu-iskw Aug 27, 2015
0fac144
[SPARK-9424] [SQL] Parquet programming guide updates for 1.5
liancheng Aug 27, 2015
ce97834
[SPARK-9964] [PYSPARK] [SQL] PySpark DataFrameReader accept RDD of St…
yanboliang Aug 27, 2015
e936cf8
[SPARK-10219] [SPARKR] Fix varargsToEnv and add test case
shivaram Aug 27, 2015
de02782
[SPARK-10251] [CORE] some common types are not registered for Kryo Se…
Aug 27, 2015
9625d13
[DOCS] [STREAMING] [KAFKA] Fix typo in exactly once semantics
moutai Aug 27, 2015
1650f6f
[SPARK-10254] [ML] Removes Guava dependencies in spark.ml.feature Jav…
Aug 27, 2015
75d6230
[SPARK-10255] [ML] Removes Guava dependencies from spark.ml.param Jav…
Aug 27, 2015
1a446f7
[SPARK-10256] [ML] Removes guava dependency from spark.ml.classificat…
Aug 27, 2015
b02e818
[SPARK-9613] [HOTFIX] Fix usage of JavaConverters removed in Scala 2.11
jaceklaskowski Aug 27, 2015
e1f4de4
[SPARK-10257] [MLLIB] Removes Guava from all spark.mllib Java tests
Aug 27, 2015
fdd466b
[SPARK-10182] [MLLIB] GeneralizedLinearModel doesn't unpersist cached…
SlavikBaranov Aug 27, 2015
dc86a22
[SPARK-9148] [SPARK-10252] [SQL] Update SQL Programming Guide
marmbrus Aug 27, 2015
84baa5e
[SPARK-10315] remove document on spark.akka.failure-detector.threshold
CodingCat Aug 27, 2015
6185cdd
[SPARK-9901] User guide for RowMatrix Tall-and-skinny QR
hhbyyh Aug 27, 2015
c94ecdf
[SPARK-9906] [ML] User guide for LogisticRegressionSummary
MechCoder Aug 27, 2015
5bfe9e1
[SPARK-9680] [MLLIB] [DOC] StopWordsRemovers user guide and Java comp…
Aug 27, 2015
b3dd569
[SPARK-10287] [SQL] Fixes JSONRelation refreshing on read path
yhuai Aug 27, 2015
54cda0d
[SPARK-10321] sizeInBytes in HadoopFsRelation
Aug 27, 2015
1f90c5e
[SPARK-8505] [SPARKR] Add settings to kick `lint-r` from `./dev/run-t…
yu-iskw Aug 28, 2015
30734d4
[SPARK-9911] [DOC] [ML] Update Userguide for Evaluator
MechCoder Aug 28, 2015
af0e124
[SPARK-9905] [ML] [DOC] Adds LinearRegressionSummary user guide
Aug 28, 2015
89b9434
[SPARK-SQL] [MINOR] Fixes some typos in HiveContext
liancheng Aug 28, 2015
7583681
[SPARK-10188] [PYSPARK] Pyspark CrossValidator with RMSE selects inco…
noel-smith Aug 28, 2015
2f99c37
[SPARK-10328] [SPARKR] Fix generic for na.omit
shivaram Aug 28, 2015
4eeda8d
[SPARK-10260] [ML] Add @Since annotation to ml.clustering
yu-iskw Aug 28, 2015
cc39803
[SPARK-10295] [CORE] Dynamic allocation in Mesos does not release whe…
srowen Aug 28, 2015
18294cd
Fix DynamodDB/DynamoDB typo in Kinesis Integration doc
yosssi Aug 28, 2015
71a077f
typo in comment
dharmeshkakadia Aug 28, 2015
1502a0f
[YARN] [MINOR] Avoid hard code port number in YarnShuffleService test
jerryshao Aug 28, 2015
e2a8430
[SPARK-9890] [DOC] [ML] User guide for CountVectorizer
hhbyyh Aug 28, 2015
499e8e1
[SPARK-8952] [SPARKR] - Wrap normalizePath calls with suppressWarnings
lresende Aug 28, 2015
d3f87dc
[SPARK-10325] Override hashCode() for public Row
JoshRosen Aug 28, 2015
c53c902
[SPARK-9284] [TESTS] Allow all tests to run without an assembly.
Aug 28, 2015
4572321
[SPARK-10336][example] fix not being able to set intercept in LR example
Aug 28, 2015
88032ec
[SPARK-9671] [MLLIB] re-org user guide and add migration guide
mengxr Aug 28, 2015
bb7f352
[SPARK-10323] [SQL] fix nullability of In/InSet/ArrayContain
Aug 28, 2015
2a4e00c
[SPARK-9803] [SPARKR] Add subset and transform + tests
felixcheung Aug 29, 2015
e8ea5ba
[SPARK-9910] [ML] User guide for train validation split
zapletal-martin Aug 29, 2015
5369be8
[SPARK-10350] [DOC] [SQL] Removed duplicated option description from …
witgo Aug 29, 2015
24ffa85
[SPARK-10289] [SQL] A direct write API for testing Parquet
liancheng Aug 29, 2015
5c3d16a
[SPARK-10344] [SQL] Add tests for extraStrategies
marmbrus Aug 29, 2015
277148b
[SPARK-10226] [SQL] Fix exclamation mark issue in SparkSQL
small-wang Aug 29, 2015
6a6f3c9
[SPARK-10330] Use SparkHadoopUtil TaskAttemptContext reflection metho…
JoshRosen Aug 29, 2015
097a7e3
[SPARK-10339] [SPARK-10334] [SPARK-10301] [SQL] Partitioned table sca…
yhuai Aug 29, 2015
13f5f8e
[SPARK-9986] [SPARK-9991] [SPARK-9993] [SQL] Create a simple test fra…
zsxwing Aug 30, 2015
905fbe4
[SPARK-10348] [MLLIB] updates ml-guide
mengxr Aug 30, 2015
ca69fc8
[SPARK-10331] [MLLIB] Update example code in ml-guide
mengxr Aug 30, 2015
1bfd934
[SPARK-10184] [CORE] Optimization for bounds determination in RangePa…
ihainan Aug 30, 2015
8d2ab75
[SPARK-10353] [MLLIB] BLAS gemm not scaling when beta = 0.0 for some …
brkyvz Aug 30, 2015
35e896a
SPARK-9545, SPARK-9547: Use Maven in PRB if title contains "[test-mav…
pwendell Aug 31, 2015
8694c3a
[SPARK-10351] [SQL] Fixes UTF8String.fromAddress to handle off-heap m…
Aug 31, 2015
f0f563a
[SPARK-100354] [MLLIB] fix some apparent memory issues in k-means|| i…
mengxr Aug 31, 2015
72f6dbf
[SPARK-8730] Fixes - Deser objects containing a primitive class attri…
EugenCepoi Aug 31, 2015
4a5fe09
[SPARK-10369] [STREAMING] Don't remove ReceiverTrackingInfo when dere…
zsxwing Aug 31, 2015
a2d5c72
[SPARK-10170] [SQL] Add DB2 JDBC dialect support.
sureshthalamati Aug 31, 2015
23e39cc
[SPARK-9954] [MLLIB] use first 128 nonzeros to compute Vector.hashCode
mengxr Aug 31, 2015
5b3245d
[SPARK-8472] [ML] [PySpark] Python API for DCT
yanboliang Aug 31, 2015
540bdee
[SPARK-10341] [SQL] fix memory starving in unsafe SMJ
Aug 31, 2015
fe16fd0
[SPARK-10349] [ML] OneVsRest use 'when ... otherwise' not UDF to gene…
yanboliang Aug 31, 2015
52ea399
[SPARK-10355] [ML] [PySpark] Add Python API for SQLTransformer
yanboliang Aug 31, 2015
d65656c
[SPARK-10378][SQL][Test] Remove HashJoinCompatibilitySuite.
rxin Sep 1, 2015
391e6be
[SPARK-10301] [SQL] Fixes schema merging for nested structs
liancheng Sep 1, 2015
e6e483c
[SPARK-9679] [ML] [PYSPARK] Add Python API for Stop Words Remover
holdenk Sep 1, 2015
3f63bd6
[SPARK-10398] [DOCS] Migrate Spark download page to use new lua mirro…
srowen Sep 1, 2015
ec01280
[SPARK-4223] [CORE] Support * in acls.
Sep 1, 2015
bf550a4
[SPARK-10162] [SQL] Fix the timezone omitting for PySpark Dataframe f…
0x0FFF Sep 1, 2015
00d9af5
[SPARK-10392] [SQL] Pyspark - Wrong DateType support on JDBC connection
0x0FFF Sep 1, 2015
c3b881a
[SPARK-7336] [HISTORYSERVER] Fix bug that applications status incorre…
ArcherShao Sep 2, 2015
56c4c17
[SPARK-10034] [SQL] add regression test for Sort on Aggregate
cloud-fan Sep 2, 2015
fc48307
[SPARK-10389] [SQL] support order by non-attribute grouping expressio…
cloud-fan Sep 2, 2015
2da3a9e
[SPARK-10004] [SHUFFLE] Perform auth checks when clients read shuffle…
Sep 2, 2015
6cd98c1
[SPARK-10417] [SQL] Iterating through Column results in infinite loop
0x0FFF Sep 2, 2015
03f3e91
[SPARK-10422] [SQL] String column in InMemoryColumnarCache needs to o…
yhuai Sep 3, 2015
44948a2
[SPARK-9723] [ML] params getordefault should throw more useful error
holdenk Sep 3, 2015
4bd85d0
[SPARK-5945] Spark should not retry a stage infinitely on a FetchFail…
Sep 3, 2015
0985d2c
[SPARK-8707] RDD#toDebugString fails if any cached RDD has invalid pa…
navis Sep 3, 2015
f6c447f
Removed code duplication in ShuffleBlockFetcherIterator
eracah Sep 3, 2015
3ddb9b3
[SPARK-10247] [CORE] improve readability of a test case in DAGSchedul…
squito Sep 3, 2015
62b4690
[SPARK-10379] preserve first page in UnsafeShuffleExternalSorter
Sep 3, 2015
0349b5b
[SPARK-10411] [SQL] Move visualization above explain output and hide …
zsxwing Sep 3, 2015
67580f1
[SPARK-10332] [CORE] Fix yarn spark executor validation
holdenk Sep 3, 2015
3abc0d5
[SPARK-9596] [SQL] treat hadoop classes as shared one in IsolatedClie…
WangTaoTheTonic Sep 3, 2015
af0e312
[SPARK-8951] [SPARKR] support Unicode characters in collect()
Sep 3, 2015
49aff7b
[SPARK-10432] spark.port.maxRetries documentation is unclear
Sep 3, 2015
d911c68
[SPARK-10431] [CORE] Fix intermittent test failure. Wait for event qu…
Sep 3, 2015
754f853
[SPARK-9869] [STREAMING] Wait for all event notifications before asse…
Sep 3, 2015
e62f4a4
[SPARK-9672] [MESOS] Don’t include SPARK_ENV_LOADED when passing env …
pashields Sep 3, 2015
11ef32c
[SPARK-10430] [CORE] Added hashCode methods in AccumulableInfo and RD…
Sep 3, 2015
db4c130
[SPARK-9591] [CORE] Job may fail for exception during getting remote …
jeanlyn Sep 3, 2015
08b0750
[SPARK-10435] Spark submit should fail fast for Mesos cluster mode wi…
Sep 3, 2015
208fbca
[SPARK-10421] [BUILD] Exclude curator artifacts from tachyon dependen…
Sep 3, 2015
cf42138
[SPARK-10003] Improve readability of DAGScheduler
Sep 4, 2015
143e521
[MINOR] Minor style fix in SparkR
shivaram Sep 4, 2015
804a012
MAINTENANCE: Automated closing of pull requests.
marmbrus Sep 4, 2015
c3c0e43
[SPARK-10176] [SQL] Show partially analyzed plans when checkAnswer fa…
cloud-fan Sep 4, 2015
3339e6f
[SPARK-10450] [SQL] Minor improvements to readability / style / typos…
Sep 4, 2015
b087d23
[SPARK-9669] [MESOS] Support PySpark on Mesos cluster mode.
tnachen Sep 4, 2015
2e1c175
[SPARK-10454] [SPARK CORE] wait for empty event queue
Sep 4, 2015
eafe372
[SPARK-10311] [STREAMING] Reload appId and attemptId when app starts …
XuTingjun Sep 4, 2015
22eab70
[SPARK-10402] [DOCS] [ML] Add defaults to the scaladoc for params in ml/
holdenk Sep 5, 2015
47058ca
[SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE_PARTITIONS.key correct…
yhuai Sep 5, 2015
6c75194
[HOTFIX] [SQL] Fixes compilation error
liancheng Sep 5, 2015
7a4f326
[SPARK-10440] [STREAMING] [DOCS] Update python API stuff in the progr…
tdas Sep 5, 2015
bca8c07
[SPARK-10434] [SQL] Fixes Parquet schema of arrays that may contain null
liancheng Sep 5, 2015
871764c
[SPARK-10013] [ML] [JAVA] [TEST] remove java assert from java unit tests
holdenk Sep 5, 2015
5ffe752
[SPARK-9767] Remove ConnectionManager.
rxin Sep 7, 2015
9d8e838
[DOC] Added R to the list of languages with "high-level API" support …
Sep 8, 2015
6ceed85
Docs small fixes
jaceklaskowski Sep 8, 2015
990c9f7
[SPARK-9170] [SQL] Use OrcStructInspector to be case preserving when …
viirya Sep 8, 2015
5b2192e
[SPARK-10480] [ML] Fix ML.LinearRegressionModel.copy()
yanboliang Sep 8, 2015
5fd5795
[SPARK-10316] [SQL] respect nondeterministic expressions in PhysicalO…
cloud-fan Sep 8, 2015
f7b55db
[SPARK-10470] [ML] ml.IsotonicRegressionModel.copy should set parent
yanboliang Sep 8, 2015
7a9dcbc
[SPARK-10441] [SQL] Save data correctly to json.
yhuai Sep 8, 2015
e6f8d36
[SPARK-10468] [ MLLIB ] Verify schema before Dataframe select API call
Sep 8, 2015
52b24a6
[SPARK-10492] [STREAMING] [DOCUMENTATION] Update Streaming documentat…
tdas Sep 8, 2015
d637a66
[SPARK-10327] [SQL] Cache Table is not working while subquery has ali…
chenghao-intel Sep 8, 2015
2143d59
[HOTFIX] Fix build break caused by #8494
marmbrus Sep 8, 2015
ae74c3f
[RELEASE] Add more contributors & only show names in release notes.
rxin Sep 9, 2015
820913f
[SPARK-10071] [STREAMING] Output a warning when writing QueueInputDSt…
zsxwing Sep 9, 2015
52fe32f
[SPARK-9834] [MLLIB] implement weighted least squares via normal equa…
mengxr Sep 9, 2015
a157348
[SPARK-10464] [MLLIB] Add WeibullGenerator for RandomDataGenerator
yanboliang Sep 9, 2015
3a11e50
[SPARK-10373] [PYSPARK] move @since into pyspark from sql
Sep 9, 2015
0e2f216
[SPARK-10094] Pyspark ML Feature transformers marked as experimental
noel-smith Sep 9, 2015
2f6fd52
[SPARK-9654] [ML] [PYSPARK] Add IndexToString to PySpark
holdenk Sep 9, 2015
91a577d
[SPARK-10249] [ML] [DOC] Add Python Code Example to StopWordsRemover …
hhbyyh Sep 9, 2015
c1bc4f4
[SPARK-10227] fatal warnings with sbt on Scala 2.11
Sep 9, 2015
2ddeb63
[SPARK-10117] [MLLIB] Implement SQL data source API for reading LIBSV…
Lewuathe Sep 9, 2015
c0052d8
[SPARK-10481] [YARN] SPARK_PREPEND_CLASSES make spark-yarn related ja…
zjffdu Sep 9, 2015
71da163
[SPARK-10461] [SQL] make sure `input.primitive` is always variable na…
cloud-fan Sep 9, 2015
45de518
[SPARK-9730] [SQL] Add Full Outer Join support for SortMergeJoin
viirya Sep 9, 2015
56a0fe5
[SPARK-9772] [PYSPARK] [ML] Add Python API for ml.feature.VectorSlicer
yanboliang Sep 10, 2015
1dc7548
[MINOR] [MLLIB] [ML] [DOC] fixed typo: label for negative result shou…
sparadiso Sep 10, 2015
48817cc
[SPARK-10497] [BUILD] [TRIVIAL] Handle both locations for JIRAError w…
holdenk Sep 10, 2015
4f1daa1
[SPARK-10065] [SQL] avoid the extra copy when generate unsafe array
cloud-fan Sep 10, 2015
f892d92
[SPARK-7142] [SQL] Minor enhancement to BooleanSimplification Optimiz…
Sep 10, 2015
49da38e
[SPARK-10301] [SPARK-10428] [SQL] Addresses comments of PR #8583 and …
liancheng Sep 10, 2015
e048111
[SPARK-10466] [SQL] UnsafeRow SerDe exception with data spill
chenghao-intel Sep 10, 2015
a76bde9
[SPARK-10469] [DOC] Try and document the three options
holdenk Sep 10, 2015
af3bc59
[SPARK-8167] Make tasks that fail from YARN preemption not fail job
mccheah Sep 10, 2015
f0562e8
[SPARK-6350] [MESOS] Fine-grained mode scheduler respects mesosExecut…
dragos Sep 10, 2015
a5ef2d0
[SPARK-10514] [MESOS] waiting for min no of total cores acquired by S…
SleepyThread Sep 10, 2015
d88abb7
[SPARK-9990] [SQL] Create local hash join operator
zsxwing Sep 10, 2015
45e3be5
[SPARK-10049] [SPARKR] Support collecting data of ArraryType in DataF…
Sep 10, 2015
3db7255
[SPARK-10443] [SQL] Refactor SortMergeOuterJoin to reduce duplication
Sep 10, 2015
4204757
Add 1.5 to master branch EC2 scripts
shivaram Sep 10, 2015
89562a1
[SPARK-7544] [SQL] [PySpark] pyspark.sql.types.Row implements __getit…
yanboliang Sep 10, 2015
0eabea8
[SPARK-9043] Serialize key, value and combiner classes in ShuffleDepe…
massie Sep 11, 2015
339a527
[SPARK-10023] [ML] [PySpark] Unified DecisionTreeParams checkpointInt…
yanboliang Sep 11, 2015
a140dd7
[SPARK-10027] [ML] [PySpark] Add Python API missing methods for ml.fe…
yanboliang Sep 11, 2015
e1d7f64
[SPARK-10472] [SQL] Fixes DataType.typeName for UDT
liancheng Sep 11, 2015
9bbe33f
[SPARK-10556] Remove explicit Scala version for sbt project build files
ahirreddy Sep 11, 2015
c268ca4
[SPARK-10518] [DOCS] Update code examples in spark.ml user guide to u…
y-shimizu Sep 11, 2015
b656e61
[SPARK-10026] [ML] [PySpark] Implement some common Params for regress…
yanboliang Sep 11, 2015
b01b262
[SPARK-9773] [ML] [PySpark] Add Python API for MultilayerPerceptronCl…
yanboliang Sep 11, 2015
960d2d0
[SPARK-10537] [ML] document LIBSVM source options in public API doc a…
mengxr Sep 11, 2015
2e3a280
[MINOR] [MLLIB] [ML] [DOC] Minor doc fixes for StringIndexer and Meta…
jkbradley Sep 11, 2015
6ce0886
[SPARK-10540] [SQL] Ignore HadoopFsRelationTest's "test all data type…
yhuai Sep 11, 2015
5f46444
[SPARK-8530] [ML] add python API for MinMaxScaler
hhbyyh Sep 11, 2015
b231ab8
[SPARK-10546] Check partitionId's range in ExternalSorter#spill()
tedyu Sep 11, 2015
c373866
[PYTHON] Fixed typo in exception message
icaromedeiros Sep 11, 2015
d5d6473
[SPARK-10442] [SQL] fix string to boolean cast
cloud-fan Sep 11, 2015
1eede3b
[SPARK-7142] [SQL] Minor enhancement to BooleanSimplification Optimiz…
Sep 11, 2015
e626ac5
[SPARK-9992] [SPARK-9994] [SPARK-9998] [SQL] Implement the local TopK…
zsxwing Sep 11, 2015
c2af42b
[SPARK-9990] [SQL] Local hash join follow-ups
Sep 11, 2015
d74c6a1
[SPARK-10564] ThreadingSuite: assertion failures in threads don't fai…
Sep 11, 2015
c34fc19
[SPARK-9014] [SQL] Allow Python spark API to use built-in exponential…
0x0FFF Sep 11, 2015
6d83678
[SPARK-10566] [CORE] SnappyCompressionCodec init exception handling m…
dimfeld Sep 12, 2015
8285e3b
[SPARK-10554] [CORE] Fix NPE with ShutdownHook
Sep 12, 2015
22730ad
[SPARK-10547] [TEST] Streamline / improve style of Java API tests
srowen Sep 12, 2015
f4a2280
[SPARK-6548] Adding stddev to DataFrame functions
JihongMA Sep 12, 2015
b3a7480
[SPARK-10330] Add Scalastyle rule to require use of SparkHadoopUtil J…
JoshRosen Sep 12, 2015
1dc614b
[SPARK-10222] [GRAPHX] [DOCS] More thoroughly deprecate Bagel in favo…
srowen Sep 13, 2015
d815654
[SPARK-9720] [ML] Identifiable types need UID in toString methods
BertrandDechoux Sep 14, 2015
32407bf
[SPARK-9899] [SQL] log warning for direct output committer with specu…
cloud-fan Sep 14, 2015
cf2821e
[SPARK-10584] [DOC] [SQL] Documentation about spark.sql.hive.metastor…
sarutak Sep 14, 2015
ce6f3f1
[SPARK-10194] [MLLIB] [PYSPARK] SGD algorithms need convergenceTol pa…
yanboliang Sep 14, 2015
8a634e9
[SPARK-10573] [ML] IndexToString output schema should be StringType
pnpritchard Sep 14, 2015
7e32387
[SPARK-10522] [SQL] Nanoseconds of Timestamp in Parquet should be pos…
Sep 14, 2015
64f0415
[SPARK-6981] [SQL] Factor out SparkPlanner and QueryExecution from SQ…
evacchi Sep 14, 2015
217e496
[SPARK-9996] [SPARK-9997] [SQL] Add local expand and NestedLoopJoin o…
zsxwing Sep 14, 2015
16b6d18
[SPARK-10594] [YARN] Remove reference to --num-executors, add --prope…
erickt Sep 14, 2015
4e2242b
[SPARK-10576] [BUILD] Move .java files out of src/main/scala
srowen Sep 14, 2015
ffbbc2c
[SPARK-10549] scala 2.11 spark on yarn with security - Repl doesn't work
Sep 14, 2015
fd1e8cd
[SPARK-10543] [CORE] Peak Execution Memory Quantile should be Per-tas…
saurfang Sep 14, 2015
7b6c856
[SPARK-10564] ThreadingSuite: assertion failures in threads don't fai…
Sep 14, 2015
1a09552
[SPARK-9851] Support submitting map stages individually in DAGScheduler
mateiz Sep 15, 2015
5520418
[SPARK-10542] [PYSPARK] fix serialize namedtuple
Sep 15, 2015
4ae4d54
[SPARK-9793] [MLLIB] [PYSPARK] PySpark DenseVector, SparseVector impl…
yanboliang Sep 15, 2015
610971e
[SPARK-10273] Add @since annotation to pyspark.mllib.feature
noel-smith Sep 15, 2015
a224935
[SPARK-10275] [MLLIB] Add @since annotation to pyspark.mllib.random
yu-iskw Sep 15, 2015
833be73
Small fixes to docs
Sep 15, 2015
6503c4b
[SPARK-10598] [DOCS]
insidedctm Sep 15, 2015
09b7e7c
Update version to 1.6.0-SNAPSHOT.
rxin Sep 15, 2015
c35fdcb
[SPARK-10491] [MLLIB] move RowMatrix.dspr to BLAS
hhbyyh Sep 15, 2015
8abef21
[SPARK-10300] [BUILD] [TESTS] Add support for test tags in run-tests.py.
Sep 15, 2015
7ca30b5
[PYSPARK] [MLLIB] [DOCS] Replaced addversion with versionadded in mll…
noel-smith Sep 15, 2015
0d9ab01
Closes #8738
mengxr Sep 15, 2015
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 2 additions & 0 deletions .rat-excludes
Original file line number Diff line number Diff line change
Expand Up @@ -93,3 +93,5 @@ INDEX
.lintr
gen-java.*
.*avpr
org.apache.spark.sql.sources.DataSourceRegister
.*parquet
2 changes: 1 addition & 1 deletion R/create-docs.sh
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ pushd $FWDIR
mkdir -p pkg/html
pushd pkg/html

Rscript -e 'library(SparkR, lib.loc="../../lib"); library(knitr); knit_rd("SparkR")'
Rscript -e 'libDir <- "../../lib"; library(SparkR, lib.loc=libDir); library(knitr); knit_rd("SparkR", links = tools::findHTMLlinks(paste(libDir, "SparkR", sep="/")))'

popd

Expand Down
5 changes: 0 additions & 5 deletions R/install-dev.bat
Original file line number Diff line number Diff line change
Expand Up @@ -25,8 +25,3 @@ set SPARK_HOME=%~dp0..
MKDIR %SPARK_HOME%\R\lib

R.exe CMD INSTALL --library="%SPARK_HOME%\R\lib" %SPARK_HOME%\R\pkg\

rem Zip the SparkR package so that it can be distributed to worker nodes on YARN
pushd %SPARK_HOME%\R\lib
%JAVA_HOME%\bin\jar.exe cfM "%SPARK_HOME%\R\lib\sparkr.zip" SparkR
popd
4 changes: 0 additions & 4 deletions R/install-dev.sh
Original file line number Diff line number Diff line change
Expand Up @@ -42,8 +42,4 @@ Rscript -e ' if("devtools" %in% rownames(installed.packages())) { library(devtoo
# Install SparkR to $LIB_DIR
R CMD INSTALL --library=$LIB_DIR $FWDIR/pkg/

# Zip the SparkR package so that it can be distributed to worker nodes on YARN
cd $LIB_DIR
jar cfM "$LIB_DIR/sparkr.zip" SparkR

popd > /dev/null
3 changes: 2 additions & 1 deletion R/pkg/DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
Package: SparkR
Type: Package
Title: R frontend for Spark
Version: 1.4.0
Version: 1.6.0
Date: 2013-09-09
Author: The Apache Software Foundation
Maintainer: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
Expand Down Expand Up @@ -29,6 +29,7 @@ Collate:
'client.R'
'context.R'
'deserialize.R'
'functions.R'
'mllib.R'
'serialize.R'
'sparkR.R'
Expand Down
90 changes: 86 additions & 4 deletions R/pkg/NAMESPACE
Original file line number Diff line number Diff line change
Expand Up @@ -47,12 +47,12 @@ exportMethods("arrange",
"join",
"limit",
"merge",
"mutate",
"na.omit",
"names",
"ncol",
"nrow",
"orderBy",
"mutate",
"names",
"persist",
"printSchema",
"rbind",
Expand All @@ -69,9 +69,11 @@ exportMethods("arrange",
"selectExpr",
"show",
"showDF",
"subset",
"summarize",
"summary",
"take",
"transform",
"unionAll",
"unique",
"unpersist",
Expand All @@ -82,59 +84,139 @@ exportMethods("arrange",

exportClasses("Column")

exportMethods("abs",
exportMethods("%in%",
"abs",
"acos",
"add_months",
"alias",
"approxCountDistinct",
"asc",
"ascii",
"asin",
"atan",
"atan2",
"avg",
"base64",
"between",
"bin",
"bitwiseNOT",
"cast",
"cbrt",
"ceil",
"ceiling",
"concat",
"concat_ws",
"contains",
"conv",
"cos",
"cosh",
"count",
"countDistinct",
"crc32",
"date_add",
"date_format",
"date_sub",
"datediff",
"dayofmonth",
"dayofyear",
"desc",
"endsWith",
"exp",
"explode",
"expm1",
"expr",
"factorial",
"first",
"floor",
"format_number",
"format_string",
"from_unixtime",
"from_utc_timestamp",
"getField",
"getItem",
"greatest",
"hex",
"hour",
"hypot",
"ifelse",
"initcap",
"instr",
"isNaN",
"isNotNull",
"isNull",
"last",
"last_day",
"least",
"length",
"levenshtein",
"like",
"lit",
"locate",
"log",
"log10",
"log1p",
"log2",
"lower",
"lpad",
"ltrim",
"max",
"md5",
"mean",
"min",
"minute",
"month",
"months_between",
"n",
"n_distinct",
"nanvl",
"negate",
"next_day",
"otherwise",
"pmod",
"quarter",
"rand",
"randn",
"regexp_extract",
"regexp_replace",
"reverse",
"rint",
"rlike",
"round",
"rpad",
"rtrim",
"second",
"sha1",
"sha2",
"shiftLeft",
"shiftRight",
"shiftRightUnsigned",
"sign",
"signum",
"sin",
"sinh",
"size",
"soundex",
"sqrt",
"startsWith",
"substr",
"substring_index",
"sum",
"sumDistinct",
"tan",
"tanh",
"toDegrees",
"toRadians",
"upper")
"to_date",
"to_utc_timestamp",
"translate",
"trim",
"unbase64",
"unhex",
"unix_timestamp",
"upper",
"weekofyear",
"when",
"year")

exportClasses("GroupedData")
exportMethods("agg")
Expand Down
Loading