Mirror of Apache Hivemall (incubating)
myui [HIVEMALL-210][BUGFIX] Fix a bug in lda_predict/plsa_predict
## What changes were proposed in this pull request?

Fixed a bug in lda_predict/plsa_predict in which a duplicated term probability was [unexpectedly replaced](https://github.com/apache/incubator-hivemall/blame/a8a97d6e873d5a8a30b06f92ddc14d1ec95c2738/core/src/main/java/hivemall/topicmodel/LDAPredictUDAF.java#L396).
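
To illustrate the general class of bug described here, the following is a minimal, hypothetical sketch and not the actual Hivemall code. It assumes term probabilities are collected into a `Map` keyed by term; the class name `TermProbabilityMerge` and both method names are invented for this example. With `Map.put`, a later entry for the same term silently replaces the earlier one. Accumulating with `Map.merge` is shown only as one possible alternative; the real patch may handle duplicates differently.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not taken from LDAPredictUDAF.
final class TermProbabilityMerge {

    // Problematic pattern: a duplicated term overwrites the earlier value.
    static Map<String, Float> replaceOnDuplicate(String[] terms, float[] probs) {
        Map<String, Float> result = new HashMap<>();
        for (int i = 0; i < terms.length; i++) {
            result.put(terms[i], probs[i]); // later value replaces earlier one
        }
        return result;
    }

    // One possible alternative (an assumption): sum the values of duplicated terms.
    static Map<String, Float> accumulateOnDuplicate(String[] terms, float[] probs) {
        Map<String, Float> result = new HashMap<>();
        for (int i = 0; i < terms.length; i++) {
            result.merge(terms[i], probs[i], Float::sum);
        }
        return result;
    }

    public static void main(String[] args) {
        String[] terms = {"apple", "banana", "apple"};
        float[] probs = {0.25f, 0.125f, 0.5f};
        System.out.println(replaceOnDuplicate(terms, probs).get("apple"));    // 0.5
        System.out.println(accumulateOnDuplicate(terms, probs).get("apple")); // 0.75
    }
}
```

In the first method the value 0.25 for the first occurrence of `apple` is lost, which is the kind of silent replacement the bug report describes.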

## What type of PR is it?

Bug Fix

## What is the Jira issue?

https://issues.apache.org/jira/browse/HIVEMALL-210

## How was this patch tested?

Unit tests and manual tests.

## Checklist

- [x] Did you apply source code formatter, i.e., `./bin/format_code.sh`, for your commit?
- [x] Did you run system tests on Hive (or Spark)?

Author: Makoto Yui <myui@apache.org>

Closes #154 from myui/HIVEMALL-210.
Latest commit b88e9f5 Aug 6, 2018
| Name | Last commit message | Date |
| --- | --- | --- |
| .github | Update GitHub PR template for code formatter | May 24, 2018 |
| bin | [HIVEMALL-207] Remove ddl/*.td.hql files maintained for a specific co… | Jun 22, 2018 |
| conf | Close #127: [HIVEMALL-2] Change Maven release scheme for ASF release | Dec 26, 2017 |
| core | [HIVEMALL-210][BUGFIX] Fix a bug in lda_predict/plsa_predict | Aug 6, 2018 |
| dist | Fixed to include relocated HCatalog in hivemall-all.jar | Jun 14, 2018 |
| docs/gitbook | [HIVEMALL-145] Merge Brickhouse functions | Jun 6, 2018 |
| mixserv | Applied formatter | Apr 27, 2018 |
| nlp | [HIVEMALL-208] Upgrade to Lucene 5.5.5 | Jul 5, 2018 |
| resources | [HIVEMALL-207] Remove ddl/*.td.hql files maintained for a specific co… | Jun 22, 2018 |
| spark | [HIVEMALL-145] Merge Brickhouse functions | Jun 6, 2018 |
| src/site | Make size of incubator logo smaller | Apr 26, 2018 |
| tools | [HIVEMALL-145] Merge Brickhouse functions | Jun 6, 2018 |
| xgboost | Applied formatter | Apr 27, 2018 |
| .dockerignore | Close #68: [HIVEMALL-84] Add Docker Support | Apr 25, 2017 |
| .gitignore | Close #131: [v0.5.0-rc3] Merge v0.5.0 branch | Feb 20, 2018 |
| .rat-excludes | Close #131: [v0.5.0-rc3] Merge v0.5.0 branch | Feb 20, 2018 |
| .travis.yml | mvn validate and compile-xgboost is no more used | Feb 21, 2018 |
| DISCLAIMER | Updated license headers | Oct 28, 2016 |
| KEYS | Close #127: [HIVEMALL-2] Change Maven release scheme for ASF release | Dec 26, 2017 |
| LICENSE | Fixed LICENSE file | Mar 6, 2018 |
| NOTICE | [HIVEMALL-145] Merge Brickhouse functions | Jun 6, 2018 |
| README.md | Request contributers to use ./bin/format_code.sh | May 16, 2018 |
| VERSION | Close #131: [v0.5.0-rc3] Merge v0.5.0 branch | Feb 20, 2018 |
| pom.xml | [HIVEMALL-145] Merge Brickhouse functions | Jun 6, 2018 |

Apache Hivemall: Hive scalable machine learning library

Apache Hivemall is a scalable machine learning library that runs on Apache Hive, Apache Spark, and Apache Pig. Hivemall is designed to be scalable to the number of training instances as well as the number of training features.

Usage

Find more examples in our user guide, and a brief introduction to Hivemall in the introductory slides.

Support

Support is provided through the user@hivemall.incubator.apache.org mailing list, not by direct e-mail.

Contributing

If you plan to contribute to this repository, please first create an issue on our JIRA page, even if the topic is not related to the source code itself (e.g., documentation, a new idea, or a proposal).

All Hivemall functions are defined under resources/ddl. To update the definition files, use the following script, which helps insert the function name and class path of your new UDF:

$ ./bin/update_ddls.sh

Before creating a pull request that includes Java code, please make sure your code follows our coding conventions by applying the formatter:

$ ./bin/format_code.sh