Default to DEBUG=OFF and PROFILE=OFF. #662

rcurtin · 2016-05-31T19:04:33Z

This is because people are using mlpack from git and wondering why it is so
slow.

This issue came to my attention because I came across this paper:
http://arxiv.org/abs/1602.02514
and in the revision I read (the first or probably the second revision on arXiv), I found that the numbers for mlpack were extremely slow. After consulting with the author, it turned out he had just compiled as-is off GitHub, with debugging symbols and all. So I think compiling without debugging symbols by default will be helpful for people who just want to use mlpack, don't look into how to configure it, and then wonder why mlpack is so slow.

I'm opening this as a PR for discussion, because there are advantages and disadvantages to this. Once we reach some consensus we can make a decision. So I'm for the change (which makes sense because I opened the PR :)).

zoq · 2016-06-01T12:05:40Z

At least for me, the master/developer branch is meant to be used for development only, not to run fair benchmarks against competing methods. Isn't that one reason why we have a release build which will not generate debug symbols suitable for debugging or post-processing?

rcurtin · 2016-06-02T18:39:38Z

I agree that the master branch is being used for development. It's possible we could create a separate "develop" branch and push to that instead, and push to master only when releases happen, but this then means that lots of experimental features aren't available to the average user. I also think mlpack developers don't have any problem enabling debugging and/or profiling symbols, whereas users might be confused about how to disable that.

I'm not sure, do you have any ideas for other solutions to this problem?

mentekid · 2016-06-02T19:21:10Z

Most people (myself included) will closely follow directions in the README before trying alternative install configurations, meaning if we make it slightly easier for them to just copy-paste commands on the terminal they will stop compiling with debug symbols.

For example, now README.md describes the installation process like this:

The next step is to run CMake to configure the project [...]
$ cmake ../
You can specify options to compile with debugging information and profiling information:
$ cmake -D DEBUG=ON -D PROFILE=ON ../

We could have it say instead:

The next step is to run CMake to configure the project [...]
$ cmake -D DEBUG=OFF -D PROFILE=OFF ../

You can specify options to compile with debugging information and profiling information:
$ cmake -D DEBUG=ON -D PROFILE=ON ../
Be warned: this will make mlpack significantly slower and is intended for mlpack developers.

(Or something like that)

zoq · 2016-06-03T14:05:08Z

So, I think the README is probably misleading some users, since it refers to one of the releases:

If you run CMake with no options, it will configure the project to build with no debugging symbols and no profiling information:

That's not true for the master branch. However, I think @mentekid is right, and most people take a look at the README to figure out how to build mlpack. So I'm also for the change, but I think we don't need to change the README.

SterlingPeet · 2016-06-04T15:53:58Z

Using master for mainline development (or not) is an interesting issue. Doing so is closely aligned with the SVN 'trunk' development and branching model, but may or may not be appropriate in git.

I have noticed that git based projects (especially on GitHub) present a much more professional look and feel when using master as the stable branch, in no small part because GitHub presents the master branch as the default landing page. So, perhaps it might be worth considering the master branch in light of it being the first public introduction for someone interested in trying out mlpack.

For unrelated reasons, my personal development evolved to follow the Successful Git Branching Model proposed/documented by Vincent Driessen. This model uses master as the branch that holds the stable, release code only. Release tags in this model all point into the master branch somewhere.

That model clearly doesn't address the concerns of making experimental features available to the end user, but that may not be a bad thing. Personally, I feel that the need for unavailable experimental features indicates that surpassing a higher barrier to entry (really just the ability to checkout a different git branch in this case) is not unreasonable to ask, and my professionally developed projects reflect that philosophy.

In any case, I think the README in the master branch should have an indication of what branch of code a user wants for normal use cases, like benchmarking or advanced (or unreleased) features.

Hopefully these comments are useful for considering how to mitigate the root concern; dealing with the unexpected things people do is challenging.

rcurtin · 2016-07-05T18:15:19Z

Ok, it seems like everyone thinks defaulting to DEBUG=OFF and PROFILE=OFF is a good idea, so I went ahead and merged this PR as commit 3fe0b72 (I did it manually because there is weirdness in this PR and I did not want to try to merge it and break everything...).

@SterlingPeet: thanks for the detailed response. I enjoyed the article you linked to. I think maybe at some point it may be worth changing to the model you suggested, but I am not sure that is a huge deal at the moment (maybe as the project grows that will be more necessary). As it stands now, we do use feature branches, which only get merged when the code is tested and ready. So I guess realistically master only contains code-that-will-be-released-but-maybe-isn't-quite-yet. Anyway, I'll think about it, but it would be some amount of effort to change, so unless someone else wants to take the lead, I'm fine leaving it how it is now (until there is a later problem, perhaps).

rcurtin and others added 30 commits May 16, 2016 14:57

Force positive-definiteness when training a Gaussian.

4536910

Better handling of NaNs.

2744628

Add --random_initialization for mlpack_hmm_train.

3e4f3ca

Don't forget the period in the output.

9866203

Add MultiplyConstantLayer which multiplies the input by a non-learnab…

2114358

…le constant.

Add NegativeLogLikelihoodLayer class which is useful to train a class…

cc7bac3

…ication problem with n classes.

Add the VRClassRewardLayer class which implements the REINFORCE algor…

b69c6dc

…itm for classification models. To be precise, this is is a Variance Reduces classification reinforcement learning rule.

Add GlimpseLayer class which takes an input image and a location to e…

be43684

…xtract a retina-like representation of the input image.

Remove debug message.

fe69d33

Add Recurrent Model of Visual Attention (RMVA) implementation.

4d8347a

Removes trailing whitespaces at the end of lines.

d1650e4

Include split_data.hpp file into the build process, so that it is ava…

989dd35

…ilable afterwards.

Remove unused output parameter.

f1bf339

Add RMVA class and function documentation.

39eefde

Properly use Enum type.

dd136db

Merge pull request mlpack#641 from MarcosPividori/master

9b42c22

Properly use Enum type.

Typo fix in knn_main.cpp

9b811f9

Merge pull request mlpack#643 from dasayan05/master

36b7316

Typo fix in knn_main.cpp

Properly use Enum type.

c82c747

Remove duplicated code for traversal info.

eef40b9

Instead of including: methods/neighbor_search/ns_traversal_info.hpp Include the definition in: core/tree/traversal_info.hpp

Remove unnecessary include.

b4ee954

Merge pull request mlpack#645 from MarcosPividori/master

f55427d

Properly use Enum type, in rann and range_search.

Merge pull request mlpack#646 from MarcosPividori/traversal-info

ade7fa0

Remove duplicated code for traversal info.

Remove trailing underscores in header guards as discussed in mlpack#533.

8acc4ce

Add CMakeLists file to build the RMVA.

cd7f063

Remove CMakeLists file; do not build the RMVA code.

c15541b

Use n_rows and n_cols to define the matrix size instead of arma::size().

04fe0d5

Deprecated arma function replaced by new arma constant

4fae385

Merge pull request mlpack#648 from dasayan05/PR1

5d1723d

Deprecated arma function replaced by new arma constant

Update documentation for changed names.

6f6173c

dasayan05 and others added 14 commits May 26, 2016 18:38

vc2_test.csv wasn't in proper csv format

c68c3b9

fix performance issue--forgot to move name parameters

fe1b6b9

Merge pull request mlpack#652 from dasayan05/test

290977d

vc2_test.csv wasn't in proper csv format

Merge pull request mlpack#653 from stereomatchingkiss/split_data_perf…

ec1c46a

…ormance_issue fix performance issue--forgot to move name parameters

Fix error, a balltree should use ballbounds instead of hrectbounds.

706f0ee

Set proper template order. To work with binary space trees, the metri…

9e5d9c2

…c template parameter must be the first.

Proper template order to BallBound.

c0cdb8b

Fix error. kdtree where should be a balltree.

903be7e

Change template name to be similar to the definition of hrectbound.

8285c31

Use add_cli_executable to control CLI executables build; more informa…

3d1ed0f

…tion take a look at mlpack#619.

Merge pull request mlpack#655 from MarcosPividori/fix-ball-tree

2fe9e82

Fix erroneous ball tree definition.

Fix spacing.

1dad2b6

Document the state of loading sparse matrices.

e36eec5

Default to DEBUG=OFF and PROFILE=OFF.

e8a1f1a

This is because people are using mlpack from git and wondering why it is so slow.

nilayjain force-pushed the master branch 2 times, most recently from fddfc18 to 1f562a1 Compare June 5, 2016 12:00

rcurtin closed this Jul 5, 2016
