Merge master into release/preview for 0.7 by shauheen · Pull Request #1469 · dotnet/machinelearning

shauheen · 2018-10-31T19:40:33Z

This PR merges master into release branch for 0.7

* Remove the error tracing when assembly loading fails for Maml. Also adding our native assemblies to the list to skip, so they aren't attempted to be loaded. Fix #1034

Helps the user to relate to the macOS version faster.

…1074)

* Multiclass logistic Regression tests enabled * threshold providing in tests * defining tolerance as a constant in baseTestBaseline Class * upper case camel for constant and _ for large decimal numbers

…800) * clarifying roadmap to mention current text/NLP features * updated the documentation to fix review comments

* updating documentation of TensorFlowTransform * making some some changes in the description

* (Fix issue #1050) Renamed namespace to Microsoft.ML.Transforms to conform with TF transform. * (Fix issue #1051) Added public create methods * (Fixed issue 1053) IDV names don't have to be the same as OnnxModel node names. * Added license header

* move Harness-related code to Harness folder * make sure that we always use recommended config * mention the external dependencies in the README * add nuget.config file so BDN can restore all packages * add comments to the config so I am not the only person who understands it * don't enable MemoryDiagnoser by default, it requires one extra iteration which is expensive for long running benchmarks * don't add nuget.config file, generate it on the fly when needed by BDN * generate a .csproj file that will handle both native dependencies and nuget.config file issue * describe authoring new benchmarks in the docs * add some integration tests that make sure that the benchmarks are not broken * register the right assemblies after recent change of assembly loading, makes all benchmark work again ;) * make Ranking benchmarks work * code review: split Helpers.cs into multiple files, cleanup the code, don't hardcode the dependencies

* Change all uses of System.Linq.Append to System.Linq.Concat. * Address code review comments

Fixes #1057 Fixes #1044 FIxes #1056

* Kmeans to estimator. * Adding the Clustering training context, the clustering Evaluate method, the KMeans extension on the clustering context and a tet for it.

… API. (#1082) This way, all components are registered before the Experiment tries to instantiate them. Fix #1042

* update ml.scoring to stable version * small change to kick off mac build * small change to kick off build * Kick off build * small change to kick off build

* simple fixe for warning issue in keytovalue transform * changed test file outputs so that they don't include the warnings

* AP xtensions * lbfgs derived classes take more arguments in their public ctors * adding pigstensions for lr, multilr, possion * Ogd static xtensions. * namespace change for pigstensions

…estimator (#1088)

Fixes #987 Adds a document describing high-level API concepts, as well as the 'cookbook' with a variety of samples.

* Adding the Ranker TrainContext, the Ranker TrainerEstimatorReconcilier, and an Evaluate method + metrics class to the existing RankerEvaluator. * Adding the FastTree ranking xtension and test. * Grouping the xtensions in classes with more meaningful names, since the docs site displays the methods per class, not file.

Adds project references for OnnxTransform, TensorFlowTransorm, and a NativeAssemblyReference for SymSGD

* Add a workaround for the tests hanging while loading MKL. The workaround is to ensure the MKL library is loaded very early in the test process, so it doesn't cause the deadlock. Workaround #1073 Another deadlock also occurs when running TestAutoInference and TestPipelineSweeper in parallel. Marking these tests to not run in parallel anymore. Workaround #1095 Moving back to the Azure Hosted VS2017 pool to run the tests now that we've narrowed the deadlocks down.

…owUtils.GetModelNodes (#1093)

)

…nchmarks (#1114)

…1032) * Add instructions for building for .NET Core 3.0, and make them work. Fix #1011 * Add config specific properties for the Intrinsics configs. * Allow tests to be run against .NET Core 3.0

Add MyGet link

* Enable statically-typed matrix factorization * Address comments 1. Add copyright 2. Try fix mac tolerance 3. Use MLContext with static pipeline * Add another example for in-memory matrix factorization

…1328) * Renaming the namespaces where the transforms live from Runtime.Data.Transforms to ML.Transforms. Addressing PR comments.

* Change TryParse* methods to return false instead of throw. * Make TextLoader throw on bad values * Make TextLoader throw on bad values, and fix unit tests. * Update Release baseline * Fix one more unit test

* Adding the catalog extensions for Concat, CopyColumns, Hash, KeyToVal

* adding airquality, infert datasets. Added samples for Term and KeyToVal estimators. * Adding tests for the NormalizerCatalog and the TextTransformCatalog. Adding samples for the ConcatEstimator, and KeyToValue, Term that will need to get referenced from the respective catalogs when they happen. re-organized the samples based on static-dynamic. Renamed.

…olumnsTransform (#1371) * Removes ChooseColumnsTransform and DropColumnsTransform classes replacing them with SelectColumnsTransform. These changes include: * Updates to SelectColumnsTransform to respect ordering when keeping columns. For example, if the input is ABC and CB is selected, the output will be CB. * Updates to code that used Choose or Drop columns, replacing with SelectColumns. * Updates to baseline output for tests to pass * Re-enabled the SavePipeline tests This fixes #1342 These changes are also related to #754

* more namespace move

…reated (#1428)

* Enhancements to online linear trainers to make them stateless. * Factor stateful logic into a separate internal object. * Remove direct usage of Console.Writeline * Opportunistic fixes of minor issues. * Nuke failing Mac test temporarily

* more reorg and namespace move * renaming the HashEstimator and taking care of consequences

Added a custom mapping transformer/estimator

…ection transforms (#1254)

* adding all extensions for the Text related transformation * adding keyToVector extensions and renaming the estimatorto conform to 1318 * Adding SelectColumns xtensions. Renaming SelectEstimator and Textnormalizer * KeyToBinary extensions * Adding extensions for the Image transforms. Some renaming

Fix an unassigned field for matrix factorization.

# Conflicts: # build/Dependencies.props # src/Microsoft.ML.Data/Evaluators/ClusteringEvaluator.cs # src/Microsoft.ML.Data/Scorers/PredictionTransformer.cs # src/Microsoft.ML.KMeansClustering/KMeansPlusPlusTrainer.cs # src/Microsoft.ML.KMeansClustering/KMeansStatic.cs # src/Microsoft.ML.Legacy/AssemblyRegistration.cs # src/Microsoft.ML.PCA/WrappedPcaTransform.cs # src/Microsoft.ML.StandardLearners/Standard/LogisticRegression/LbfgsPredictorBase.cs # src/Microsoft.ML.StandardLearners/Standard/LogisticRegression/MulticlassLogisticRegression.cs # src/Microsoft.ML.StandardLearners/Standard/Online/AveragedLinear.cs # src/Microsoft.ML.StandardLearners/Standard/Online/AveragedPerceptron.cs # src/Microsoft.ML.StandardLearners/Standard/Online/LinearSvm.cs # src/Microsoft.ML.StandardLearners/Standard/Online/OnlineGradientDescent.cs # src/Microsoft.ML.StandardLearners/Standard/Online/OnlineLearnerStatic.cs # src/Microsoft.ML.StandardLearners/Standard/Online/OnlineLinear.cs # src/Microsoft.ML.TensorFlow/doc.xml # src/Microsoft.ML.Transforms/NAReplaceTransform.cs # src/Microsoft.ML.Transforms/Text/TextTransform.cs # test/Microsoft.ML.StaticPipelineTesting/Training.cs # test/Microsoft.ML.Tests/TrainerEstimators/LbfgsTests.cs # test/Microsoft.ML.Tests/TrainerEstimators/TrainerEstimators.cs

shauheen and others added 30 commits September 26, 2018 13:13

Bump master to 0.7 (#1037)

759ac33

Remove the error tracing when assembly loading fails for Maml. (#1058)

a80e3d6

* Remove the error tracing when assembly loading fails for Maml. Also adding our native assemblies to the list to skip, so they aren't attempted to be loaded. Fix #1034

Use full test name (#1035)

437c1ba

Provided the name for macOS 10.12 version. (#1070)

769b1eb

Helps the user to relate to the macOS version faster.

Finish the sentence in TextLoader static pipeline extension method (#…

7fde5a3

…1074)

Enabled Multiclass Logistic Regression Tests (#939)

b871c86

* Multiclass logistic Regression tests enabled * threshold providing in tests * defining tolerance as a constant in baseTestBaseline Class * upper case camel for constant and _ for large decimal numbers

Clarified roadmap to mention existence of current text/NLP features (#…

5b4284c

…800) * clarifying roadmap to mention current text/NLP features * updated the documentation to fix review comments

Updated documentation for TensorFlowTransform (#1077)

eb87467

* updating documentation of TensorFlowTransform * making some some changes in the description

Change ML.NET to work with .NET Framework 4.6.1 (#1075)

0e7f8c9

* Change all uses of System.Linq.Append to System.Linq.Concat. * Address code review comments

Cumulative changes based on 0.6 bag bash (#1079)

93fbd25

Fixes #1057 Fixes #1044 FIxes #1056

Converting KMeans++trainer to estimator. (#979)

8aa4f1f

* Kmeans to estimator. * Adding the Clustering training context, the clustering Evaluate method, the KMeans extension on the clustering context and a tet for it.

Ensure all Microsoft.ML assemblies are loaded by the LearningPipeline…

0996878

… API. (#1082) This way, all components are registered before the Experiment tries to instantiate them. Fix #1042

update ml.scoring library to stable version (#1086)

abac853

* update ml.scoring to stable version * small change to kick off mac build * small change to kick off build * Kick off build * small change to kick off build

Temporary fix for warning issue in KeyToValueTransform (#1083)

f10212c

* simple fixe for warning issue in keytovalue transform * changed test file outputs so that they don't include the warnings

More pigstensions (#1084)

0b9ca00

* AP xtensions * lbfgs derived classes take more arguments in their public ctors * adding pigstensions for lr, multilr, possion * Ogd static xtensions. * namespace change for pigstensions

Added training method that accepts initial predictor for Symboli SGD …

1ae6070

…estimator (#1088)

API overview and samples (#960)

95f5f27

Fixes #987 Adds a document describing high-level API concepts, as well as the 'cookbook' with a variety of samples.

Add other projects to console project (#1099)

f124d69

Adds project references for OnnxTransform, TensorFlowTransorm, and a NativeAssemblyReference for SymSGD

Introduce CustomPipelineColumn (#1091)

161b450

Add release notes for ML.NET 0.6 (#1102)

70b3c3b

Add xml documentation for TensorFlowUtils.GetModelSchema and TensorFl…

87ffaa1

…owUtils.GetModelNodes (#1093)

Updated the building instructions to specify supported VS version (#1024

76dd923

)

Adding ONNX scoring example link and prediction engine improvement be…

00a10ad

…nchmarks (#1114)

Fixed a grammatical error in windows-instructions (#1117)

eb1c141

Allow the creation of ONNX initializers (#965)

ff85a5c

Add instructions for building for .NET Core 3.0, and make them work. (#…

fcea146

…1032) * Add instructions for building for .NET Core 3.0, and make them work. Fix #1011 * Add config specific properties for the Intrinsics configs. * Allow tests to be run against .NET Core 3.0

shauheen and others added 20 commits October 29, 2018 15:39

minor update to Readme.md

1dbc99a

Add MyGet link

Fix WordTokenize bug (#1433)

22f57f4

Enable statically-typed matrix factorization (#1407)

b495a03

* Enable statically-typed matrix factorization * Address comments 1. Add copyright 2. Try fix mac tolerance 3. Use MLContext with static pipeline * Add another example for in-memory matrix factorization

Moves dotnet-server shutdown to official build yml file (#1432)

b8267dd

Renaming some transforms to follow the estimator naming convention. (#…

453eb57

…1328) * Renaming the namespaces where the transforms live from Runtime.Data.Transforms to ML.Transforms. Addressing PR comments.

Fix bug (#1437)

a55b2a6

Change TryParse* methods to return false instead of throw. (#1385)

f8ef7e2

* Change TryParse* methods to return false instead of throw. * Make TextLoader throw on bad values * Make TextLoader throw on bad values, and fix unit tests. * Update Release baseline * Fix one more unit test

Adding transform extensions (#1448)

c903a53

* Adding the catalog extensions for Concat, CopyColumns, Hash, KeyToVal

more namespace move for transforms (#1457)

e4c2fa0

* more namespace move

Improved existing Append summaries, clarifying that a new object is c…

0b175ba

…reated (#1428)

Last namespace re-org (#1458)

7e5ee22

* more reorg and namespace move * renaming the HashEstimator and taking care of consequences

Convert to estimator (#1439)

c8a0c67

Custom mapping transformer (#1406)

a039462

Added a custom mapping transformer/estimator

Estimators for Timeseries SSA / IID ChangepointDetection and SpikeDet…

1391107

…ection transforms (#1254)

Fix unassigned public field (#1467)

a26eca7

Fix an unassigned field for matrix factorization.

shauheen requested review from Ivanidzo4ka, Zruty0 and eerhardt October 31, 2018 19:40

eerhardt approved these changes Oct 31, 2018

View reviewed changes

shauheen requested review from TomFinley and sfilipi October 31, 2018 20:06

sfilipi approved these changes Oct 31, 2018

View reviewed changes

shauheen merged commit c5cef31 into dotnet:release/preview Oct 31, 2018

shauheen deleted the rc107 branch October 31, 2018 20:22

ghost locked as resolved and limited conversation to collaborators Mar 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge master into release/preview for 0.7#1469

Merge master into release/preview for 0.7#1469
shauheen merged 161 commits intodotnet:release/previewfrom
shauheen:rc107

shauheen commented Oct 31, 2018 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

shauheen commented Oct 31, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

shauheen commented Oct 31, 2018 •

edited

Loading