Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

code migration from machinelearning-automl to machinelearning repo feature branch take 2 #3326

Merged
merged 213 commits into from Apr 13, 2019

Conversation

Projects
None yet
8 participants
@Dmitry-A
Copy link

Dmitry-A commented Apr 13, 2019

Sorry I squash merged on the first try which is not what I want, gotta keep history. Changes here are:

  • Shuffle code to conform to ml.net folder structure
  • Add automl and automl test components to build
  • Fix issues so it builds

Note this does NOT include running tests or publishing nuget packages, that will come in as a separate PR

jongalloway and others added some commits Jan 4, 2019

make params for MLContext data extensions match ML.NET default names …
…and values; update gitignore; nit rev for Benchmarking.cs (#5)
Make validation data param mandatory; remove GetFirstPipeline sample (#…
…10)

* Make validation data param mandatory; remove GetFirstPipeline sample

* remove deprecated todo
Create ISSUE_TEMPLATE.md & PULL_REQUEST_TEMPLATE.md (#12)
* Create ISSUE_TEMPLATE.md

* Create PULL_REQUEST_TEMPLATE.md
add estimator extensions / catalog; add conversion from external to i…
…nternal pipeline; transform clean-up; add back in test proj and fix build; refactor trainer ext name mappings (#15)

* Make validation data param mandatory; remove GetFirstPipeline sample

* remove deprecated todo

* add estimator extensions / catalog; add ability to go from external to internal pipeline; a lot of transform clean-up; add back in test proj and get it building; refactor trainer ext name mappings
make GetNextPipeline API w/ public Pipeline method on PipelineSuggest…
…er; write GetNextPipeline API test; fix public Pipeline object serialization; fix header inferencing bug; write test utils for fetching datasets (#18)
get next pipeline API rev -- refactor API to consume column dimension…
…s, purpose, type, and name instead of available trainers & transforms (#19)
fix dataview take util bug, add dataview skip util, add some UTs to i…
…ncrease code coverage (#21)

* fix dataview take util bug, add dataview skip util, add some UTs to increase code coverage

* add accuracy threshold on AutoFit test

* add null check to best pipeline on autofit result
unit test additions (including user input validation testing); dead c…
…ode removal for code coverage (including KDO & associated utils); misc fixes & revs (#22)

srsaggam and others added some commits Apr 1, 2019

Suggested changes to progress bar (#335)
* progressbar

* added progressbar and refactoring

* reverted

* revert sign assembly

* added headers and removed exception rethrow

* bug fixes and updates to UI

* added friendly name printing for metric

* formatting
Telemetry2 (#333)
* Create test.txt

* Create test.txt

* changes needed for benchmarking

* forgot one file

* merge conflict fix

* fix build break

* back out my version of the fix for Label column issue and fix the original fix

* bogus file removal

* undo SuggestedPipeline change

* remove labelCol from pipeline suggester

* fix build break

* rename AutoML to Microsoft.ML.Auto everywhere and a shot at publishing nuget package (will probably need tweaks once I try to use the pipleline)

* tweak queue in vsts-ci.yml

* CLI telemetry implementation

* Telemetry implementation

* delete unnecessary file and change file size bucket to actually log log2 instead of nearest ceil value

* add headers, remove comments

* one more header missing
Fix progress bar in linux/osx (#336)
* progressbar

* added progressbar and refactoring

* reverted

* revert sign assembly

* added headers and removed exception rethrow

* bug fixes and updates to UI

* added friendly name printing for metric

* formatting

* change from task to thread

* Update src/mlnet/CodeGenerator/CodeGenerationHelper.cs

Co-Authored-By: srsaggam <41802116+srsaggam@users.noreply.github.com>
Mem leak fix (#328)
* Create test.txt

* Create test.txt

* changes needed for benchmarking

* forgot one file

* merge conflict fix

* fix build break

* back out my version of the fix for Label column issue and fix the original fix

* bogus file removal

* undo SuggestedPipeline change

* remove labelCol from pipeline suggester

* fix build break

* rename AutoML to Microsoft.ML.Auto everywhere and a shot at publishing nuget package (will probably need tweaks once I try to use the pipleline)

* tweak queue in vsts-ci.yml

* there is still investigation to be done but this fix works and solves memory leak problems

* minor refactor
Add cross-validation (CV), and auto-CV for small datasets; push commo…
…n API experiment methods into base class (#287)
Polishing the CLI UI part-1 (#338)
* formatting of pbar message

* Polishing the UI

* optimization

* rename variable

* Update src/mlnet/AutoML/AutoMLEngine.cs

Co-Authored-By: srsaggam <41802116+srsaggam@users.noreply.github.com>

* Update src/mlnet/CodeGenerator/CodeGenerationHelper.cs

Co-Authored-By: srsaggam <41802116+srsaggam@users.noreply.github.com>

* new message

* changed hhtp to https

* added iteration num + 1

* change string name and add color to artifacts

* change the message

* build errors

* added null checks

* added exception messsages to log file

* added exception messsages to log file
command line api upgrade and progress bar rendering bug (#366)
* added fix for all platforms progress bar

* upgrade nuget

* removed args from writeline
fix few bugs in progressbar and verbosity (#374)
* fix few bugs in progressbar and verbosity

* removed unused name space
Fix for folders with space in it while generating project (#376)
* support for folders with spaces

* added support for paths with space

* revert file

* change name of var

* remove spaces
Formatting Regression metrics and progress bar display days. (#379)
* added progress bar day display and fix regression metrics

* fix formatting

* added total time

* formatted total time
change command name and add pbar message (#380)
* change command name and add pbar message

* fix tests

* added aliases

* duplicate alias

* added another alias for task
UI missing features (#382)
* added formatting changes

* added accuracy specifically
Change in project structure (#385)
* initial changes

* Change in project structure

* correcting test

* change variable name

* fix tests

* fix tests

* fix more tests

* fix codegen errors

* adde log file message

* changed name of args

* change variable names

* fix test
FileSizeBuckets in correct units (#387)
* Minor telemetry change to log in correct units and make our life easier in the future

* Use Ceiling instead of Round
prep work to transfer to ml.net (#389)
* move test projects to top level test subdir

* rename some projects to make naming consistent and make it build again

* fix test project refs

@Dmitry-A Dmitry-A merged commit 013a8d3 into dotnet:features/automl Apr 13, 2019

1 of 3 checks passed

MachineLearning-CI #20190412.46 failed
Details
MachineLearning-CodeCoverage #20190412.41 failed
Details
license/cla All CLA requirements met.
Details

@shauheen shauheen added the AutoML label Apr 21, 2019

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.