Skip to content

SymbolicStochasticGradientDescent always prints to console #1985

@daholste

Description

@daholste

System information

  • OS version/distro: Windows 10
  • .NET Version (eg., dotnet --info): .NET Core 2.1

Issue

  • What did you do?
    Ran this code:
const string trainDataPath = @"C:\data\sample_train2.csv";
const string validationDataPath = @"C:\data\sample_valid2.csv";
const string testDataPath = @"C:\data\sample_test2.csv";

var mlContext = new MLContext();

// auto-infer text loader args
var textLoaderArgs = RecipeInference.MyAutoMlInferTextLoaderArguments(mlContext, trainDataPath, "Label");

// load data
var textLoader = new TextLoader(mlContext,
	new TextLoader.Arguments()
	{
		Separator = ",",
		HasHeader = true,
		Column = new[]
		{
			new TextLoader.Column("Age", DataKind.R4, 0),
			new TextLoader.Column("Workclass", DataKind.TX, 1),
			new TextLoader.Column("Fnlwgt", DataKind.R4, 2),
			new TextLoader.Column("Education", DataKind.TX, 3),
			new TextLoader.Column("EducationNum", DataKind.R4, 4),
			new TextLoader.Column("MaritalStatus", DataKind.TX, 5),
			new TextLoader.Column("Occupation", DataKind.TX, 6),
			new TextLoader.Column("Relationship", DataKind.TX, 7),
			new TextLoader.Column("Race", DataKind.TX, 8),
			new TextLoader.Column("Sex", DataKind.TX, 9),
			new TextLoader.Column("CapitalGain", DataKind.R4, 10),
			new TextLoader.Column("CapitalLoss", DataKind.R4, 11),
			new TextLoader.Column("HoursPerWeek", DataKind.R4, 12),
			new TextLoader.Column("NativeCountry", DataKind.TX, 13),
			new TextLoader.Column("Label", DataKind.Bool, 14),
		}
	});

var trainData = textLoader.Read(trainDataPath);
var validationData = textLoader.Read(validationDataPath);
var testData = textLoader.Read(testDataPath);

// preprocess
var preprocessorEstimator = mlContext.Transforms.Categorical.OneHotEncoding("Workclass", "Workclass")
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("Education", "Education"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("MaritalStatus", "MaritalStatus"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("Occupation", "Occupation"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("Relationship", "Relationship"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("Race", "Race"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("Sex", "Sex"))
	.Append(mlContext.Transforms.Categorical.OneHotEncoding("NativeCountry", "NativeCountry"))
	.Append(mlContext.Transforms.Concatenate(DefaultColumnNames.Features,
		"Age", "Workclass", "Fnlwgt", "Education", "EducationNum", "MaritalStatus", "Occupation", "Relationship",
		"Race", "Sex", "CapitalGain", "CapitalLoss", "HoursPerWeek", "NativeCountry"));

// train model
var trainer = mlContext.BinaryClassification.Trainers.SymbolicStochasticGradientDescent();
var estimatorChain = preprocessorEstimator.Append(trainer);
var model = estimatorChain.Fit(trainData);
  • What happened?
Initial learning rate is tuned to 10.000000

was printed to the console. Looks like SymSGD learner w/ no initialization params always prints this line?

  • What did you expect?
    Nothing to be printed to the console

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions