Select a subset of data in a DataView.
Select-MLData -Data <IDataView> [-ByColumn <String>] [-ByKeyColumnFraction <String>] [-ByMissingValues <String[]>] [-ByColumnLowerBound <Double>] [-ByKeyColumnFractionLowerBound <Double>] [-ByColumnUpperBound <Double>] [-ByKeyColumnFractionUpperBound <Double>] [-Skip <Int64>] [-Take <Int64>] [-Shuffle] [-ShuffleSeed <Nullable<Int32>>] [-ShufflePoolSize <Int32>] [-DontShuffleSource] [-Cache <String[]>] [-Context <MLContext>] [<CommonParameters>]
The input data.
Type: Microsoft.ML.IDataView
Required: True
Position: named
Default value: None
Accept pipeline input: True (ByValue)
Accept wildcard characters: False
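The name of the column to filter on. Rows are kept when the column value falls within the range set by ByColumnLowerBound and ByColumnUpperBound.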
Type: System.String
Required: False
Position: named
Default value: null
Accept pipeline input: False
Accept wildcard characters: False
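The name of the key-type column to filter on. Rows are kept when the key value, expressed as a fraction of the key range, falls within the range set by ByKeyColumnFractionLowerBound and ByKeyColumnFractionUpperBound.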
Type: System.String
Required: False
Position: named
Default value: null
Accept pipeline input: False
Accept wildcard characters: False
The names of the columns to check for missing values. A row that has a missing value in any of these columns is dropped from the dataset.
Type: System.String[]
Required: False
Position: named
Default value: null
Accept pipeline input: False
Accept wildcard characters: False
The inclusive lower bound for the ByColumn filter.
Type: System.Double
Required: False
Position: named
Default value: -Infinity
Accept pipeline input: False
Accept wildcard characters: False
The inclusive lower bound for the ByKeyColumnFraction filter.
Type: System.Double
Required: False
Position: named
Default value: 0
Accept pipeline input: False
Accept wildcard characters: False
The exclusive upper bound for the ByColumn filter.
Type: System.Double
Required: False
Position: named
Default value: Infinity
Accept pipeline input: False
Accept wildcard characters: False
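The lower bound for ByColumn is inclusive and the upper bound is exclusive, so the filter keeps a half-open range. The following Python sketch only models that rule on plain dictionaries; it is not the cmdlet's implementation, and the function name `filter_by_column` and the `price` column are hypothetical.

```python
import math

def filter_by_column(rows, column, lower=-math.inf, upper=math.inf):
    """Keep rows whose value in `column` satisfies lower <= value < upper.

    The defaults mirror the documented defaults (-Infinity and Infinity).
    """
    return [row for row in rows if lower <= row[column] < upper]

# Hypothetical sample data for illustration only.
rows = [{"price": 5.0}, {"price": 10.0}, {"price": 99.9}, {"price": 100.0}]
kept = filter_by_column(rows, "price", lower=10.0, upper=100.0)

# 10.0 is kept (inclusive lower bound); 100.0 is dropped (exclusive upper bound).
print([row["price"] for row in kept])
```

A value equal to the lower bound passes the filter, while a value equal to the upper bound does not.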
The exclusive upper bound for the ByKeyColumnFraction filter.
Type: System.Double
Required: False
Position: named
Default value: 1
Accept pipeline input: False
Accept wildcard characters: False
The number of rows to skip from the start of the input.
Type: System.Int64
Required: False
Position: named
Default value: None
Accept pipeline input: False
Accept wildcard characters: False
The number of rows to take from the input.
Type: System.Int64
Required: False
Position: named
Default value: None
Accept pipeline input: False
Accept wildcard characters: False
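Skip and Take together select a window of rows. The Python sketch below models that windowing on an ordinary sequence. It assumes that skipping is applied before taking, which this page does not state, and the helper name `skip_take` is hypothetical.

```python
from itertools import islice

def skip_take(rows, skip=None, take=None):
    """Return a window of `rows`: drop the first `skip`, then keep `take`.

    Assumption: skip is applied before take. None means "not specified".
    """
    start = skip or 0
    stop = None if take is None else start + take
    return list(islice(rows, start, stop))

rows = list(range(10))
print(skip_take(rows, skip=2, take=3))  # rows at positions 2, 3 and 4
print(skip_take(rows, take=4))          # the first four rows
print(skip_take(rows, skip=8))          # everything after the first eight
```

Omitting either argument leaves that side of the window open, which matches both parameters being optional.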
Shuffle the rows of input.
Type: System.Management.Automation.SwitchParameter
Required: False
Position: named
Default value: False
Accept pipeline input: False
Accept wildcard characters: False
The random seed. If unspecified, the random seed is instead derived from the Context.
Type: System.Nullable<System.Int32>
Required: False
Position: named
Default value: null
Accept pipeline input: False
Accept wildcard characters: False
The number of rows to hold in the pool. Setting this to 1 will turn off pool shuffling and will only perform a shuffle by reading input in a random order.
Type: System.Int32
Required: False
Position: named
Default value: 1000
Accept pipeline input: False
Accept wildcard characters: False
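To illustrate why a pool size of 1 disables pool shuffling, here is a minimal Python sketch of a generic streaming pool shuffle. It is an assumption about how such a pool typically works, not the implementation behind this cmdlet, and the name `pool_shuffle` is hypothetical.

```python
import random

def pool_shuffle(rows, pool_size=1000, seed=None):
    """Yield `rows` in a locally randomized order using a fixed-size pool."""
    rng = random.Random(seed)
    pool = []
    for row in rows:
        if len(pool) < pool_size:
            # Fill the pool before emitting anything.
            pool.append(row)
            continue
        # Emit a random pooled row and put the incoming row in its slot.
        i = rng.randrange(pool_size)
        yield pool[i]
        pool[i] = row
    # Input exhausted: emit what is left in random order.
    rng.shuffle(pool)
    yield from pool

data = list(range(20))
shuffled = list(pool_shuffle(data, pool_size=5, seed=42))
unshuffled = list(pool_shuffle(data, pool_size=1, seed=42))
print(sorted(shuffled) == data)  # every row appears exactly once
print(unshuffled == data)        # a pool of one row cannot reorder anything
```

In this model a row can only move as far as the pool allows, so a larger pool mixes rows over a wider range at the cost of holding more rows in memory.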
If specified, the transform does not attempt to read the input in a random order and uses only pooling to shuffle. This parameter has no effect if the CanShuffle property of the input is false.
Type: System.Management.Automation.SwitchParameter
Required: False
Position: named
Default value: False
Accept pipeline input: False
Accept wildcard characters: False
The columns that must be cached whenever anything is cached. An empty array or a null value means that columns are cached on first access. Caching is applied only when this parameter is specified.
Type: System.String[]
Required: False
Position: named
Default value: null
Accept pipeline input: False
Accept wildcard characters: False
The context on which to perform the action. If omitted, the current (cached) context will be used.
Type: Microsoft.ML.MLContext
Required: False
Position: named
Default value: Current context
Accept pipeline input: False
Accept wildcard characters: False
This cmdlet supports the common parameters: Verbose, Debug, ErrorAction, ErrorVariable, WarningAction, WarningVariable, OutBuffer, PipelineVariable, and OutVariable. For more information, see about_CommonParameters.
Type | Description |
---|---|
Microsoft.ML.IDataView | You can pipe a data view to this cmdlet. |
Type | Description |
---|---|
Microsoft.ML.IDataView | This cmdlet returns a DataView. |