Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ignoring CSV columns in input file #35

Closed
matthewrobertbell opened this issue Sep 4, 2021 · 2 comments
Closed

Ignoring CSV columns in input file #35

matthewrobertbell opened this issue Sep 4, 2021 · 2 comments
Labels
enhancement New feature or request

Comments

@matthewrobertbell
Copy link

(Great project by the way!)

It seems like it isn't currently possible to ignore CSV columns in the input file, this would be useful for training without having to process CSV files with things like database ID columns before starting training.

@nitsky
Copy link
Contributor

nitsky commented Sep 7, 2021

Hi @matthewrobertbell, we don't currently have this option, so I will leave this issue open until we implement it. In the meantime, I suggest you try xsv. You can use it to create a new csv with a subset of the columns like so: cat data.csv | xsv select column_a,column_b > data2.csv.

@nitsky
Copy link
Contributor

nitsky commented Nov 15, 2021

This is supported by specifying a subset of the columns with the columns key in a configuration file: https://www.tangram.dev/docs/guides/train_with_custom_configuration.

@nitsky nitsky closed this as completed Nov 15, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants