Feature Request: Support TSV and JSON file formats as input data #66
Comments
Thank you for your suggestion.
pandas provides DataFrame.to_json and pandas.read_json, which support several user-specified orientations: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_json.html So on the command line we could have a similar “orient” option that lets the user indicate which orientation the file uses. to_json can also embed a schema in the output JSON file. Would this work as well?
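As a rough illustration of the orientations mentioned above, here is a minimal sketch that uses only the pandas calls named in the comment (DataFrame.to_json and pandas.read_json). The sample frame and its column names are made up for the example and do not come from Ludwig.

```python
import io
import json

import pandas as pd

# Hypothetical sample data: text values containing commas and double quotes.
df = pd.DataFrame({"text": ['He said, "hi"', "a,b"], "label": [1, 0]})

# Serialize the same frame under different orientations.
records_json = df.to_json(orient="records")  # list of {column: value} objects
split_json = df.to_json(orient="split")      # separate columns / index / data
table_json = df.to_json(orient="table")      # data plus an embedded schema

# The reader has to be told which orientation the file uses.
from_records = pd.read_json(io.StringIO(records_json), orient="records")
from_split = pd.read_json(io.StringIO(split_json), orient="split")

# The "table" orientation carries a top-level "schema" entry next to "data".
table_keys = sorted(json.loads(table_json).keys())
```

A command-line “orient” option would presumably just be passed through to read_json in this way, but that is an assumption about how the feature might be wired up.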
@w4nderlust so the good old
@w4nderlust Is this issue still open, or is anyone working on it?
@jeffin07 all work right now is focused on TF2, so if you're interested in this feature, you could submit a PR. Thanks.
@jeffin07 it is still open and needed. We are also planning a reorganization of the preprocessing pipeline. This branch: https://github.com/uber/ludwig/tree/preprocessing_strategy has an example of what the preprocessing would look like after refactoring, which will make this feature easy to add, but I haven't spent time on it yet as I am full steam on TF2.
@w4nderlust Yes, I would love to help with the refactoring, which will also help me understand the project better. I will check out the branch you specified. Can you give me some guidance to get started?
Sure, definitely.
This has been recently added. Closing.
CSV is good for numerical data, but when you have text data that may contain , and ", escaping the values in the columns can be tricky, and it is harder for CSV parsers to identify the delimiter comma. Can you add support for the TSV and JSON data file formats, which suffer less from these problems?
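To make the escaping problem concrete, here is a small sketch using only the Python standard library. The sample row and the helper name roundtrip_delimited are hypothetical and serve only as illustration; this is not how Ludwig reads its input.

```python
import csv
import io
import json

# Hypothetical row whose text contains both a comma and double quotes.
rows = [{"text": 'She said, "hello", then left', "label": "1"}]


def roundtrip_delimited(rows, delimiter):
    """Write rows with the csv module, then parse them back."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["text", "label"], delimiter=delimiter)
    writer.writeheader()
    writer.writerows(rows)
    raw = buf.getvalue()
    parsed = [dict(r) for r in csv.DictReader(io.StringIO(raw), delimiter=delimiter)]
    return raw, parsed


csv_raw, csv_rows = roundtrip_delimited(rows, ",")
tsv_raw, tsv_rows = roundtrip_delimited(rows, "\t")

# Splitting naively on the delimiter breaks the CSV line but not the TSV one,
# because the text contains commas and no tabs.
naive_csv_fields = csv_raw.splitlines()[1].split(",")
naive_tsv_fields = tsv_raw.splitlines()[1].split("\t")

# JSON Lines: one JSON object per line, with escaping handled by the encoder.
json_raw = "\n".join(json.dumps(r) for r in rows)
json_rows = [json.loads(line) for line in json_raw.splitlines()]
```

A quoting-aware parser recovers the row in all three formats. The difference is that a comma-delimited file depends on every producer and consumer getting the quoting right, whereas tabs rarely occur inside free text and JSON has a single well-defined escaping rule.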