Where to place data #714
Comments
The diagram here: https://github.com/vmware/hillview/blob/master/docs/userManual.md#11-system-architecture shows the system architecture. Only workers read data. If you read data from files, the files must be on the same machines where the workers run. On a single machine you can load files from that machine. On a cluster it is easiest to divide the data among the worker machines, placing all files that should be analyzed together in the same directory on each machine. If you have a more concrete use case we can discuss it specifically.
But ideally you should not need to move any of your data when using Hillview. If you already have the data stored in a distributed system, e.g. a set of logs on some machines, the ideal case is to deploy a Hillview worker on each machine that stores some of the data. Many data lakes look like this.
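If you do need to spread files over the worker machines, the division described above can be planned with a small script. This is only a sketch under assumptions: the host names, the target directory, and the `assign_files` helper are made up for illustration and are not part of Hillview. It assigns files round-robin and puts them in the same directory on every worker.

```python
from pathlib import PurePosixPath


def assign_files(files, workers, target_dir):
    """Plan a round-robin split of files over workers.

    Returns {worker: [destination paths]}; every worker uses the same
    target_dir, so files analyzed together share one directory name.
    This only builds the plan; it does not copy anything.
    """
    if not workers:
        raise ValueError("at least one worker is required")
    plan = {w: [] for w in workers}
    for i, f in enumerate(sorted(files)):
        worker = workers[i % len(workers)]
        dest = PurePosixPath(target_dir) / PurePosixPath(f).name
        plan[worker].append(str(dest))
    return plan


# Hypothetical file and host names, for illustration only.
files = ["logs/a.csv", "logs/b.csv", "logs/c.csv"]
workers = ["worker1", "worker2"]
plan = assign_files(files, workers, "/data/logs")
for worker, paths in plan.items():
    print(worker, paths)
```

The actual copying (scp, rsync, or whatever you already use) is left out on purpose; the point is only that each worker ends up with its share of the files under the same directory path.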
I think that you have loaded the data correctly. The issue is that your table has lots of columns, and thus it starts in a "Schema" view instead of a "Table" view. The schema view shows all columns and lets you choose which ones to see in a table view. So you have selected 9 columns and displayed these as a table. If you want to see a table with all columns, just select all of them (click on the first row, then shift-click on the last row) and use the menu "view selected columns". I could also add menu buttons "select all columns" or "view all columns as table" to make this easier.
The reason we show a schema view for wide tables is that they are too wide to fit nicely on the screen, so we give you the option to select only a subset of the columns.
Thanks for your prompt responses. I wanted to know the following.
I really appreciate your responses, and I see good potential for Hillview in my project.
Hillview will always aggregate the displayed data in some form, because most data does not fit on the screen. For 2, the only solution right now is to edit the code; this is in the file loadView.ts. But this is a good idea: to give you the possibility of creating a JSON file with a set of files to load. I will file a separate issue for that. If you look at the code in loadView.ts, it looks like this:
So this is in fact just a JSON object. We could read this object from a file. But I will need to document the schema of the JSON.
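To make the idea of reading the description from a file concrete, here is a rough sketch of loading and checking such a JSON document. The field names (`name`, `folder`, `filePattern`) and the `parse_datasets` helper are assumptions invented for this example; they are not the object from loadView.ts and not a documented Hillview schema.

```python
import json

# Hypothetical keys, chosen only for this sketch.
REQUIRED_KEYS = {"name", "folder", "filePattern"}


def parse_datasets(text):
    """Parse a JSON array of dataset descriptions and check required keys."""
    data = json.loads(text)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of dataset descriptions")
    for entry in data:
        if not isinstance(entry, dict):
            raise ValueError("each dataset description must be a JSON object")
        missing = REQUIRED_KEYS - set(entry)
        if missing:
            raise ValueError(f"dataset entry is missing keys: {sorted(missing)}")
    return data


# Example input with made-up values.
example = '[{"name": "flights", "folder": "/data/flights", "filePattern": "*.csv"}]'
datasets = parse_datasets(example)
print(datasets[0]["name"], datasets[0]["folder"])
```

Whatever the real schema turns out to be, validating the file up front like this would give a clear error message instead of a failed load later.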
I have two questions:
Any articles or guidance would be appreciated.