Feature/49 dynamic schema generation #69

zeyus · 2022-11-14T14:36:37Z

Fixes #49

So, we now have a fluent, SQLAlchemy compatible interface to dynamic schemas per user per datasource. These are stored in unique SQLAlchemy SQLite database files.

zeyus · 2022-11-14T15:02:32Z

nlp4all/models/DataSource.py

+    id = Column(Integer, primary_key=True)
+    user = Column(Integer, ForeignKey("user.id"))
+    data_source_id = Column(String(80), unique=True, nullable=False)
+    user_id = Column(Integer, nullable=False)


ok I def shouldn't need a user id here, that's already in the user relationship, fixing now

one further comment here, that column has been removed, but we might need a column or columns at some point to store information about the data source, like which columns are used in analysis or filtering etc, but that most likely belongs in the controller / model of the specific analysis, not in the data source / data source manager.

zeyus · 2022-11-14T15:05:16Z

all suggestions, comments, improvements, criticisms welcome! :D

zeyus · 2022-11-15T08:47:20Z

nlp4all/models/datasource_manager.py

+            os.makedirs(ds_dir)
+        # generate a unique filename from a hash of the data source id
+        # this is to prevent a user from accessing another user's data
+        ds_id_hash = hashlib.sha256(self._data_source_name.encode()).hexdigest()


I think I should make the hash not based on the data source name, because the user might change it (if we let them)...

The ID might be better, but it's also predictable, this might not matter too much because the data directory should never be publicly accessible, so I think we should change it to ID rather than name, as ID should be unique.

Is there any circumstance why the ID of the user may change? some migration? Docker reasons?

not in any way that will affect things, migrations won't change user IDs, and independent installations can have different IDs but they will also have their own data sources and their own users so it won't be an issue

zeyus added 2 commits November 11, 2022 15:15

initial idea for dynamic datasource schema, messy WIP.

530cee9

User-based datasource manager, with fluent SQLAlchemy compatible API

dbdd70f

zeyus requested review from franciscoabenza and emilroenn November 14, 2022 14:36

zeyus linked an issue Nov 14, 2022 that may be closed by this pull request

Dynamic schema generation #49

Closed

zeyus added 4 commits November 14, 2022 15:38

Merge branch 'develop' into feature/49-dynamic-schema-generation

e966cab

Whoops, forgot to change filename after refactor.

ec5047b

not sure why, but path does not seem to exist on github.

1b0eff5

now I know why...

bb1eda8

zeyus commented Nov 14, 2022

View reviewed changes

Removed unnecessary column.

158debb

zeyus added 3 commits November 14, 2022 18:57

Added destructor.

c02e5f8

Refactored enum to avoid linting bypass, sorted enums.

36229d6

Refactored tests, added tests for enums.

75ecd7a

zeyus commented Nov 15, 2022

View reviewed changes

Changed DataSourceManager to use ds id (int) instead of name (str).

bacb782

franciscoabenza approved these changes Nov 15, 2022

View reviewed changes

zeyus merged commit 9044838 into develop Nov 15, 2022

zeyus deleted the feature/49-dynamic-schema-generation branch November 15, 2022 10:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/49 dynamic schema generation #69

Feature/49 dynamic schema generation #69

zeyus commented Nov 14, 2022 •

edited

Loading

zeyus Nov 14, 2022

zeyus Nov 15, 2022

zeyus Nov 15, 2022

zeyus commented Nov 14, 2022

zeyus Nov 15, 2022

zeyus Nov 15, 2022

franciscoabenza Nov 15, 2022

zeyus Nov 15, 2022 •

edited

Loading

Feature/49 dynamic schema generation #69

Feature/49 dynamic schema generation #69

Conversation

zeyus commented Nov 14, 2022 • edited Loading

zeyus Nov 14, 2022

Choose a reason for hiding this comment

zeyus Nov 15, 2022

Choose a reason for hiding this comment

zeyus Nov 15, 2022

Choose a reason for hiding this comment

zeyus commented Nov 14, 2022

zeyus Nov 15, 2022

Choose a reason for hiding this comment

zeyus Nov 15, 2022

Choose a reason for hiding this comment

franciscoabenza Nov 15, 2022

Choose a reason for hiding this comment

zeyus Nov 15, 2022 • edited Loading

Choose a reason for hiding this comment

zeyus commented Nov 14, 2022 •

edited

Loading

zeyus Nov 15, 2022 •

edited

Loading