fix: datasource payload is incorrect #15184

betodealmeida · 2021-06-16T00:48:59Z

SUMMARY

We currently return all datasources in the /datasources/ and /chart/add endpoints. This PR changes the payload to have only user-accessible datasources.

The PR adds a new method get_user_datasources, which replaces get_all_datasources in the two views.

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

N/A

TESTING INSTRUCTIONS

Will add unit tests, wanted to check the approach first.

ADDITIONAL INFORMATION

Has associated issue:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

codecov · 2021-06-16T01:22:56Z

Codecov Report

Merging #15184 (7c5f217) into master (ab153e6) will increase coverage by 0.08%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master   #15184      +/-   ##
==========================================
+ Coverage   77.14%   77.23%   +0.08%     
==========================================
  Files         973      973              
  Lines       50473    50496      +23     
  Branches     6183     6183              
==========================================
+ Hits        38938    39000      +62     
+ Misses      11329    11290      -39     
  Partials      206      206

Flag	Coverage Δ
hive	`81.42% <100.00%> (+<0.01%)`	⬆️
mysql	`81.69% <100.00%> (+<0.01%)`	⬆️
postgres	`81.71% <100.00%> (+<0.01%)`	⬆️
presto	`81.41% <100.00%> (?)`
python	`82.24% <100.00%> (+0.16%)`	⬆️
sqlite	`81.34% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
superset/views/chart/views.py	`88.63% <ø> (ø)`
superset/views/core.py	`75.54% <ø> (-0.04%)`	⬇️
superset/connectors/connector_registry.py	`83.33% <100.00%> (+4.64%)`	⬆️
superset/views/sql_lab.py	`60.68% <0.00%> (ø)`
superset/models/core.py	`90.02% <0.00%> (+0.26%)`	⬆️
superset/connectors/sqla/models.py	`89.87% <0.00%> (+1.41%)`	⬆️
superset/db_engine_specs/presto.py	`90.31% <0.00%> (+5.89%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ab153e6...7c5f217. Read the comment docs.

dpgaspar

Looking good, we should add some tests for get_user_datasources

dpgaspar · 2021-06-16T08:15:25Z

superset/connectors/connector_registry.py

+                    schema_perm
+                    and security_manager.can_access("schema_access", schema_perm)
+                ):
+                    user_datasources.extend(datasources)


oh! this makes me question the validity of the get methods on the MVC and API, they both depend on: https://github.com/apache/superset/blob/master/superset/views/base.py#L581

Hmmm, do you think I also need to check for security_manager.can_access_all_datasources() here?

can_access_database handles it

dpgaspar

LGTM

john-bodley · 2021-06-23T20:15:38Z

superset/connectors/connector_registry.py

+        user_datasources = set()
+        for datasource_class in ConnectorRegistry.sources.values():
+            user_datasources.update(
+                session.query(datasource_class)


@betodealmeida @dpgaspar I believe this logic, i.e., relying on the fragile perm and schema_perm columns in the database, breaks if people are using a custom security manager. Should we be calling get_datasources_accessible_by_user instead?

Note I sense currently Superset doesn't do a great job of differentiating between metadata and data access. The get_datasources_accessible_by_user is somewhat of a misnomer as it is merely used for metadata access (for an awareness perspective).

@john-bodley, get_datasources_accessible_by_user calls ConnectorRegistry.query_datasources_by_permissions, which uses the same logic I'm using here.

Since we're getting user_perms and schema_perms from the security manager (lines 108, 109) doesn't it means this works with custom security managers?

@betodealmeida @john-bodley I think that the question here is that AirBnb totally overrides the DB permissions, they do that by overriding the security manager, their permission "backend" is something totally different. So that's possible if all permission checks are done on the security manager.

Makes me think how can that work on the REST API defined filters

@dpgaspar that's correct. We actually don't use any of the FAB logic and thus we completely bypass the securiy_manager.user_view_menu_names and reliance of the datasource_class.perm and datasource_class.schema_perm database columns.

@betodealmeida if the get_datasources_accessible_by_user method isn't ideal, then I think the alternatively would be to move the get_user_datasources method to the security manager so deployments can overwrite the logic if necessary.

This reverts commit 216e2b8.

This reverts commit f230b19.

* fix: datasource payload is incorrect * Add tests, clean code

betodealmeida requested a review from dpgaspar June 16, 2021 00:48

pull-request-size bot added the size/M label Jun 16, 2021

dpgaspar reviewed Jun 16, 2021

View reviewed changes

betodealmeida force-pushed the ch16905 branch from 31e46a6 to 31af49a Compare June 22, 2021 21:43

fix: datasource payload is incorrect

8330c35

betodealmeida force-pushed the ch16905 branch from 31af49a to 8330c35 Compare June 22, 2021 21:52

pull-request-size bot added size/L and removed size/M labels Jun 22, 2021

Add tests, clean code

7c5f217

betodealmeida force-pushed the ch16905 branch from 22ed433 to 7c5f217 Compare June 22, 2021 23:30

betodealmeida requested a review from dpgaspar June 22, 2021 23:31

dpgaspar approved these changes Jun 23, 2021

View reviewed changes

betodealmeida merged commit 216e2b8 into apache:master Jun 23, 2021

john-bodley reviewed Jun 23, 2021

View reviewed changes

serenajiang added a commit to airbnb/superset-fork that referenced this pull request Jun 23, 2021

Revert "fix: datasource payload is incorrect (apache#15184)"

658ab0f

This reverts commit 216e2b8.

michellethomas pushed a commit to airbnb/superset-fork that referenced this pull request Jun 30, 2021

Revert "fix: datasource payload is incorrect (apache#15184)"

f230b19

This reverts commit 216e2b8.

john-bodley mentioned this pull request Jun 30, 2021

refactor: Moving get_user_datasources to security manager #15467

Merged

8 tasks

john-bodley added a commit to airbnb/superset-fork that referenced this pull request Jun 30, 2021

Revert "Revert "fix: datasource payload is incorrect (apache#15184)""

d916852

This reverts commit f230b19.

john-bodley added a commit to airbnb/superset-fork that referenced this pull request Jul 2, 2021

Revert "Revert "fix: datasource payload is incorrect (apache#15184)""

ee51662

This reverts commit f230b19.

cccs-RyanS pushed a commit to CybercentreCanada/superset that referenced this pull request Dec 17, 2021

fix: datasource payload is incorrect (apache#15184)

a92725a

* fix: datasource payload is incorrect * Add tests, clean code

QAlexBall pushed a commit to QAlexBall/superset that referenced this pull request Dec 29, 2021

fix: datasource payload is incorrect (apache#15184)

ff2b4c5

* fix: datasource payload is incorrect * Add tests, clean code

cccs-rc pushed a commit to CybercentreCanada/superset that referenced this pull request Mar 6, 2024

fix: datasource payload is incorrect (apache#15184)

8026395

* fix: datasource payload is incorrect * Add tests, clean code

mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 1.3.0 labels Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: datasource payload is incorrect #15184

fix: datasource payload is incorrect #15184

betodealmeida commented Jun 16, 2021

codecov bot commented Jun 16, 2021 •

edited

dpgaspar left a comment

dpgaspar Jun 16, 2021

betodealmeida Jun 17, 2021

dpgaspar Jun 23, 2021

dpgaspar left a comment

john-bodley Jun 23, 2021 •

edited

betodealmeida Jun 24, 2021

dpgaspar Jun 24, 2021

john-bodley Jun 28, 2021

john-bodley Jun 28, 2021

fix: datasource payload is incorrect #15184

fix: datasource payload is incorrect #15184

Conversation

betodealmeida commented Jun 16, 2021

SUMMARY

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

codecov bot commented Jun 16, 2021 • edited

Codecov Report

dpgaspar left a comment

Choose a reason for hiding this comment

dpgaspar Jun 16, 2021

Choose a reason for hiding this comment

betodealmeida Jun 17, 2021

Choose a reason for hiding this comment

dpgaspar Jun 23, 2021

Choose a reason for hiding this comment

dpgaspar left a comment

Choose a reason for hiding this comment

john-bodley Jun 23, 2021 • edited

Choose a reason for hiding this comment

betodealmeida Jun 24, 2021

Choose a reason for hiding this comment

dpgaspar Jun 24, 2021

Choose a reason for hiding this comment

john-bodley Jun 28, 2021

Choose a reason for hiding this comment

john-bodley Jun 28, 2021

Choose a reason for hiding this comment

codecov bot commented Jun 16, 2021 •

edited

john-bodley Jun 23, 2021 •

edited