feat: update lambda state machine to accommodate tenantId #367

Bingjiling · 2021-06-25T20:42:07Z

Issue #, if available:

Description of changes:

State machine and glue script change to accommodate tenantId
Depends on PR feat: Multi tenancy bulk export fhir-works-on-aws-persistence-ddb#90

Checklist:

Have you successfully deployed to an AWS account with your changes?
Have you written new tests for your core changes, as applicable?

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

rsmayda · 2021-06-25T22:23:29Z

bulkExport/glueScripts/export-script.py

+    filtered_tenant_id_frame = Filter.apply(frame = original_data_source_dyn_frame,
+                               f = lambda x:
+                               x['_tenantId'] == tenantId)


instead of doing glue side filtering would it be better to have a secondary index on the tenantId? This will become an expensive operation if we have to scan across all tenants

Emm that's a great question. In the design doc, it specified the filtering is to be done as part of the Glue job, and secondary index was not introduced for any tables. @carvantes Any thoughts?

The glue job always scans the entire DDB table no matter what, there's no way to use a query. This is a limitation on the current AWS Glue + DDB integration.

There are existing scenarios where this is far from ideal. e.g. exporting a single FHIR resource type or exporting the resources modified in the last hour will both scan the entire table.

There is room for improvement on the bulk export solution, but we are not changing the fundamentals here.

bulkExport/glueScripts/export-script.py

* feat: add tenantId attribute to Cognito user pool (#348) * feat: remove unneeded scope checks in authorizer (#347) * feat: update lambda state machine to accommodate tenantId (#367) * feat: add "enableMultiTenancy" CFN parameter (#381) * test: add multi-tenancy integ tests (#387) * fix: remove _id, _tenantId from bulk export results (#384) * feat: Group export scripts (#389) * fix: add multi-tenant metadata route (#392) * fix: allow more concurrent export jobs for multi-tenant deployments (#397) * test: integ tests for Group export (#393) * feat: add ES hard delete config value (#398) * docs: update postman collection and docs to use Id token (#399) * docs: add multi-tenancy docs (#400) Co-authored-by: Yanyu Zheng <yz2690@columbia.edu> BREAKING CHANGE: The Cognito IdToken is now used instead of the accessToken to authorize requests.

* feat: update lambda state machine to accommodate tenantId (#367) * feat: add "enableMultiTenancy" CFN parameter (#382) * fix: pass enableMultiTenancy to ES * fix: remove _id, _tenantId from bulk export results * feat: Group export scripts (#389) * chore: script generating patient compartment search params * feat: update Glue script for group export * Upload patient compartment jsons to S3 * fix: allow more concurrent export jobs for multi-tenant deployments (#397) * feat: add ES hard delete config value (#398) * docs: add multi-tenancy docs (#400) * fix: pass enableMultiTenancy flag to s3DataService * test: add multi-tenancy integ tests (#387) * test: integ tests for Group export (#393) * chore: upgrade dependencies * add public multi-tenant routes * add system/read and user/delete permissions to defaults * test: fix tests for smart multi-tenancy * test: update gh actions to also test multi-tenant environment * docs: update bulk export docs to mention group export Co-authored-by: Yanyu Zheng <yz2690@columbia.edu>

feat: update lambda state machine to accomandate tenantId

ea6f51c

github-actions bot added the size/s label Jun 25, 2021

Bingjiling changed the title ~~[WIP] feat: update lambda state machine to accomandate tenantId~~ [WIP] feat: update lambda state machine to accommodate tenantId Jun 25, 2021

rsmayda reviewed Jun 25, 2021

View reviewed changes

use tenant specific S3 path

d3460a5

Bingjiling requested review from nguyen102 and carvantes June 28, 2021 19:00

Bingjiling changed the title ~~[WIP] feat: update lambda state machine to accommodate tenantId~~ feat: update lambda state machine to accommodate tenantId Jun 28, 2021

carvantes approved these changes Jun 29, 2021

View reviewed changes

bulkExport/glueScripts/export-script.py Outdated Show resolved Hide resolved

update comment

c2b6228

Bingjiling merged commit 9fedf56 into feat-multitenancy Jun 30, 2021

Bingjiling deleted the multi-tenancy-bulk-export branch June 30, 2021 14:37

carvantes pushed a commit that referenced this pull request Jul 8, 2021

feat: update lambda state machine to accommodate tenantId (#367)

3350d0b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: update lambda state machine to accommodate tenantId #367

feat: update lambda state machine to accommodate tenantId #367

Bingjiling commented Jun 25, 2021 •

edited

rsmayda Jun 25, 2021

Bingjiling Jun 28, 2021

carvantes Jun 29, 2021

feat: update lambda state machine to accommodate tenantId #367

feat: update lambda state machine to accommodate tenantId #367

Conversation

Bingjiling commented Jun 25, 2021 • edited

rsmayda Jun 25, 2021

Choose a reason for hiding this comment

Bingjiling Jun 28, 2021

Choose a reason for hiding this comment

carvantes Jun 29, 2021

Choose a reason for hiding this comment

Bingjiling commented Jun 25, 2021 •

edited