Please follow the steps below to set up the Azure environment.
Clone or download this repository and navigate to the project's root directory.
Before you can upload assets to the Synapse workspace, you will need to add your IP address:
- Go to the Synapse resource you created in the previous step.
- Navigate to "Networking" under "Security" on the left-hand side of the page.
- At the top of the screen, click "+ Add client IP".
- Your IP address should now be visible in the IP list.
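The portal steps above can also be scripted. The following is a minimal sketch, assuming the Azure CLI and its `az synapse workspace firewall-rule create` command are available and that the client address is IPv4. The rule name, workspace name, and resource group are hypothetical placeholders, not names from this repository. The function only builds the command and validates the address; it does not run anything.

```python
import ipaddress


def firewall_rule_command(client_ip, workspace, resource_group, rule_name="AllowClientIP"):
    """Build (but do not run) an Azure CLI command that allows one client IP."""
    # Raises ValueError if client_ip is not a valid IPv4 address.
    ip = ipaddress.IPv4Address(client_ip)
    return [
        "az", "synapse", "workspace", "firewall-rule", "create",
        "--name", rule_name,  # hypothetical rule name
        "--workspace-name", workspace,
        "--resource-group", resource_group,
        # A single-address rule uses the same start and end IP.
        "--start-ip-address", str(ip),
        "--end-ip-address", str(ip),
    ]
```

For example, `firewall_rule_command("203.0.113.10", "my-workspace", "my-rg")` returns an argument list that could be passed to `subprocess.run` after signing in with `az login`.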
To perform the necessary actions in the Synapse workspace, you will need to grant additional access:
- Go to the Azure Data Lake Storage Account for your Synapse Workspace.
- Go to "Access Control (IAM)" > "+ Add" > "Add role assignment".
- Search for and select the "Storage Blob Data Contributor" role and click "Next".
- Click "+ Select members", search for and select your username and the adls-spn created in the previous steps, and click "Select".
- Click "Review and assign" at the bottom.
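The role assignment above can also be expressed as an Azure CLI call. This is a minimal sketch, assuming the Azure CLI is installed and that the storage account lives at the standard resource ID path; the subscription ID, resource group, account name, and assignee values are placeholders you would supply. The function only builds the command and does not execute it.

```python
def role_assignment_command(assignee, subscription_id, resource_group, storage_account):
    """Build (but do not run) an `az role assignment create` command."""
    # Scope the assignment to the storage account only, not the whole subscription.
    scope = (
        f"/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.Storage/storageAccounts/{storage_account}"
    )
    return [
        "az", "role", "assignment", "create",
        "--assignee", assignee,  # user principal name or service principal ID
        "--role", "Storage Blob Data Contributor",
        "--scope", scope,
    ]
```

Because the steps above assign the role to two principals (your username and the adls-spn), the command would be built once for each.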
- Launch the Synapse workspace.
- Select the subscription and workspace name you are using for this solution accelerator.
- In Synapse Studio, navigate to the "Data" hub.
- Select "Linked".
- Under the category "Azure Data Lake Storage Gen2", you'll see an item with a name like xxxxx (xxxxx - Primary).
- Select the container named govdatahubadlsfs (Primary).
- Select "New folder", enter rawdata, and select "Create".
- Select "Upload" to upload the following sample datasets from the data folder of this repository to the rawdata folder you created:
  - programs.csv
  - program_types.csv
  - contacts.csv
  - program_participants.csv
  - program_metrics_outcomes.csv
  - programtype_metrics_outcomes.csv
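Before uploading, it can help to confirm that all six sample files are present locally. The following is a minimal sketch using only the standard library; it assumes the files sit directly inside a local folder (named `data` in the usage note) and the helper name `missing_datasets` is hypothetical.

```python
from pathlib import Path

# The six sample datasets listed in the upload step.
SAMPLE_DATASETS = [
    "programs.csv",
    "program_types.csv",
    "contacts.csv",
    "program_participants.csv",
    "program_metrics_outcomes.csv",
    "programtype_metrics_outcomes.csv",
]


def missing_datasets(data_dir):
    """Return the names of expected sample files not found in data_dir."""
    data_dir = Path(data_dir)
    return [name for name in SAMPLE_DATASETS if not (data_dir / name).is_file()]
```

Calling `missing_datasets("data")` from the repository root returns an empty list when every file is in place, and otherwise the names that still need to be located.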
- Launch the Synapse workspace.
- Go to "Develop", click the "+", and click "Import" to select all notebooks from this repository's folder.
- For each of the notebooks, select "Attach to" > "spark1" in the top dropdown.
- Update the following parameters in LoadData.ipynb and publish the changes:
  - data_lake_account_name = ''
  - file_system_name = ''
- Run LoadData.ipynb and CreateCDM.ipynb.
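To illustrate what the two parameters identify, the sketch below combines them into an ADLS Gen2 `abfss://` URI for a file in the rawdata folder. How LoadData.ipynb actually uses these values is not shown here, so the helper `rawdata_uri` and the placeholder values are assumptions for illustration only.

```python
# Placeholder values; replace with your storage account and container names.
data_lake_account_name = "mydatalake"
file_system_name = "myfilesystem"


def rawdata_uri(account, file_system, file_name, folder="rawdata"):
    """Build an ADLS Gen2 URI of the form abfss://<container>@<account>.dfs.core.windows.net/<path>."""
    return f"abfss://{file_system}@{account}.dfs.core.windows.net/{folder}/{file_name}"


if __name__ == "__main__":
    print(rawdata_uri(data_lake_account_name, file_system_name, "programs.csv"))
```

Here `data_lake_account_name` is the storage account name only (without the `.dfs.core.windows.net` suffix) and `file_system_name` is the container that holds the rawdata folder.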
- Open the Power BI report in this repository.
- Click the "Transform data" dropdown and click "Data source settings".
- Select the Azure Synapse Workspace connection, select "Change Source...", provide your SQL Server Database name under Server, and click "OK".
  - To find this value, navigate to the Synapse Workspace overview page in the Azure Portal and copy the Serverless SQL endpoint.
- Select "Edit Permissions", under Credentials select "Edit", sign in to your Microsoft Account, click "OK", and click "Close".
- Select "Refresh".
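A common slip in the steps above is pasting the wrong endpoint into the Server field. The following is a minimal sketch of a sanity check, assuming the serverless SQL endpoint follows the usual `<workspace>-ondemand.sql.azuresynapse.net` pattern; the helper name is hypothetical, and the check does not contact Azure.

```python
SERVERLESS_SUFFIX = "-ondemand.sql.azuresynapse.net"


def looks_like_serverless_endpoint(server):
    """Return True if the value has the shape of a Synapse serverless SQL endpoint."""
    host = server.strip().lower()
    # Require a non-empty workspace name in front of the suffix.
    return host.endswith(SERVERLESS_SUFFIX) and len(host) > len(SERVERLESS_SUFFIX)
```

For instance, a value such as `myworkspace.sql.azuresynapse.net` (without `-ondemand`) would fail the check, which suggests a different endpoint was copied from the overview page.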