Skip to content

Exporting data from Dynamics 365 Business Central to Azure data lake storage

License

Notifications You must be signed in to change notification settings

JulianSchmidtke/bc2adls

 
 

Repository files navigation

Project

This tool is an experiment on Dynamics 365 Business Central with the sole purpose of discovering the possibilities of having data exported to an Azure Data Lake. To see the details of how this tool is supported, please visit the Support page. In case you wish to use this tool for your next project and engage with us, you are welcome to write to bc2adls@microsoft.com. As we are a small team, please expect delays in getting back to you.

Introduction

The bc2adls tool is used to export data from Dynamics 365 Business Central (BC) to Azure Data Lake Storage and expose it in the CDM folder format. The components involved are the following,

  • the businessCentral folder holds a BC extension called Azure Data Lake Storage Export (ADLSE) which enables export of incremental data updates to a container on the data lake. The increments are stored in the CDM folder format described by the deltas.cdm.manifest.json manifest.
  • the synapse folder holds the templates needed to create an Azure Synapse pipeline that consolidates the increments into a final data CDM folder.

The following diagram illustrates the flow of data through a usage scenario- the main points being,

  • Incremental update data from BC is moved to Azure Data Lake Storage through the ADLSE extension into the deltas folder.
  • Triggering the Synapse pipeline(s) consolidates the increments into the data folder.
  • The resulting data can be consumed by applications, such as Power BI, in the following ways:
    • CDM: via the data.cdm.manifest.json manifest
    • CSV/Parquet: via the underlying files for each individual entity inside the data folder
    • Spark/SQL: via shared metadata tables

Architecture

More details:

Latest notable changes

Pull request Changes
55 A much awaited request to allow the BC extension to read from the replica database saves up resources that can otherwise be dedicated to normal ERP operations, has now been implemented. This change is dependent on the version 21 of the application.
59 The default rounding principles caused the consolidated data to have a maximum of two decimal places even though the data in the deltas may have had higher decimal precision. Added an applied trait to all decimal fields so that they account for up to 5 decimal places.
54 Fixes irregularities on the System Audit fields. (1) Very old records do not appear in the lake sometimes because the SystemCreatedAt field is set to null. This field is now artificaly initialized to a date so that it appears in the lake, and (2) The SystemID field may be repeated over different records belonging to different companies in the same table. Thus, the uniqueness contraint has been fixed.
49 Entities using the Parquet file format can now automatically be registered as a shared metadata table that is managed in Spark but can also be queried using Serverless SQL. You can find the full feature guide here.
47 The ability to simultaneously export data from multiple companies has been introduced. This is expected to save time and effort in cases which required users to sequence the runs for different companies one after the other.
43 Intermediate staging data is no longer saved in CDM format. This eliminates potential conflicts during concurrent updates to the manifest. This does not affect the final data output, which continues to be in CDM format.
33 Fixing issue related to localizations of booleans and options/ enums.
31 Permissions corrected to direct permissions.
28 The AL app is upgraded to Dynamics 365 Business Central version 20. An archive has been created to keep the older versions at the archived folder.
23 The setting in Consolidation_OneEntity that limited concurrent execution of the pipeline to one instance has been removed. Now, an infinite number of pipeline instances is allowed to run concurrently.
20 Data on the lake can be chosen to be stored on the Parquet format, thus improving its fidelity to its original in Business Central.
16 The Consolidation_CheckForDeltas pipeline now contains a fail activity that is triggered when no directory is found in /deltas/ for an entity listed in the deltas.manifest.cdm.json. This may occur when no new deltas have been exported since the last execution of the consolidation pipeline. Other parallel pipeline runs are not affected.
14 It is possible now to select all fields in a table for export. Those fields that are not allowed to be exported, say flow fields, are not selected.
13 A template is inserted in the OnAfterOnDatabaseDelete procedure, so that deletions of archive table records, are not synchronized to the data lake. This helps in selected tables in the data lake continuing to hold on to records that may be removed from the BC database, for house-keeping purposes. This is especialy relevant for ledger entry tables.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

About

Exporting data from Dynamics 365 Business Central to Azure data lake storage

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • AL 98.0%
  • Jupyter Notebook 1.6%
  • Perl 0.4%