This repository performs the following:
- Generates bash scripts and SQL queries from Excel and CSV mapping documents (`generate_sql_output.py`)
- Converts mapping documents from spreadsheets to CSV and, when applicable, updates the config file with the new CSV file name (`mapping_converter.py`)
- Generates DDL scripts based on config and schema files (`generate_ddl_output.py`)
This project requires Python 3.9+. To verify that a suitable distribution of Python is installed, use the version command:
$ python --version
NOTE: macOS ships with Python 2 by default as the `python` command, and will typically also include a Python 3 distribution available as the command `python3`. If this is the case on your machine, use `python3` instead for all setup steps.
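The same requirement can also be checked from inside the interpreter, which is handy for failing fast in scripts (a minimal sketch, not part of this repository):

```python
import sys

# The project requires Python 3.9 or newer (see above).
if sys.version_info < (3, 9):
    raise SystemExit(f"Python 3.9+ required, found {sys.version.split()[0]}")
print(f"OK: running Python {sys.version_info.major}.{sys.version_info.minor}")
```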
There are several ways to host this project, including:
- Within an IDE, such as PyCharm
- In a Python isolation environment, such as virtualenv
- In a Docker container
1. Clone this repository to the target build machine.
2. Create a virtual environment for the project, then activate it.
   Linux/macOS:
   $ python -m venv venv
   $ . venv/bin/activate
   Windows:
   $ python -m venv venv
   $ .\venv\Scripts\activate.bat
3. Use pip to install all project dependencies into the virtual environment:
   $ pip install -r requirements.txt
4. Run the unit tests to verify the install:
   $ python -m unittest
To generate SQL, run the `transform_generator.generate_sql_output` module from the root of this project.
Example files are included in this project's test directory and can be used to quickly test SQL generation. The following command uses the relative paths of these files and writes all output to an output folder at the project root.
$ python -m transform_generator.generate_sql_output --config_path test/Resources/positive_cases/config --schema_path test/Resources/positive_cases/schema --mapping_sheet_path test/Resources/positive_cases/mapping --project_config_path test/Resources/positive_cases/project_config/project_config_test.csv --output_datafactory output/datafactory --output_databricks output/databricks
- `--config_path`: path to the config directory for the transform generator.
- `--schema_path`: path to the directory containing .csv files for schemas.
- `--mapping_sheet_path`: path to the directory containing mapping sheets.
- `--project_config_path`: semicolon-delimited list of paths to the project config files.
- `--output_databricks`: path to the folder where Databricks output is written. If folders in this path do not exist, they will be created.
- `--output_datafactory`: path to the folder where Data Factory output is written. If folders in this path do not exist, they will be created.
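Note that `--project_config_path` accepts several files in a single argument, separated by semicolons. Splitting such a value is a plain semicolon split; for example (illustrative file names, not from this repository):

```python
# A semicolon-delimited --project_config_path value (hypothetical paths)
project_config_path = "project_config/team_a.csv;project_config/team_b.csv"

# Split into the individual project config file paths
paths = project_config_path.split(";")
print(paths)  # → ['project_config/team_a.csv', 'project_config/team_b.csv']
```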
To generate DDL, run the `transform_generator.generate_ddl_output` module from the root of this project. The following command uses the example files in the test directory:
$ python -m transform_generator.generate_ddl_output --config_path test/Resources/positive_cases/config --schema_path test/Resources/positive_cases/schema --mapping_sheet_path test/Resources/positive_cases/mapping --project_config_path test/Resources/positive_cases/project_config/project_config_test.csv --output_datafactory output/datafactory --output_databricks output/databricks
- `--config_path`: path to the config directory for the transform generator.
- `--schema_path`: path to the directory containing .csv files for schemas.
- `--mapping_sheet_path`: path to the directory containing mapping sheets.
- `--project_config_path`: semicolon-delimited list of paths to the project config files.
- `--output_databricks`: path to the folder where Databricks output is written. If folders in this path do not exist, they will be created.
- `--output_datafactory`: path to the folder where Data Factory output is written. If folders in this path do not exist, they will be created.
Building the documentation requires an activated virtual environment with the requirements.txt dependencies installed. This is accomplished by completing the first three Setup steps.
Transformation Generator uses the static site generator MkDocs to build project documentation. Once the project has been set up in a virtual environment, the documentation can be built for local development as follows:
1. Navigate to the `documentation` directory:
   $ cd ./documentation
2. Start the development server:
   $ mkdocs serve
All documentation source files are contained within the `documentation/docs` directory and are written in Markdown. By default, the built site is available at http://localhost:8000/
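MkDocs reads its configuration from a `mkdocs.yml` file in the directory `mkdocs serve` is run from. A minimal sketch of what such a file could look like for this layout (illustrative only; the project's actual `mkdocs.yml` may differ, and the site name here is an assumption):

```yaml
# documentation/mkdocs.yml — illustrative sketch
site_name: Transformation Generator   # assumed name, not confirmed by this README
docs_dir: docs                        # Markdown sources live in documentation/docs
```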
Generating a unit test coverage file, or updating a pre-existing one, can be done with the following steps:
1. Activate the virtual environment created during setup:
   $ . venv/bin/activate
2. Run the following commands to clear old data and generate a new code coverage analysis:
   $ coverage erase
   $ coverage run -m unittest
3. View the report:
   $ coverage report
Running the webserver requires an activated virtual environment with the requirements.txt dependencies installed. This is accomplished by completing the first three Setup steps.
This application provides an API which can be hosted locally. To run it, execute the `transform_generator.api.api` module from the root of the project, within a virtual environment:
$ python -m transform_generator.api.api
The API will start listening on port 8001 by default.
Swagger documentation outlining the various endpoints offered by the API can be accessed at `/docs`.