Historical analysis of Cloud Observability data

This is a companion repository for the blog post which explains in more details the architecture and steps to provision and analyze data.

Architecture:

Step 1: Creating the resources

See creating resources directly with terraform below as an alternative to using the IBM Schematics service.

Create resources using schematics

Log in to IBM Cloud
Navigate to Create Schematics Workspaces Under the Specify Template section, verify:
1. Repository URL is https://github.com/IBM-Cloud/log-archive-analysis
2. Terriform version is terraform_v1.1
3. Click Next
Under Workspace details,
1. Provide a workspace name : log-archive.
2. Choose a Resource Group and a Location. Remember the resource group for the next step.
3. Click on Next.
Verify the details and then click on Create.
Under Variables change the resource-group-name and the other defaults as desired.
Scroll to the top of the page and click Apply plan. Check the logs to see the status of the services created.

Get schematics output

Use the IBM Cloud Shell to run the following commands. Alternatively you will find instructions to download and install ibmcloud and jq for your operating environment in the Getting started with tutorials guide.

The configuration variables retrieved below will be used in future steps.

Get the list of workspaces, note the ID column, set the shell variable:
```
ibmcloud schematics workspace list
```
Set the WORKSPACE_ID variable:
```
WORKSPACE_ID=YOUR_WORKSPACE_ID
```

Get the configuration for the logging dashboard settings for archiving. Used in step 2.

ibmcloud schematics output --id $WORKSPACE_ID --output json | jq -r '.[0].output_values[].logging_dashboard_settings_archiving.value'

Get the configuration for the jupyter notebook configuration for python. Used in step 3.

ibmcloud schematics output --id $WORKSPACE_ID --output json | jq -r '.[0].output_values[].jupyter_notebook_configuration_python.value'

Step 2: Enable Archiving

Open the Activity Tracker instance list.
Create an Activity Tracker in the same region as your resources from the previous step (if one does not exist).
Open the dashboard of the Activity Tracker.
Click the Settings cog, click Archiving and then click Manage.
Click Enable Archiving.
Select IBM Cloud Object Storage in the Provider drop-down menu.
Fill in the values with the items generated in Step 1.
Click Save

Wait until archives are visible in bucket - can take 24 hours.

Step 3: Jupyter Notebook

Open the Watson Studio resource - start in the resource list.
Click Launch in IBM Cloud Pak for Data.
If a pop-up screen/overlay page is displayed, dismiss it. It is not needed for this post.
Click the + in the Project section to create a new project. Click Create an empty project.
Provide a name. In the Select storage service section, select log-archive-cos from the drop-down menu and click Create.
In the project, create a Jupyter notebook
Click the Assets panel at the top.
Click New asset.
Type "jupyter" in the search, which should display a Jupyter notebook editor card. Click the card.
Name the notebook and click the From URL panel at the top.
Leave the default runtime (IBM Runtime 22.1 on Python 3.9 XS 2vCPU 8 GB RAM for me) and paste this string for the Notebook URL: https://github.com/IBM-Cloud/log-archive-analysis/blob/master/logarchive.ipynb. Then, click Create.

Clean up

Navigate to Schematics Workspaces.
Click your workspace to open.
Click Actions > Destroy resources and follow instructions.
Wait for successful complete. If it fails try again.
Click Actions > Delete workspace and follow instructions.

Troubleshooting

Jupyter notebook query fails with Unable to infer schema for JSON. It must be specified manually. Maybe the COS bucket used for log archiving is empty

Is the logging or activity tracker dashboard configured to archive?
Are the variables correct including the crn that ends in ::

Step 1: Creating resources directly with Terraform alternative

This is an alternative to using the IBM Schematics Service.

See CLI Getting Started.

Initialize and run terraform:

cp template.local.env local.env
edit local.env; # make changes as needed
source local.env
terraform init
terraform apply
terraform output logging_dashboard_settings_archiving
terraform output jupyter_notebook_configuration_python

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
drawio		drawio
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
logarchive.ipynb		logarchive.ipynb
main.tf		main.tf
provider.tf		provider.tf
requirements.txt		requirements.txt
tst.py		tst.py
variables.tf		variables.tf
versions.tf		versions.tf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Historical analysis of Cloud Observability data

Step 1: Creating the resources

Create resources using schematics

Get schematics output

Step 2: Enable Archiving

Step 3: Jupyter Notebook

Clean up

Troubleshooting

Step 1: Creating resources directly with Terraform alternative

About

Releases

Packages

Languages

License

IBM-Cloud/log-archive-analysis

Folders and files

Latest commit

History

Repository files navigation

Historical analysis of Cloud Observability data

Step 1: Creating the resources

Create resources using schematics

Get schematics output

Step 2: Enable Archiving

Step 3: Jupyter Notebook

Clean up

Troubleshooting

Step 1: Creating resources directly with Terraform alternative

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages