zos3: support update of some of workloads #1425

muhamadazmy · 2021-10-12T08:36:07Z

The provision engine need to handle the failure to update differently than an install update. The problem is a failed update does not mean that the reservation is gone.

For example, increase a disk size can fail, but doesn't mean the disk is gone so the disk state should still be okay but we need to report back that the operation of the resize has failed. This is currently is not possible, because a workload can only hold ONE error message that is associated with the workload state.

I suggest that a workload can have instead (State) which is basically the current disk state. And also a List of (states) that are associated with operations.

For example

Disk Workload {
   State: Ok,
   Result: {
       Size: "current disk size",
    } 
   Operations {
      {State: Error, Operatino: update, Time: Timestamp, message: "failed to increase disk size, no enough space"}
      {State: Ok, Operation: install, Time: Timestamp, message: ""}
   }
}

Deployment storage rewrite to support transactions
Disk resize
ZDB resize
Monitoring should use transaction data to correctly calculate usage

The text was updated successfully, but these errors were encountered:

muhamadazmy · 2022-01-04T11:04:21Z

Requirements

Was thinking about the cleanest way to implement this without sacrifice correctness of the operation. Came across this set of requirements

A deployment is a container of some workloads that can interact together
A change on the workload can be to create, update or delete
Not all workload types can execute an update
An update on the workload need to change the state if the update was successful, otherwise a failure to update need to reflect that the transaction has failed, BUT the state is not (the state also include workload data)

This all pushes toward a different persisted workload storage (currently deployments are stored as single files). Instead the abstract structure of storage need to reflect "transactions" where transactions can itself container 'data', 'state' and error. and the final state of the object is computed by scanning the transactions and compute the final state/data of the workload.

deployment {
   ...
   workload: <name> {
     type: <type>
     transactions: [
        {op: install, data: {...}, state: ok, error: none}
        {op: update, data: {...}, state: not-changed: error}
     ]
   }
}

for example the previous workload state will still be ok, but with the data provided on install.
If we push another update call that passes, then it will has {op: update, data: {..}, state: ok} and then the final state of the workload will be ok, and the associated data is data from last update operations

WORK IN PRGRESS

muhamadazmy · 2022-01-04T12:15:22Z

Get Deployment operation need to be backward compatible. A Get operation need to return the same data (current state) of the deployment.
A new call should return history of operations with data, and errors.

xmonader added the type_feature New feature or request label Oct 28, 2021

xmonader added this to Backlog in 3.0.1 via automation Oct 28, 2021

xmonader added this to Backlog in TFGrid_3.0.0 via automation Oct 28, 2021

xmonader added this to the next milestone Oct 28, 2021

muhamadazmy moved this from Backlog to Accepted in 3.0.1 Oct 28, 2021

muhamadazmy self-assigned this Oct 28, 2021

muhamadazmy moved this from Accepted to In Progress in 3.0.1 Oct 28, 2021

muhamadazmy moved this from In Progress to Accepted in 3.0.1 Oct 28, 2021

muhamadazmy removed this from Accepted in 3.0.1 Nov 15, 2021

muhamadazmy added this to To do in 3.0 via automation Nov 15, 2021

muhamadazmy moved this from To do to Accepted in 3.0 Nov 22, 2021

muhamadazmy moved this from Accepted to To do in 3.0 Nov 25, 2021

muhamadazmy moved this from Accepted to Backlog in 3.0 Dec 7, 2021

muhamadazmy removed this from Backlog in 3.0 Dec 28, 2021

muhamadazmy added this to Accepted in 3.0.9+ X via automation Dec 28, 2021

muhamadazmy mentioned this issue Jan 4, 2022

provisiond new storage interface and implementation #1537

Closed

muhamadazmy moved this from Accepted to In progress in 3.0.9+ X Jan 12, 2022

muhamadazmy removed this from In progress in 3.0.9+ X Jan 13, 2022

muhamadazmy added this to Accepted in 3.1.0 via automation Jan 13, 2022

muhamadazmy moved this from Accepted to In progress in 3.1.0 Jan 13, 2022

muhamadazmy removed this from In progress in 3.1.0 Jan 17, 2022

muhamadazmy added this to Accepted in 3.0.9+ X via automation Jan 17, 2022

muhamadazmy moved this from Accepted to In progress in 3.0.9+ X Jan 17, 2022

muhamadazmy closed this as completed Jan 17, 2022

TFGrid_3.0.0 automation moved this from Backlog to Done Jan 17, 2022

3.0.9+ X automation moved this from In progress to Done Jan 17, 2022

muhamadazmy moved this from Done to Verification in 3.0.9+ X Jan 17, 2022

muhamadazmy moved this from Verification to Done in 3.0.9+ X Jan 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zos3: support update of some of workloads #1425

zos3: support update of some of workloads #1425

muhamadazmy commented Oct 12, 2021 •

edited

Loading

muhamadazmy commented Jan 4, 2022

muhamadazmy commented Jan 4, 2022

zos3: support update of some of workloads #1425

zos3: support update of some of workloads #1425

Comments

muhamadazmy commented Oct 12, 2021 • edited Loading

muhamadazmy commented Jan 4, 2022

Requirements

muhamadazmy commented Jan 4, 2022

muhamadazmy commented Oct 12, 2021 •

edited

Loading