Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chaosengine was not clear after the experiment was over #1398

Closed
badashanren opened this issue Apr 2, 2020 · 9 comments
Closed

Chaosengine was not clear after the experiment was over #1398

badashanren opened this issue Apr 2, 2020 · 9 comments
Labels
area/chaos-scheduler Regarding the chaos scheduler kind/feature project/community Issues raised by community members
Milestone

Comments

@badashanren
Copy link

What happened:
Chaosengine was not clear after the experiment was over. spec.jobCleanPolicy is "delete"
And I run the experiment again, the experiment-pod is not start up. I have to remove chaoengine by manual. Is it normal?
What you expected to happen:
Chaosengine was clear after the experiment was over, I can run the experiment multiple times.
How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

@ksatchit
Copy link
Member

ksatchit commented Apr 2, 2020

Thanks for the feedback @badashanren ! Currently, we need to remove & recreate the engine to start the experiment (some of the users are working around this by setting up cron jobs to remove/create the chaosengine). We are solving this in a couple of stages:

@ksatchit ksatchit added kind/feature project/community Issues raised by community members area/chaos-scheduler Regarding the chaos scheduler labels Apr 2, 2020
@ksatchit ksatchit added this to the 1.4 milestone Apr 2, 2020
@badashanren
Copy link
Author

Thanks for the feedback @badashanren ! Currently, we need to remove & recreate the engine to start the experiment (some of the users are working around this by setting up cron jobs to remove/create the chaosengine). We are solving this in a couple of stages:

  • Stage-1: We will be able to keep a "completed" chaosengine and patch it to re-trigger experiments (ref: #1360 (comment)) - Should be available by 1.3 (15th of this month)
  • Stage-2: Provide scheduling capability to rerun the experiments multiple in a desired interval (ref: Better way to do "scheduled chaos" #1223). This should be available by 1.4, in all probability.

Got it~
BTW, I want to use other chaos library(such as chaosblade) to inject chaos. Some tips or docs will be appreciated

@ksatchit
Copy link
Member

ksatchit commented Apr 3, 2020

Sure @badashanren ! That is an interesting usecase. Litmus has been designed in a way where it can reuse other chaos tools/chaos logic - provided, it is containerized. Typically, the workflow involves being able to consume a config that the respective tools would need, with the litmus chaos-runner creating a job with the said chaos container/library. For ex: We have been using pumba in that model for some of the network experiments. Another example of integration is with chaostoolkit - where not just the chaos-injection stage, but an entire experiment is written using that tool & is orchestrated by Litmus.

We don't have a formal document describing how it is done yet, but will add it & share it for review ASAP.

Having said that, I am interested in learning more about your usecase for chaos in general, where chaosblade fits in etc., so that I can better suggest/document things.

@badashanren
Copy link
Author

That is an interesting usecase. Litmus has been designed in a way where it can reuse other chaos tools/chaos logic - provided, it is containerized. Typically, the workflow involves being able to consume a config that the respective tools would need, with the litmus chaos-runner creating a job with the said chaos container/library. For ex: We have been using pumba in that model for some of the network experiments. Another example of integration is with chaostoolkit - where not just the chaos-injection stage, but an entire experiment is written using that tool & is orchestrated by Litmus.

We don't have a formal document describing how it is done yet, but will add it & share it for review ASAP.

Having said that, I am interested in learning more about your usecase for chaos in general, where chaosblade fits in etc., so that I can better suggest/document things.

First,I like litmus,I think it's not a chaos tool,it should be a chaos platform. it's scalable, it's very important. Many Chaos tools have their advantages, litmus can be quickly compatible with other tools.
But if we have no formal docs, it is very difficulty to us.

@ksatchit
Copy link
Member

ksatchit commented Apr 3, 2020

That's great to hear! The doc is coming your way in a few hours :)

@ksatchit
Copy link
Member

ksatchit commented Apr 3, 2020

@badashanren, here is some info you may find useful.

I will be talking about 2 types of integrations in the upcoming doc that resolves #1205 :

@badashanren
Copy link
Author

@badashanren, here is some info you may find useful.

I will be talking about 2 types of integrations in the upcoming doc that resolves #1205 :

Thanks very much,I will study it carefully.

@ksatchit
Copy link
Member

Thanks for the feedback @badashanren ! Currently, we need to remove & recreate the engine to start the experiment (some of the users are working around this by setting up cron jobs to remove/create the chaosengine). We are solving this in a couple of stages:

  • Stage-1: We will be able to keep a "completed" chaosengine and patch it to re-trigger experiments (ref: #1360 (comment)) - Should be available by 1.3 (15th of this month)
  • Stage-2: Provide scheduling capability to rerun the experiments multiple in a desired interval (ref: Better way to do "scheduled chaos" #1223). This should be available by 1.4, in all probability.

Just to circle back on the 1.3 items related to topics discussed in this issue/comments : @badashanren .

Upcoming:

As noted 1.3.1 will contain the style guide, some more improvements to developer docs (including an example for use w/ chaosblade)

1.4 will have the dockerization of the scaffold tool to avoid dependencies.

@ksatchit
Copy link
Member

ksatchit commented May 16, 2020

The ChaosScheduler has been made available (as alpha feature) in release 1.4 This should meet the requirement of being able to repeatedly run the experiment by creating/removing the chaosengine.

Refer to https://docs.litmuschaos.io/docs/scheduling/ & https://docs.litmuschaos.io/docs/chaosschedule/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/chaos-scheduler Regarding the chaos scheduler kind/feature project/community Issues raised by community members
Projects
None yet
Development

No branches or pull requests

2 participants