siddhi how to confirm the data availability? #1450

xywan89 · 2019-08-16T01:57:53Z

now i deploy a siddhi cluster on some docker,and will restart the cluster frequently(at least one or more per day).my question is when i restart an docker, if will loss data or not ?

if will loss data,how can i do to confirm the data availability?

BuddhiWathsala · 2019-08-16T05:30:57Z

Yes if you deploy Siddhi in default way you will lose the data. You can enable state persistence in two ways.

File system persistence
DB persistence

To persist data in a file system you have to do the following.

Create a directory(<PATH_TO_TEMP>/temp) to persist the state

Then, you need to create a YAML file with the following content to enable state persistence in file system mode. Let say that file is config.yaml.

state.persistence:
enabled: true
intervalInMin: 1
revisionsToKeep: 2
persistenceStore: io.siddhi.distribution.core.persistence.FileSystemPersistenceStore
config:
    location: siddhi-app-persistence

Then you have to run the docker using the following command. This command will create a volume mount to the /conf/config.yaml the directory inside the docker and using that file Siddhi runner changes its default config. This config change enables periodic state persistence.

docker run -v <PATH_TO_TEMP>/temp:/home/siddhi_user/siddhi-runner/wso2/runner/siddhi-app-persistence -v <PATH_TO_CONFIG_YAML>/config.yaml:/conf/config.yaml  -v <PATH_TO_SIDDHI_APPS>/PowerConsumptionSurgeDetection.siddhi:/siddhi/PowerConsumptionSurgeDetection.siddhi -p 8070:8070 siddhiio/siddhi-runner-ubuntu:5.1.0-m2  -Dconfig=/conf/config.yaml -Dapps=/siddhi/PowerConsumptionSurgeDetection.siddhi

This will persist the state to your <PATH_TO_TEMP>/temp directory. To enble database persistence use following YAML block. Then you have to connect a DB to the Siddhi runner docker.

state.persistence:
  enabled: true
  intervalInMin: 1
  revisionsToKeep: 3
  persistenceStore: io.siddhi.distribution.core.persistence.DBPersistenceStore
  config:
    datasource: <DATASOURCE NAME>   # A datasource with this name should be defined in wso2.datasources namespace
    table: <TABLE NAME>

Please refer to Siddhi documentation for more details.
[1] https://siddhi.io/en/v5.0/docs/siddhi-as-a-docker-microservice/#running-with-runner-config
[2] https://siddhi.io/en/v5.0/docs/config-guide/#configuring-periodic-state-persistence

cristicmf · 2019-08-19T07:32:49Z

If I didn't want to use the docker or Kubernetes , How Can I Cover the Multi Datacenter High Availability Deployment

BuddhiWathsala · 2019-08-19T09:59:53Z

@cristicmf, currently Siddhi distribution does not support that HA deployment without docker or K8s.

But if you really need this HA feature you can try out the HA functionality in our stream processor. Please refer this link to get more idea about HA deployment in the stream processor.

cristicmf · 2019-08-21T02:53:52Z

@cristicmf, currently Siddhi distribution does not support that HA deployment without docker or K8s.

But if you really need this HA feature you can try out the HA functionality in our stream processor. Please refer this link to get more idea about HA deployment in the stream processor.

thx ~~ And I want know more thing about the scalability , can you give me some tips.

BuddhiWathsala · 2019-08-22T05:01:27Z

In our stream processor, we are supporting the following deployment types.

Now we have separate runtime called Siddhi runner which is a very light environment to run streaming logic. All the deployment types now managed using docker and K8s.

You can find out more details about docker deployments in Siddhi using following links.
[1] https://github.com/siddhi-io/docker-siddhi
[2] https://hub.docker.com/u/siddhiio

For K8s deployment, we have custom K8s operator called Siddhi operator. Up to now, Siddhi operator supports default distributed deployment. We are working on with the fully distributed deployment for K8s now. You can find out K8s deployment details from the following link.

[3] https://github.com/siddhi-io/siddhi-operator

Try this Katacoda samples about each deployment type in K8s.

Also, refer to the Siddhi documentation for more descriptive details.

xywan89 · 2019-08-23T01:00:17Z

refer this link

i run siddhi as a java library,it seems all way mentioned above can not cover,is there any way to confirm ?

HA mode need duplicate one instance, it's not a good way to solve my restart scenario!!

BuddhiWathsala · 2019-08-23T06:00:03Z

@xywan89 you can achieve in-memory persistence and file store persistence using Siddhi Java library. Please refer following test cases to understand how to persist your state.

xywan89 · 2019-08-23T08:25:06Z

2. File system

tks,it seems java documentaion is insufficient, look forward to your complement。

what more about persistance? such as the difference between persistance and IncrementalPersistence ?

xywan89 · 2019-08-27T06:37:25Z

@xywan89 you can achieve in-memory persistence and file store persistence using Siddhi Java library. Please refer following test cases to understand how to persist your state.

In-memory

File system

the other issue is can i limit the resource used by siddhi runtime when i use siddhi as a java library,such as limit memory cost?

BuddhiWathsala · 2019-09-02T05:43:49Z

File system

tks,it seems java documentaion is insufficient, look forward to your complement。

what more about persistance? such as the difference between persistance and IncrementalPersistence ?

Sorry for the late reply. In the state persistence, it simply persists the overall state of the current checkpoint of the application. For example, let say your application is in checkpoint X1 then it will persist X1 to the file system. When you move to the X2 checkpoint, then again Siddhi will persist overall X2 state in the file system. However, this would be a redundant persistent mechanism when your application has Giga bites of data and you only do small changes to the application.

For that kind of scenario, you can use incremental persistence. Incremental persistence uses incremental checkpointing. In there Siddhi will only persist the changes(or delta) instead of persisting overall state.

mohanvive · 2019-09-09T07:01:15Z

Closing the issue since query is answered. Please reopen if u need further assistance on this.

mohanvive added the type/question label Aug 16, 2019

mohanvive assigned BuddhiWathsala Aug 16, 2019

mohanvive closed this as completed Sep 9, 2019

mohanvive added this to Done in Release 5.1.0 (core) Sep 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

siddhi how to confirm the data availability? #1450

siddhi how to confirm the data availability? #1450

xywan89 commented Aug 16, 2019 •

edited

BuddhiWathsala commented Aug 16, 2019

cristicmf commented Aug 19, 2019

BuddhiWathsala commented Aug 19, 2019

cristicmf commented Aug 21, 2019 •

edited

BuddhiWathsala commented Aug 22, 2019

xywan89 commented Aug 23, 2019 •

edited

BuddhiWathsala commented Aug 23, 2019

xywan89 commented Aug 23, 2019 •

edited

xywan89 commented Aug 27, 2019

BuddhiWathsala commented Sep 2, 2019

mohanvive commented Sep 9, 2019

siddhi how to confirm the data availability? #1450

siddhi how to confirm the data availability? #1450

Comments

xywan89 commented Aug 16, 2019 • edited

BuddhiWathsala commented Aug 16, 2019

cristicmf commented Aug 19, 2019

BuddhiWathsala commented Aug 19, 2019

cristicmf commented Aug 21, 2019 • edited

BuddhiWathsala commented Aug 22, 2019

xywan89 commented Aug 23, 2019 • edited

BuddhiWathsala commented Aug 23, 2019

xywan89 commented Aug 23, 2019 • edited

xywan89 commented Aug 27, 2019

BuddhiWathsala commented Sep 2, 2019

mohanvive commented Sep 9, 2019

xywan89 commented Aug 16, 2019 •

edited

cristicmf commented Aug 21, 2019 •

edited

xywan89 commented Aug 23, 2019 •

edited

xywan89 commented Aug 23, 2019 •

edited