FfDL logging issues with Elastic Search #13

Tomcli · 2018-02-15T19:55:42Z

The FfDL Elastic Search sometimes has an overhead issue when creating the emetrics/logline mapping.

[2018-02-15T18:11:21,334][INFO ][o.e.c.m.MetaDataMappingService] [VPF0eed] [dlaas_learner_data/Of58W91xS-6OlsuQEGByzw] update_mapping [logline]
[2018-02-15T18:11:36,852][INFO ][o.e.m.j.JvmGcMonitorService] [VPF0eed] [gc][556] overhead, spent [258ms] collecting in the last [1s]

When the Elastic Search works properly, it should have the following logs for mapping update/create.

[2018-02-15T17:52:52,598][INFO ][o.e.c.m.MetaDataMappingService] [R7H6R6o] [dlaas_learner_data/d-NMzvwRT_CXgMwHOBinTg] update_mapping [logline]
[2018-02-15T17:54:51,289][INFO ][o.e.c.m.MetaDataMappingService] [R7H6R6o] [dlaas_learner_data/d-NMzvwRT_CXgMwHOBinTg] create_mapping [emetrics]

The text was updated successfully, but these errors were encountered:

sboagibm · 2018-02-15T22:45:56Z

Probably first thing to try is to see if we can monkey with templates/infrastructure/storage.yml to give it a bit more memory.

animeshsingh · 2018-02-19T23:52:25Z

@sboagibm dont see any memory field here
https://github.com/IBM/FfDL/blob/master/templates/infrastructure/storage.yml

sboagibm · 2018-02-20T00:41:32Z

@whummer do you know off the top of your head how to allocate more memory there?

k0105 · 2018-04-02T11:50:21Z

Shouldn't allocating more memory work the same way as with containers, i.e. something like the following?

      requests:
        memory: "500Mi"
      limits:
        memory: "1000Mi"

What are your specs? How much RAM do you have and how much space does /var/lib/elasticsearch have left?

You could also try increasing the Java heap size with -Xmx (maximum heap size) and -Xms (initial heap size) in jvm.options. jmap -histo <pid> should provide a heap histogram, cmp. https://docs.oracle.com/javase/10/tools/jmap.htm

Finally: What are bootstrap.memory_lock, indices.memory.index_buffer_size and MAX_OPEN_FILES?

[If all of those check out, this might just be normal garbage collection, if I'm not mistaken. 25% does not seem too critical if it does not occur too frequently. The threshold for hard warnings is 50%, iirc.]

animeshsingh · 2018-04-04T04:47:33Z

Thanks @k0105 - We will take a look @Tomcli @whummer

Tomcli · 2018-04-04T17:48:29Z

Thanks @k0105, this issue was caused by some helper pods that has insufficient memory. I did a quick PR that will fix this issue.

add a -p to lcm logs

Tomcli added the bug Something isn't working label Feb 21, 2018

sboagibm assigned sboagibm and unassigned sboagibm Mar 20, 2018

Tomcli mentioned this issue Apr 4, 2018

Fix logging bug and clean up on helm chart #53

Merged

Tomcli mentioned this issue Apr 4, 2018

Move constants values to be configurable in helm chart #54

Closed

animeshsingh closed this as completed in #53 Apr 6, 2018

sboagibm pushed a commit to sboagibm/FfDL that referenced this issue May 21, 2018

Merge pull request IBM#13 from sboagibm/try_previous_lcm_logs

142b54e

add a -p to lcm logs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FfDL logging issues with Elastic Search #13

FfDL logging issues with Elastic Search #13

Tomcli commented Feb 15, 2018 •

edited

sboagibm commented Feb 15, 2018

animeshsingh commented Feb 19, 2018

sboagibm commented Feb 20, 2018

k0105 commented Apr 2, 2018

animeshsingh commented Apr 4, 2018

Tomcli commented Apr 4, 2018

FfDL logging issues with Elastic Search #13

FfDL logging issues with Elastic Search #13

Comments

Tomcli commented Feb 15, 2018 • edited

sboagibm commented Feb 15, 2018

animeshsingh commented Feb 19, 2018

sboagibm commented Feb 20, 2018

k0105 commented Apr 2, 2018

animeshsingh commented Apr 4, 2018

Tomcli commented Apr 4, 2018

Tomcli commented Feb 15, 2018 •

edited