LizardFS master 3.10.2 crash #487

Closed
andypl78 opened this issue Sep 26, 2016 · 8 comments

@andypl78

[4581135.415771] mfsmaster[13210]: segfault at 0 ip 00000000004fa3b7 sp 00007ffc9fb8f1f0 error 4 in mfsmaster[400000+15d000]

Setup:

@blink69 blink69 added the bug label Sep 27, 2016
@blink69
Contributor

blink69 commented Sep 27, 2016

Please send us more details about this:

  • number of files,
  • number of chunks,
  • number of connected chunkservers and clients.

The syslog from before this crash will also help.

@blink69 blink69 self-assigned this Sep 27, 2016
@blink69
Contributor

blink69 commented Sep 27, 2016

A patch for that is in our CR: http://cr.skytechnology.pl:8081/#/c/2720/

@andypl78
Author

andypl78 commented Sep 27, 2016

When can we expect a new version in the repo?

My setup (all on version 3.10.2):

  • master x1
  • shadow x1
  • metalogger x1
  • chunks x2
  • clients x22

syslog:

Sep 26 09:17:01 lfs01 /USR/SBIN/CRON[26509]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 09:47:23 lfs01 mfsmaster[13210]: structure check loop
Sep 26 10:17:01 lfs01 /USR/SBIN/CRON[14730]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 11:17:01 lfs01 /USR/SBIN/CRON[2936]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 12:17:01 lfs01 /USR/SBIN/CRON[23580]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 13:17:01 lfs01 /USR/SBIN/CRON[11799]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 13:47:37 lfs01 mfsmaster[13210]: structure check loop
Sep 26 14:17:01 lfs01 /USR/SBIN/CRON[32448]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 15:17:01 lfs01 /USR/SBIN/CRON[20667]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 16:17:01 lfs01 /USR/SBIN/CRON[8887]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Sep 26 16:42:54 lfs01 kernel: [4581135.415771] mfsmaster[13210]: segfault at 0 ip 00000000004fa3b7 sp 00007ffc9fb8f1f0 error 4 in mfsmaster[400000+15d000]
Sep 26 16:42:54 lfs01 mfsmount[25736]: master: connection lost
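
For anyone reading the kernel line in the log above, here is a minimal sketch of how its fields can be decoded. It assumes the usual x86 page-fault error-code bits (bit 0: protection violation rather than page not present, bit 1: write, bit 2: user mode) and that the kernel prints all numeric fields except the PID in hex. The helper name `decode_segfault` and the dictionary keys are made up for illustration; nothing here comes from LizardFS itself.

```python
import re

# Matches the kernel's user-space segfault message as it appears in this thread.
SEGFAULT_RE = re.compile(
    r"(?P<prog>\S+)\[(?P<pid>\d+)\]: segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+) "
    r"in (?P<obj>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def decode_segfault(line):
    """Split a kernel segfault line into its fields (hypothetical helper)."""
    m = SEGFAULT_RE.search(line)
    if m is None:
        raise ValueError("not a kernel segfault line")
    err = int(m["err"], 16)
    ip = int(m["ip"], 16)
    base = int(m["base"], 16)
    return {
        "program": m["prog"],
        "pid": int(m["pid"]),
        "fault_addr": int(m["addr"], 16),
        "sp": int(m["sp"], 16),
        # Offset of the faulting instruction inside the mapped object.
        "offset_in_mapping": ip - base,
        # Assumed x86 page-fault error-code bits.
        "cause": "protection violation" if err & 1 else "page not present",
        "access": "write" if err & 2 else "read",
        "mode": "user" if err & 4 else "kernel",
    }


if __name__ == "__main__":
    line = ("Sep 26 16:42:54 lfs01 kernel: [4581135.415771] mfsmaster[13210]: "
            "segfault at 0 ip 00000000004fa3b7 sp 00007ffc9fb8f1f0 error 4 "
            "in mfsmaster[400000+15d000]")
    for key, value in decode_segfault(line).items():
        print(key, hex(value) if key in ("fault_addr", "sp", "offset_in_mapping") else value)
```

Under those assumptions, the 3.10.2 line describes a user-mode read at address 0, which is what a NULL pointer dereference typically looks like. With a binary that carries debug symbols, the `ip` value can usually be resolved to a source location with a tool such as `addr2line -e <binary> <address>`.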

@blink69 blink69 added this to the 3.10.4 milestone Sep 28, 2016
@blink69
Contributor

blink69 commented Sep 28, 2016

In the next few days we will publish 3.10.4 with all fixes.

@viktor-zhuromskyy

viktor-zhuromskyy commented Oct 4, 2016

Same thing here.
lizardfs-master crashes about once a day under average load.

@DrProfi

DrProfi commented Dec 11, 2016

I can confirm the problem with 3.10.4:

[3780056.793786] mfsmaster[19333]: segfault at 7fffc4c68ff8 ip 00000000004db2b5 sp 00007fffc4c69000 error 6 in mfsmaster[400000+13a000]

[3789174.107720] mfsmaster[7194]: segfault at 7ffe06e40ff8 ip 00000000004db2b2 sp 00007ffe06e41000 error 6 in mfsmaster[400000+13a000]

Installed from the repo for Ubuntu 14.04.
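
A side note on the two 3.10.4 lines: in both, the fault address sits 8 bytes below a page-aligned `sp`, and the error code is 6. The sketch below is a rough heuristic for spotting that pattern, assuming the usual x86 meaning of error 6 (user-mode write to a page that is not present). The function name and the `window` threshold are arbitrary choices for illustration. A match only hints that the stack may have run out, for example through deep recursion; it does not identify the cause.

```python
def looks_like_stack_overflow(fault_addr, sp, error_code, window=256):
    """Heuristic: user-mode write to an unmapped page at or just below sp.

    The `window` size is an arbitrary assumption, not a kernel constant.
    """
    # Bits assumed: 0 = protection violation, 1 = write, 2 = user mode.
    user_write_not_present = (error_code & 0b111) == 0b110
    distance_below_sp = sp - fault_addr
    return user_write_not_present and 0 <= distance_below_sp <= window


if __name__ == "__main__":
    # Values copied from the kernel lines quoted in this thread.
    print(looks_like_stack_overflow(0x7fffc4c68ff8, 0x7fffc4c69000, 6))
    print(looks_like_stack_overflow(0x7ffe06e40ff8, 0x7ffe06e41000, 6))
    print(looks_like_stack_overflow(0x0, 0x7ffc9fb8f1f0, 4))
```

Both 3.10.4 crashes match the pattern, while the original 3.10.2 crash (a read at address 0) does not, which suggests the two versions may be failing in different ways.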

@blink69
Contributor

blink69 commented Dec 12, 2016

This issue is related to a bug in libjudy; see issue #496.

@blink69 blink69 reopened this Dec 12, 2016
@psarna
Member

psarna commented Dec 13, 2016

#496 is already closed with e77895a

@psarna psarna closed this as completed Dec 13, 2016