runner: don't allow multiple instances of daemon to run #418

pkalever · 2018-05-16T09:09:45Z

until now there is no check to defend on multiple runs of daemon within
the same node. This patch takes a non blocking lock on 'tcmu.lock' file,
if it succeeds only then daemon is allowed to run, if there is already a
lock on lock-file (taken by some other instance of tcmu-runner) we will exit.

Signed-off-by: Prasanna Kumar Kalever prasanna.kalever@redhat.com

lxbsz · 2018-05-16T09:39:42Z

main.c

+	lock.l_type = F_WRLCK;
+	if (fcntl(fd, F_SETLK, &lock) == -1) {
+		tcmu_err("tcmu-runner is already running...\n");
+		close(fd);


We'd better add one goto for closing the fd to make the code more readable.

Or we should it close(fd) everywhere just after this line when trying to goto error tags.

Good catch @lxbsz
I will fix this part.

lxbsz · 2018-05-16T09:40:26Z

main.c

+		tcmu_err("tcmu-runner is already running...\n");
+		close(fd);
+		goto destroy_log;
+	}

 	ret = load_our_module();
 	if (ret < 0) {
 		tcmu_err("couldn't load module\n");
 		goto destroy_log;


Just like here.

lxbsz · 2018-05-16T09:49:16Z

main.c

+
+	lock.l_type = F_WRLCK;
+	if (fcntl(fd, F_SETLK, &lock) == -1) {
+		tcmu_err("tcmu-runner is already running...\n");


Should we check that only when errno == EACCES or EAGAIN then we will be sure that the tcmu-runner main process is already runing ? Or it should be other op errors.

Sure, I will defend on EAGAIN

pkalever · 2018-05-16T11:02:42Z

@lxbsz updated with the suggestions. Thanks!

lxbsz · 2018-05-16T13:35:00Z

It looks good to me and test it and the logs are:

/usr/bin/tcmu-runner --tcmu-log-dir=/var/log/gluster-block/ --debug
The logdir option from the tcmu.conf will be ignored
Inotify is watching "/etc/tcmu/tcmu.conf", wd: 1, mask: IN_ALL_EVENTS
2018-05-16 09:30:46.462 6677 [ERROR] main:1073: tcmu-runner is already running...

pranithk

Changes look good to me.

mikechristie · 2018-05-21T20:06:37Z

main.c

@@ -56,6 +56,8 @@
 #include "libtcmu_config.h"
 #include "libtcmu_log.h"

+# define TCMU_LOCK_FILE   "/var/run/tcmu.lock"


Is this more normally in the lock dir like

/var/run/lock/tcmu.lock
or
/var/run/lock/tcmu/lock
?

Yes will pick the first one.

mikechristie · 2018-05-21T20:52:07Z

main.c

@@ -994,6 +996,8 @@ int main(int argc, char **argv)
 	GIOChannel *libtcmu_gio;
 	guint reg_id;
 	bool new_path = false;
+	struct flock lock = {0, };
+	int fd;


How about rename to lock_fd.

I will change this.

mikechristie · 2018-05-21T20:55:38Z

main.c

@@ -1146,6 +1173,8 @@ int main(int argc, char **argv)
 	tcmulib_close(tcmulib_context);
 err_free_handlers:
 	darray_free(handlers);
+close_fd:
+	close(fd);


Why for the normal shutdown we do a F_UNLCK and a close, but for the error path we only do a close on the fd?

From the man page it sounds like the same thing happens and the lock is released due to the process exiting. Is that correct? Is it just customary to only do a close in error paths liek this?

From man:

As well as being removed by an explicit F_UNLCK, record locks are automatically released when the process terminates.

If a process closes any file descriptor referring to a file, then all of the process's locks on that file are released

I just wrote more like sequence, like open, grab lock, release lock, close in the normal shutdown. And forget to add release lock at error path.

But, just closing the fd without F_UNLCK will also release the lock anyway.

@mikechristie I'm not sure what would you prefer:

remove F_UNLCK completely and just close(fd) in the both normal & error paths (or)

add F_UNLCK in error path, which I forget before ?

Thanks!

Lol, I do not know.

Go for the F_UNLOCK just to make it clear I guess. We seem to do that for some other things so it will at least be consistent.

OK - Make-sense :-)

until now there is no check to defend on multiple runs of daemon within the same node. This patch takes a non blocking lock on 'tcmu.lock' file, if it succeeds only then daemon is allowed to run, if there is already a lock on lock-file (taken by some other instance of tcmu-runner) we will exit. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>

pkalever · 2018-05-25T12:12:28Z

Updated with the suggested changes. Thanks!

pkalever force-pushed the lockfile branch from a8c859e to 1af1759 Compare May 16, 2018 09:14

lxbsz reviewed May 16, 2018

View reviewed changes

pkalever force-pushed the lockfile branch from 1af1759 to e42cd78 Compare May 16, 2018 11:02

pranithk approved these changes May 21, 2018

View reviewed changes

mikechristie reviewed May 21, 2018

View reviewed changes

pkalever force-pushed the lockfile branch from e42cd78 to db35711 Compare May 25, 2018 12:11

mikechristie merged commit 83664e9 into open-iscsi:master May 25, 2018

lxbsz mentioned this pull request Aug 23, 2018

Avoid multi tcmu-runner processes to run at container env #460

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

runner: don't allow multiple instances of daemon to run #418

runner: don't allow multiple instances of daemon to run #418

pkalever commented May 16, 2018

lxbsz May 16, 2018

pkalever May 16, 2018 •

edited

lxbsz May 16, 2018

lxbsz May 16, 2018 •

edited

pkalever May 16, 2018

pkalever commented May 16, 2018

lxbsz commented May 16, 2018

pranithk left a comment

mikechristie May 21, 2018

pkalever May 23, 2018

mikechristie May 21, 2018

pkalever May 23, 2018

mikechristie May 21, 2018

pkalever May 23, 2018 •

edited

mikechristie May 23, 2018

pkalever May 25, 2018

pkalever commented May 25, 2018

runner: don't allow multiple instances of daemon to run #418

runner: don't allow multiple instances of daemon to run #418

Conversation

pkalever commented May 16, 2018

Choose a reason for hiding this comment

pkalever May 16, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lxbsz May 16, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pkalever commented May 16, 2018

lxbsz commented May 16, 2018

pranithk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pkalever May 23, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pkalever commented May 25, 2018

pkalever May 16, 2018 •

edited

lxbsz May 16, 2018 •

edited

pkalever May 23, 2018 •

edited