3.0 clean up shell scripts #6158

benbc · 2015-12-22T16:59:05Z

Fixes #139, fixes #270, fixes #133, fixes #5682.

srbaker · 2015-12-22T19:54:02Z

packaging/standalone/src/main/distribution/shell-scripts/bin/neo4j-arbiter


+  *)
+    echo "Usage: neo4j { console | start | stop | restart | status | info }"


Isn't this supposed to say neo4j-arbiter for the script name?

A lof of this script looks very close to the neo4j one. Can we have this one source that one, and just add its weirdities? Or are they not close enough?

Isn't this supposed to say neo4j-arbiter for the script name?

Thank you. I find that that bug has existed since 2013.

srbaker · 2015-12-22T20:00:40Z

Aside from what might be an opportunity to cut down on duplication in neo4j-arbiter, I'm very happy with this. 👍

benbc · 2015-12-23T15:41:01Z

@srbaker All right. You got me. I've been trying to avoid removing that duplication, but you are quite right that it needs doing.

This functionality has long been problematic for three reasons. * It uses lsof which is an endless cause of compatibility bugs because its interface changes across distributions and verions. * It doesn't actually work as intended. The expectation is that the server is ready to serve requests when the script exits. In fact that is not quite the case because Jetty has this weird behaviour where it binds the port before the handler is actually ready. So using "port bound" as a proxy for "ready" doesn't work. * It takes a massively simplistic view of what it means for Neo4j to be "ready". It doesn't take into account the complexity of recovery, upgrades or clustering; each of which can make the server unavailable even though the port is bound. We are losing functionality here because errors early in the DBMS lifecycle may not be repoorted. Probably the biggest loss is in the case when there is something else listening on the same port. Previously we attempted to catch that in the shell script and give a clear message but now the user will have to look in the logs find the information they need. In fact even this latter is not as simple as it looks since, following the changes for Bolt, it's now possible for the DBMS to be running perfectly happily with nothing listening on port 7474. We hope to introduce mitigation for this loss later. Either have something that polls real endpoints (e.g. curl), although it's not clear how that would work in the Bolt-only case; or modify the DMBS so that it can signal back to the parent process that it's up and happy. As a temporary measure we are sleeping for five seconds after starting the process and checking that it's still alive. That way we can tell the user about very early errors (e.g. port binding problems). The sleep time has been chosen experimentally and we can't be sure that it will always be long enough.

* Call the detection function from the code that needs it to improve maintainability. Duplicating calls to `uname` will not slow us down. * Use the call detection function everywhere instead of ad hoc detection. * Remove detection for OSes that we don't vary our behaviour for.

This is untested, not officially documented and has several long-outstanding bugs against it.

With our current build and release processes it is very unlikely that someone would run a SNAPSHOT build unintentionally, so this is just unnecessary clutter.

Uses the Sharness shell testing library (https://github.com/mlafeldt/sharness/).

This changes the way we construct the classpath to be easier to test, letting Java do the globbing for us.

This allows us to accommodate environments where the JVM takes a long time (or very little time) to come up. It also allows us to turn the sleep right down for testing purposes, so the tests run quickly.

We would like the shell script tests to work on OSX and Linux. Unfortunately sed is pretty incompatible between the two of them. We don't really want to take a backup of the file here, but the form we've given happens to work identically in both environments.

In order to make the tests work the same on OSX and Linux, we need to specify /bin/bash since /bin/sh is different on those platforms (Bourne shell vs Dash).

These tests will only fail if the entire tarball is somehow broken.

benbc · 2016-01-05T14:40:33Z

@srbaker Back to you.

This is mostly a refactoring to simplify the code, but it does introduce one behavioural change (to make the implementation easier): it is now necessary to have a valid Java available for all commands, not just starting Neo4j.

This special case was inherited from whatever we copied when our scripts when our scripts were originally written. As far as I can tell the weird case that it was trying to deal with no longer exists and we don't support AIX anyway.

3.0 clean up shell scripts

srbaker reviewed Dec 22, 2015
View reviewed changes

benbc self-assigned this Jan 4, 2016

benbc added NOT READY FOR MERGE 3.0 operability labels Jan 4, 2016

benbc added 23 commits January 5, 2016 14:10

Remove obsolete stuff from and tidy scripts

fb3f4d3

Remove deprecated neo4j-install scripts

08d2ce8

Refactor console script to match daemon script

6ebac3f

Add nohup when starting in background

e47ebb5

Update the arbiter script to match the main one

cf051e0

Update shell script style

2750cb8

Remove support for Cygwin

044618b

This is untested, not officially documented and has several long-outstanding bugs against it.

Remove warning about SNAPSHOT jars in scripts

1241b8c

With our current build and release processes it is very unlikely that someone would run a SNAPSHOT build unintentionally, so this is just unnecessary clutter.

Remove unused pidfile wrapper setting

6c773c1

Remove support for Bash 3.1 and earlier

9fa6e9c

Scripts cope when the data dir is missing

7a21040

Add isolated tests for shell scripts

782e689

Uses the Sharness shell testing library (https://github.com/mlafeldt/sharness/).

Restructure shell script test harness

f5ffc96

Add tests for Java arguments in shell scripts

132f80a

Add tests for classpath construction in shell scripts

254ff04

This changes the way we construct the classpath to be easier to test, letting Java do the globbing for us.

Remove non-working code to add plugin subdirs to classpath

cd50aff

Add trigger to run Sharness with verbose flag

620513b

Test Java arg construction for daemon as well as console

31af93b

Make the scripts' sleep after startup configurable

4800c50

This allows us to accommodate environments where the JVM takes a long time (or very little time) to come up. It also allows us to turn the sleep right down for testing purposes, so the tests run quickly.

Test redirection to console.log

32680ab

Remove unused NEO4J_INSTANCE variable from scripts

4823a5f

benbc added 17 commits January 5, 2016 14:10

Add more tests for scripts running daemon

f60df7c

Update neo4j-arbiter script to be as similar to neo4j as possible

703bf24

Tests for script JVM compatibility testing

749fced

Integrate shell script tests with the build

5307fe2

Fix script name in neo4j-arbiter usage message

532e59a

Use bash for shell script tests

0f2196c

In order to make the tests work the same on OSX and Linux, we need to specify /bin/bash since /bin/sh is different on those platforms (Bourne shell vs Dash).

Move JVM +DisableExplicitGC flag to wrapper conf for consistency

64ace60

Refactor scripts to make it easier to remove duplication

a70404e

Align config handling of neo4j and neo4j-arbiter scripts

da7a095

Refactor scripts to make it easier to remove duplication

78e58f4

Remove unnecessary warnings about broken directory structure

5dd5624

These tests will only fail if the entire tarball is somehow broken.

Refactor scripts to make it easier to remove duplication

5c84c5f

Add switchable tracing to neo4j and neo4j-arbiter scripts

11b325f

Refactor scripts to make it easier to remove duplication

ec85eb8

Refactor scripts to make it easier to remove duplication

71fc43b

Remove duplication between neo4j and neo4j-arbiter scripts

131f9d8

benbc force-pushed the 3.0-clean-up-shell-scripts branch from f414ebf to e142688 Compare January 5, 2016 14:14

benbc removed the NOT READY FOR MERGE label Jan 5, 2016

benbc added 5 commits January 5, 2016 16:49

Refactor common script code to make its role clear

2aecdd4

Overhaul Java detection in scripts

eb83017

This is mostly a refactoring to simplify the code, but it does introduce one behavioural change (to make the implementation easier): it is now necessary to have a valid Java available for all commands, not just starting Neo4j.

Remove special handling for AIX in scripts

76a6f77

This special case was inherited from whatever we copied when our scripts when our scripts were originally written. As far as I can tell the weird case that it was trying to deal with no longer exists and we don't support AIX anyway.

Make names of script variables and functions consistent

bd87838

Note which customization variables the scripts respond to

eae29e8

benbc force-pushed the 3.0-clean-up-shell-scripts branch from e142688 to eae29e8 Compare January 5, 2016 16:49

srbaker added a commit that referenced this pull request Jan 7, 2016

Merge pull request #6158 from benbc/3.0-clean-up-shell-scripts

3e8fa0b

3.0 clean up shell scripts

srbaker merged commit 3e8fa0b into neo4j:3.0 Jan 7, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3.0 clean up shell scripts #6158

3.0 clean up shell scripts #6158

benbc commented Dec 22, 2015

srbaker Dec 22, 2015

srbaker Dec 22, 2015

benbc Dec 23, 2015

srbaker commented Dec 22, 2015

benbc commented Dec 23, 2015

benbc commented Jan 5, 2016


		*)
		echo "Usage: neo4j { console \| start \| stop \| restart \| status \| info }"

3.0 clean up shell scripts #6158

3.0 clean up shell scripts #6158

Conversation

benbc commented Dec 22, 2015

srbaker Dec 22, 2015

Choose a reason for hiding this comment

srbaker Dec 22, 2015

Choose a reason for hiding this comment

benbc Dec 23, 2015

Choose a reason for hiding this comment

srbaker commented Dec 22, 2015

benbc commented Dec 23, 2015

benbc commented Jan 5, 2016