
Tinkerpop 2274 tp33#1175

Closed
rpopov wants to merge 42 commits intoapache:tp33from
rpopov:TINKERPOP-2274-tp33

Conversation

@rpopov
Contributor

@rpopov rpopov commented Aug 5, 2019

For initial review - there are still more changes to come

Resolving locale and file system - specific issues when compiling under Windows (10)

rpopov added 7 commits August 3, 2019 14:31
the test data. In order to avoid Maven warnings, provided explicitly the most-recent
version of jacoco in <pluginManagement> and removed <prerequisite>.
to avoid cryptic test errors. Added Readme.md to describe how to install
Hadoop, Spark and OS integration
…mputer.AbstractHadoopGraphComputerTest

to delete the temporary files first before attempting to create new ones
with the same name. Provided better error diagnostic messages.
Set POM to check if HADOOP_HOME and HADOOP_GREMLIN_LIBS env. vars are set
in advance in order not to fail the build with a cryptic message.
…he classes

and failing the tests due to loading org.apache.tinkerpop.gremlin.TestHelper
from gremlin-core instead of from gremlin-test by replacing the RANDOM
constant with new Random().
*Suggestion:* Remove org.apache.tinkerpop.gremlin.TestHelper from gremlin-core
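The temporary-file handling described in the commit messages above can be sketched roughly as follows. This is a hypothetical helper, not the PR's actual code; the class and method names (`TempFileHelper`, `recreateFile`) are assumptions used only to illustrate deleting a stale file before creating a new one with the same name, with a clear diagnostic instead of a cryptic failure:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class TempFileHelper {
    /**
     * Deletes any stale file at the given path before creating a fresh one.
     * If the deletion fails (e.g. a lingering lock on Windows), reports a
     * clear diagnostic instead of letting the later create fail cryptically.
     */
    public static Path recreateFile(Path path) throws IOException {
        try {
            Files.deleteIfExists(path); // remove any stale copy first
        } catch (IOException e) {
            throw new IOException("Could not delete stale temporary file: " + path, e);
        }
        return Files.createFile(path);
    }
}
```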
@rpopov rpopov changed the base branch from master to tp33 August 5, 2019 06:00
@spmallette
Contributor

Please note that travis is failing the build at this point. The error appears consistent across the different builds, so I think it's a legitimate problem with your changes and not travis instability.

spmallette and others added 4 commits August 5, 2019 07:00
…ow Travis build pass,

but this basically negates the use of the Maven enforcer: if Hadoop is not
installed, the tests will fail with no indication why, instead of the build
defining and checking its environment clearly.
<artifactId>maven-enforcer-plugin</artifactId>
<executions>
<execution>
<id>check-hadoop-installed</id>
Contributor


Could you clarify the need for enforcer for these environment variables? Was the build failing without those somehow? I don't have them set at the moment and mvn clean install builds fine for me.

Contributor Author


My idea is to check the environment the build/tests run in and warn that it is incomplete, instead of throwing cryptic exceptions. Having this stated in the POM also serves as a kind of documentation. In this specific case I could not figure out the needed build environment setup, nor how the CI environment provides Hadoop libraries for the tests.

Contributor


I see where not setting HADOOP_GREMLIN_LIBS generates some WARN messages when you build:

   [WARN] org.apache.tinkerpop.gremlin.hadoop.jsr223.HadoopGremlinPlugin - Be sure to set the environmental variable: HADOOP_GREMLIN_LIBS
   [WARN] org.apache.hadoop.util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
   [WARN] org.apache.tinkerpop.gremlin.hadoop.jsr223.HadoopGremlinPlugin - Be sure to set the environmental variable: HADOOP_GREMLIN_LIBS
   [WARN] org.apache.tinkerpop.gremlin.hadoop.jsr223.HadoopGremlinPlugin - Be sure to set the environmental variable: HADOOP_GREMLIN_LIBS
   [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.844 s - in org.apache.tinkerpop.gremlin.hadoop.jsr223.HadoopGremlinPluginTest

but it doesn't error, even when you run integration tests on hadoop-gremlin. I think those environment variables are really more for usage at runtime rather than for actual tests. You mentioned that you get "cryptic exceptions" - what are those exceptions and do they occur during build on windows?

I'd rather not force folks to set up environment variables unless they are necessary to the build somehow. But, now that I look at it again, I assume that <level>WARN</level> isn't actually enforcement (meaning the build doesn't fail if those environment variables aren't present). So perhaps this addition is ok given how you have done it even if there aren't errors specifically.
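A rule of the shape being discussed could look roughly like the following sketch. It assumes the enforcer's built-in requireEnvironmentVariable rule with a WARN level (which logs a warning instead of failing the build); the exact configuration in the PR may differ:

```xml
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-enforcer-plugin</artifactId>
  <executions>
    <execution>
      <id>check-hadoop-installed</id>
      <goals>
        <goal>enforce</goal>
      </goals>
      <configuration>
        <rules>
          <requireEnvironmentVariable>
            <variableName>HADOOP_HOME</variableName>
            <!-- WARN only logs; the build still succeeds without the variable -->
            <level>WARN</level>
            <message>HADOOP_HOME is not set; some hadoop-gremlin tests may fail with cryptic errors.</message>
          </requireEnvironmentVariable>
        </rules>
      </configuration>
    </execution>
  </executions>
</plugin>
```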

@spmallette
Contributor

The tp33 branch has been re-opened for development and is set to 3.3.9-SNAPSHOT. Please feel free to rebase.

spmallette and others added 11 commits August 13, 2019 08:03
Realized that we weren't even really testing GraphSON 2.0. It was hardcoded to GraphSON 3.0 for the vast majority of the tests. When I enabled GraphSON 2.0 tests I started getting test failures. I don't think they were failures in the sense that 2.0 doesn't work but more like the test semantics weren't properly set up for 2.0 serialization expectations. Anyway, disabling these tests for now until the issue can be resolved. CTR
the test data. In order to avoid Maven warnings, provided explicitly the most-recent
version of jacoco in <pluginManagement> and removed <prerequisite>.
to avoid cryptic test errors. Added Readme.md to describe how to install
Hadoop, Spark and OS integration
…mputer.AbstractHadoopGraphComputerTest

to delete the temporary files first before attempting to create new ones
with the same name. Provided better error diagnostic messages.
Set POM to check if HADOOP_HOME and HADOOP_GREMLIN_LIBS env. vars are set
in advance in order not to fail the build with a cryptic message.
…he classes

and failing the tests due to loading org.apache.tinkerpop.gremlin.TestHelper
from gremlin-core instead of from gremlin-test by replacing the RANDOM
constant with new Random().
*Suggestion:* Remove org.apache.tinkerpop.gremlin.TestHelper from gremlin-core
rpopov added 10 commits August 16, 2019 21:15
…ow Travis build pass,

but this basically negates the use of the Maven enforcer: if Hadoop is not
installed, the tests will fail with no indication why, instead of the build
defining and checking its environment clearly.
…tHelper

class, that is used both in the core and test projects, as a preparation to
avoid the collision of TestHelper classes in both projects
…stHelper and

reduced the changes to within that project. Any other projects should
continue using gremlin-test in order to keep using the TestHelper class
without any change in Java. Made TestHelper publish the methods of CoreTestHelper,
this way avoiding duplicated implementations.
org.apache.tinkerpop.gremlin.spark.SparkGremlinGryoSerializerTest
org.apache.tinkerpop.gremlin.spark.SparkGremlinTest
that used to fail with:
ERROR shouldSupportCopyMethods(org.apache.tinkerpop.gremlin.spark.structure.io.SparkContextStorageCheck)
java.lang.AssertionError
        at org.apache.tinkerpop.gremlin.spark.structure.io.SparkContextStorageCheck.shouldSupportCopyMethods(SparkContextStorageCheck.java:75)
…X, so

rm may fail. Changed rm to report whether it succeeded in removing all files in
scope, while rm with an empty scope still returns false.
Changed POM under Windows to skip the tests that remove Spark's working files
due to the same problem above. See: https://issues.apache.org/jira/browse/SPARK-12216
@spmallette
Contributor

It looks like you got some extra commits in here somehow. You might need to clean up your git history a bit.

rpopov added 3 commits August 24, 2019 09:56
… and

help further improvement of error reporting
help further improvement of error reporting.
Unify the naming of the Spark/Hadoop store directories to comply with the
directory-naming convention - all directories end with a name (not with /),
which makes references to them uniform (as per the spec of CoreTestHelper#makeTestDataDirectory()):
 a/b/c - the directory itself
 a/b/c/ - the contents of the directory
which unifies and correlates with the behavior of rm:
  rm(a/b/c) - remove the directory itself
  while in order to remove the contents of the directory and keep it, use
  rm(a/b/c/*)
This complies with the tests in spark-gremlin on the Spark/Hadoop file system.
Revealed that the Spark context is not closed between the tests, while
the tests remove its storage files; therefore explicitly store the Spark Context
for every test, making the tests independent and (more) correct in their Spark use.
Revealed that Spark does not close the FileInputStreams it iterates upon,
so that locks hang around in the file system. Thus, not being able to fix Spark/Hadoop
and specifically the use of MultiIterator, explicitly call System.gc() in
order to finalize the streams remaining open after closing the context.
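The rm convention described above can be illustrated with a small sketch. This is a hypothetical helper (`RmSketch.rm`), not the PR's FileSystemStorage code; it follows the stated rules: a path removes the directory itself, a path ending in /* removes only the contents, the result reports whether everything in scope was removed, and an empty scope yields false:

```java
import java.io.File;

public class RmSketch {
    /**
     * Removes files following the convention described above:
     *   rm("a/b/c")   - remove the directory (or file) itself, recursively
     *   rm("a/b/c/*") - remove only the contents, keeping the directory
     * Returns true only if every file in scope was removed; an empty
     * scope (nothing to remove) yields false.
     */
    public static boolean rm(String pattern) {
        if (pattern.endsWith("/*")) {
            File dir = new File(pattern.substring(0, pattern.length() - 2));
            File[] children = dir.listFiles();
            if (children == null || children.length == 0) return false; // empty scope
            boolean all = true;
            for (File child : children) all &= deleteRecursively(child);
            return all;
        }
        File target = new File(pattern);
        return target.exists() && deleteRecursively(target);
    }

    private static boolean deleteRecursively(File f) {
        File[] children = f.listFiles(); // null for plain files
        boolean ok = true;
        if (children != null)
            for (File child : children) ok &= deleteRecursively(child);
        return ok & f.delete(); // non-short-circuit: always attempt the delete
    }
}
```

With this distinction, a test can wipe a store's contents between runs without losing the directory itself, while the boolean result makes a failed removal (e.g. a file still locked on Windows) visible instead of silent.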
@spmallette
Contributor

I just wanted to call attention to my previous comment from a few weeks back - unfortunately, we can't easily evaluate/review this PR unless the commit history is cleaned up a bit.

… Refactor

FileSystemStorage to match the use & specification. Fix possible leaking
resources. Fix inconsistencies in the use of / and home directory.

NOTE: FileSystemStorage is still inconsistent in the use of / and home directory
and in appending /* and *
@rpopov
Contributor Author

rpopov commented Sep 8, 2019

Replaced by PR #1188

@rpopov rpopov closed this Sep 8, 2019
@rpopov rpopov deleted the TINKERPOP-2274-tp33 branch September 8, 2019 01:37