HDDS-1568 : Add RocksDB metrics to OM. #868

Merged
merged 2 commits into from May 30, 2019

avijayanhwx
Contributor

RocksDB statistics need to be sunk to hadoop-metrics2 for the Ozone Manager (OM), so that we can understand how OM behaves under heavy load.
Example: "rocksdb.bytes.written"

@avijayanhwx
Contributor Author

/label ozone

@elek elek added the ozone label May 29, 2019
@hadoop-yetus

💔 -1 overall

| Vote | Subsystem | Runtime | Comment |
|------|-----------|---------|---------|
| 0 | reexec | 37 | Docker mode activated. |
| | | | _ Prechecks _ |
| +1 | dupname | 0 | No case conflicting files found. |
| +1 | @author | 0 | The patch does not contain any @author tags. |
| +1 | test4tests | 0 | The patch appears to include 2 new or modified test files. |
| | | | _ trunk Compile Tests _ |
| 0 | mvndep | 21 | Maven dependency ordering for branch |
| +1 | mvninstall | 536 | trunk passed |
| +1 | compile | 259 | trunk passed |
| +1 | checkstyle | 84 | trunk passed |
| +1 | mvnsite | 0 | trunk passed |
| +1 | shadedclient | 875 | branch has no errors when building and testing our client artifacts. |
| +1 | javadoc | 145 | trunk passed |
| 0 | spotbugs | 290 | Used deprecated FindBugs config; considering switching to SpotBugs. |
| +1 | findbugs | 475 | trunk passed |
| | | | _ Patch Compile Tests _ |
| 0 | mvndep | 17 | Maven dependency ordering for patch |
| +1 | mvninstall | 465 | the patch passed |
| +1 | compile | 259 | the patch passed |
| +1 | javac | 259 | the patch passed |
| +1 | checkstyle | 68 | the patch passed |
| +1 | mvnsite | 0 | the patch passed |
| +1 | whitespace | 0 | The patch has no whitespace issues. |
| +1 | shadedclient | 629 | patch has no errors when building and testing our client artifacts. |
| +1 | javadoc | 132 | the patch passed |
| +1 | findbugs | 474 | the patch passed |
| | | | _ Other Tests _ |
| +1 | unit | 230 | hadoop-hdds in the patch passed. |
| -1 | unit | 1074 | hadoop-ozone in the patch failed. |
| -1 | asflicense | 41 | The patch generated 17 ASF License warnings. |
| | | 5990 | |

| Reason | Tests |
|--------|-------|
| Failed junit tests | hadoop.ozone.client.rpc.TestReadRetries |

| Subsystem | Report/Notes |
|-----------|--------------|
| Docker | Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-868/1/artifact/out/Dockerfile |
| GITHUB PR | #868 |
| Optional Tests | dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux 2d072eb09fea 4.4.0-138-generic #164-Ubuntu SMP Tue Oct 2 17:16:02 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | personality/hadoop.sh |
| git revision | trunk / 346c2b7 |
| Default Java | 1.8.0_212 |
| unit | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/1/artifact/out/patch-unit-hadoop-ozone.txt |
| Test Results | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/1/testReport/ |
| asflicense | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/1/artifact/out/patch-asflicense-problems.txt |
| Max. process+thread count | 3838 (vs. ulimit of 5500) |
| modules | C: hadoop-hdds/common hadoop-hdds/framework U: hadoop-hdds |
| Console output | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/1/console |
| versions | git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1 |
| Powered by | Apache Yetus 0.10.0 http://yetus.apache.org |

This message was automatically generated.

Contributor

@anuengineer anuengineer left a comment

Thanks for getting this change done. I am fine with this change if we are able to work with older YARN installs. You might want to ask Namit if he remembers the details of that issue, or Arpit or elek.

@@ -41,13 +49,21 @@
 /**
  * Adapter JMX bean to publish all the Rocksdb metrics.
  */
-public class RocksDBStoreMBean implements DynamicMBean {
+public class RocksDBStoreMBean implements DynamicMBean, MetricsSource {

Contributor

@elek Will this automatically show up in Prometheus, or do we need to do any extra plumbing? See the comment in the JIRA where @jnp talks about container metrics on Datanodes.

Contributor Author

@anuengineer I verified by checking the /prom servlet endpoint. I also made changes in the Prometheus sink's metric name sanitization code so that the RocksDB metric names don't look odd.
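
For readers unfamiliar with why sanitization is needed: Prometheus metric names must match `[a-zA-Z_:][a-zA-Z0-9_:]*`, so a dotted RocksDB ticker name such as "rocksdb.bytes.written" cannot be exported unchanged. The following is only a minimal sketch of that kind of normalization, not the code from this patch; the class name `PromNameSketch` and the method `sanitize` are hypothetical, and the actual sink may apply additional rules (for example, camel-case splitting).

```java
import java.util.Locale;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only; not the sink class touched by this PR.
final class PromNameSketch {
    // Any run of characters outside [a-zA-Z0-9_] collapses to one underscore.
    private static final Pattern INVALID = Pattern.compile("[^a-zA-Z0-9_]+");

    private PromNameSketch() {
    }

    // Example: "rocksdb.bytes.written" -> "rocksdb_bytes_written"
    static String sanitize(String rawName) {
        String name = INVALID.matcher(rawName)
            .replaceAll("_")
            .toLowerCase(Locale.ROOT);
        // Prometheus names may not start with a digit, so prefix an underscore.
        if (name.isEmpty() || Character.isDigit(name.charAt(0))) {
            name = "_" + name;
        }
        return name;
    }
}
```

The sketch deliberately leaves the colon out of the allowed set, since Prometheus conventionally reserves colons for recording rules.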

option = dbProfile.getDBOptions();
}

if (!rocksDbStat.equals(OZONE_METADATA_STORE_ROCKSDB_STATISTICS_OFF)) {
Contributor

There is some history here. During our first release we found that RocksDB is also shipped by YARN. That version of RocksDB is very old, so this call would fail in mysterious ways. @arp7 fixed that issue and made sure that we don't enable this by default. I am fine with enabling this if we don't run into that old issue again. @elek, @arp7, any comments?

Contributor Author

@anuengineer We don't enable the metrics by default. I just used the same config that is used to enable metrics for SCM RocksDB, to enable metrics for OM RocksDB as well.
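
As a rough illustration of the opt-in behaviour described here, the sketch below resolves a statistics level from configuration and treats "OFF" (the default) as disabled, mirroring the `rocksDbStat.equals(OZONE_METADATA_STORE_ROCKSDB_STATISTICS_OFF)` check in the hunk above. Everything in it is an assumption made for illustration: the config key string, the plain `Map` used in place of the Ozone configuration object, and the local `StatsLevel` enum standing in for the RocksDB one.

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical stand-in for the gating logic; not the code from this patch.
final class RocksDbStatsGateSketch {
    // Assumed key name and default value; check the real config keys before relying on them.
    static final String STATS_KEY = "ozone.metastore.rocksdb.statistics";
    static final String STATS_OFF = "OFF";

    // Local stand-in for the RocksDB statistics level enum.
    enum StatsLevel { EXCEPT_DETAILED_TIMERS, EXCEPT_TIME_FOR_MUTEX, ALL }

    private RocksDbStatsGateSketch() {
    }

    // Returns empty when statistics are off (the default), otherwise the requested level.
    // An unrecognized value makes valueOf throw IllegalArgumentException.
    static Optional<StatsLevel> resolve(Map<String, String> conf) {
        String value = conf.getOrDefault(STATS_KEY, STATS_OFF);
        if (value.equals(STATS_OFF)) {
            return Optional.empty();
        }
        return Optional.of(StatsLevel.valueOf(value));
    }
}
```

Under this sketch, a caller would create and register the statistics object (and the metrics bean) only when `resolve` returns a value, so both SCM and OM stay untouched unless the operator sets the key.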

Member

I am not familiar with the old issue, but our classpath is now separated from YARN. We can use any newer RocksDB (but we should use the same version for all the hdds/ozone projects).

I think it's safe to use any new RocksDB feature if you think metrics should be turned on by default...

@hadoop-yetus

💔 -1 overall

| Vote | Subsystem | Runtime | Comment |
|------|-----------|---------|---------|
| 0 | reexec | 37 | Docker mode activated. |
| | | | _ Prechecks _ |
| +1 | dupname | 1 | No case conflicting files found. |
| +1 | @author | 0 | The patch does not contain any @author tags. |
| +1 | test4tests | 0 | The patch appears to include 2 new or modified test files. |
| | | | _ trunk Compile Tests _ |
| 0 | mvndep | 13 | Maven dependency ordering for branch |
| +1 | mvninstall | 519 | trunk passed |
| +1 | compile | 255 | trunk passed |
| +1 | checkstyle | 74 | trunk passed |
| +1 | mvnsite | 0 | trunk passed |
| +1 | shadedclient | 821 | branch has no errors when building and testing our client artifacts. |
| +1 | javadoc | 154 | trunk passed |
| 0 | spotbugs | 296 | Used deprecated FindBugs config; considering switching to SpotBugs. |
| +1 | findbugs | 492 | trunk passed |
| | | | _ Patch Compile Tests _ |
| 0 | mvndep | 16 | Maven dependency ordering for patch |
| -1 | mvninstall | 178 | hadoop-ozone in the patch failed. |
| +1 | compile | 265 | the patch passed |
| +1 | javac | 265 | the patch passed |
| +1 | checkstyle | 95 | the patch passed |
| +1 | mvnsite | 0 | the patch passed |
| +1 | whitespace | 0 | The patch has no whitespace issues. |
| +1 | shadedclient | 663 | patch has no errors when building and testing our client artifacts. |
| +1 | javadoc | 169 | the patch passed |
| +1 | findbugs | 496 | the patch passed |
| | | | _ Other Tests _ |
| +1 | unit | 251 | hadoop-hdds in the patch passed. |
| -1 | unit | 1284 | hadoop-ozone in the patch failed. |
| +1 | asflicense | 69 | The patch does not generate ASF License warnings. |
| | | 6195 | |

| Reason | Tests |
|--------|-------|
| Failed junit tests | hadoop.ozone.client.rpc.TestFailureHandlingByClient |
| | hadoop.ozone.om.TestOzoneManagerHA |
| | hadoop.ozone.TestMiniOzoneCluster |

| Subsystem | Report/Notes |
|-----------|--------------|
| Docker | Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-868/2/artifact/out/Dockerfile |
| GITHUB PR | #868 |
| Optional Tests | dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux df569ff468e5 4.4.0-138-generic #164-Ubuntu SMP Tue Oct 2 17:16:02 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | personality/hadoop.sh |
| git revision | trunk / abf76ac |
| Default Java | 1.8.0_212 |
| mvninstall | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/2/artifact/out/patch-mvninstall-hadoop-ozone.txt |
| unit | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/2/artifact/out/patch-unit-hadoop-ozone.txt |
| Test Results | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/2/testReport/ |
| Max. process+thread count | 5385 (vs. ulimit of 5500) |
| modules | C: hadoop-hdds/common hadoop-hdds/framework U: hadoop-hdds |
| Console output | https://builds.apache.org/job/hadoop-multibranch/job/PR-868/2/console |
| versions | git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1 |
| Powered by | Apache Yetus 0.10.0 http://yetus.apache.org |

This message was automatically generated.

@anuengineer anuengineer merged commit 33419a9 into apache:trunk May 30, 2019
shanthoosh pushed a commit to shanthoosh/hadoop that referenced this pull request Oct 15, 2019
Causing Travis-CI builds to appear broken on master.

Author: Daniel Nishimura <dnishimura@linkedin.com>

Reviewers: Prateek Maheshwari <pmaheshwari@apache.org>

Closes apache#868 from dnishimura/disable-sonar-scanner