Onboard Diagnostics #9024

Merged
merged 5 commits into from
Mar 27, 2020

@wlan0 wlan0 commented Feb 21, 2020

on board diagnostics

@krishnasrinivas @poornas @abperiasamy

asciicast

The output from the ascii video is here: https://gist.github.com/wlan0/670db2881b4163e3af35a94f83bf642a

Some interesting observations to note in the JSON above:

  • The speed difference between minio-4, which has NVMe, and the others, which only have HDDs.
  • The 100 Gbit network link is saturated.
  • Differences between the CPUs of the different machines.

Here's another output JSON, from an FS setup with 1 drive on my laptop. It's smaller, so it's easier to read.

https://gist.github.com/wlan0/aa892c51c126dfed6a47305eddeec967

@wlan0 wlan0 force-pushed the diagnose branch 3 times, most recently from 1f78e9a to feedb3c Compare February 21, 2020 02:23
@minio minio deleted a comment from minio-ops Feb 21, 2020
@minio minio deleted a comment from minio-ops Feb 21, 2020
@poornas poornas left a comment

admin docs will also need an update

cmd/admin-handlers.go
cmd/admin-router.go Outdated
cmd/admin-server-info.go Outdated
cmd/notification.go
pkg/disk/obd.go
cmd/utils.go Outdated
@wlan0 wlan0 force-pushed the diagnose branch 2 times, most recently from febad3a to 775728d Compare March 13, 2020 04:51
@kannappanr

ping @aead @poornas

@wlan0 wlan0 force-pushed the diagnose branch 3 times, most recently from 6e0ec44 to 6a10248 Compare March 21, 2020 05:39
@minio minio deleted a comment from minio-trusted Mar 21, 2020
@minio minio deleted a comment from minio-trusted Mar 21, 2020
@kannappanr

ping @aead @poornas

cmd/api-errors.go Outdated
cmd/fs-v1.go Outdated
cmd/object-api-interface.go Outdated
cmd/obdinfo.go Outdated
cmd/peer-rest-client.go
pkg/disk/obd.go Outdated
pkg/disk/obd.go Outdated
pkg/disk/obd.go
pkg/disk/obd.go
pkg/net/obd.go Outdated
cmd/gateway/azure/gateway-azure.go Outdated
cmd/gateway/b2/gateway-b2.go Outdated
cmd/gateway/hdfs/gateway-hdfs.go Outdated
cmd/gateway/nas/gateway-nas.go Outdated
cmd/gateway/oss/gateway-oss.go Outdated
cmd/gateway/s3/gateway-s3.go Outdated
cmd/gateway/gcs/gateway-gcs.go Outdated
wlan0 commented Mar 25, 2020

@poornas @klauspost I've addressed your comments and pushed changes. PTAL

@klauspost klauspost left a comment

Awesome 👍

@poornas poornas left a comment

LGTM

@harshavardhana harshavardhana left a comment

Testing right now using mc; everything looks good, just some idiomatic changes needed.

cmd/admin-handlers.go Outdated
cmd/fs-v1.go Outdated
cmd/notification.go Outdated
cmd/notification.go Outdated
cmd/notification.go
cmd/object-api-interface.go Outdated
cmd/peer-rest-common.go
cmd/rest/client.go Outdated
cmd/server-main.go Outdated
cmd/xl-zones.go Outdated
@wlan0 wlan0 force-pushed the diagnose branch 3 times, most recently from e83b024 to 992b934 Compare March 27, 2020 01:56
disk diagnostics: preserve diagnostics in memory after 1st run
config format: return map of values instead of bytestream
admin obd command: fix logic for setting query params to ensure all obd types are either true or false
1. add admin info to obd api
2. add network OBD info
3. lock obd commands to allow only one at a time
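
Item 3 above (allowing only one OBD command at a time) can be illustrated with a small sketch. This is not the code from the PR: the names `obdGate`, `newOBDGate`, and `errOBDBusy` are hypothetical, and the approach shown (a buffered channel used as a non-blocking try-lock) is only one possible way to reject a second run while the first is still active.

```go
package main

import (
	"errors"
	"fmt"
)

// errOBDBusy is a hypothetical error returned when a run is already active.
var errOBDBusy = errors.New("another OBD run is in progress")

// obdGate (hypothetical) admits at most one diagnostics run at a time.
// A buffered channel of capacity 1 acts as a non-blocking try-lock.
type obdGate struct {
	slot chan struct{}
}

func newOBDGate() *obdGate {
	return &obdGate{slot: make(chan struct{}, 1)}
}

// run executes fn only if no other run currently holds the gate;
// otherwise it returns errOBDBusy immediately instead of waiting.
func (g *obdGate) run(fn func() error) error {
	select {
	case g.slot <- struct{}{}:
		defer func() { <-g.slot }() // release the slot when fn returns
		return fn()
	default:
		return errOBDBusy
	}
}

func main() {
	g := newOBDGate()
	err := g.run(func() error {
		// A nested attempt is rejected while the outer run is active.
		return g.run(func() error { return nil })
	})
	fmt.Println(err) // prints "another OBD run is in progress"
}
```

Rejecting instead of queueing keeps a second caller from silently waiting behind a long diagnostics run; whether the PR rejects or blocks is not stated in this conversation.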
- Implement graph algorithm to test network bandwidth from every node to every other node
- Saturate any network bandwidth adaptively, accounting for slow and fast network capacity
- Implement parallel drive OBD tests
- Implement a paging mechanism for OBD test to provide periodic updates to client
- Implement mechanism to dynamically cancel long running OBD tests
- Sys: Disk (Partitions, DiskInfo) , CPU (HW, Timers)
- Host: Platform (OS, Kernel), Sensors (Temperatures)
- Mem: Swap Mem, Virtual Mem
- Process: Everything about the MinIO Process
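
The commit messages above mention a graph algorithm that tests network bandwidth from every node to every other node, but the conversation does not spell the algorithm out. As an assumption, the sketch below uses the standard round-robin "circle method" to schedule the tests: each round contains only disjoint pairs, so no node takes part in two transfers at once, and every unordered pair is covered exactly once. The names `pair` and `roundRobin` are hypothetical, and the sketch ignores direction (a real test may measure each direction separately).

```go
package main

import "fmt"

// pair is one bandwidth test between two nodes (indices into a node list).
type pair struct{ a, b int }

// roundRobin returns rounds of disjoint pairs that together cover every
// unordered pair of n nodes exactly once (circle method).
func roundRobin(n int) [][]pair {
	m := n
	if m%2 == 1 {
		m++ // add a dummy node; whoever is paired with it sits the round out
	}
	ring := make([]int, m)
	for i := range ring {
		ring[i] = i
	}
	var rounds [][]pair
	for r := 0; r < m-1; r++ {
		var round []pair
		for i := 0; i < m/2; i++ {
			a, b := ring[i], ring[m-1-i]
			if a < n && b < n { // skip pairs that involve the dummy node
				round = append(round, pair{a, b})
			}
		}
		rounds = append(rounds, round)
		// Keep ring[0] fixed and rotate the rest one step to the right.
		last := ring[m-1]
		copy(ring[2:], ring[1:m-1])
		ring[1] = last
	}
	return rounds
}

func main() {
	for i, round := range roundRobin(4) {
		fmt.Println("round", i, round)
	}
}
```

Running the pairs of one round concurrently and the rounds one after another keeps each node's link dedicated to a single peer during a measurement, which is one way to avoid tests interfering with each other.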
@harshavardhana harshavardhana merged commit 0c80bf4 into minio:master Mar 27, 2020
@minio-trusted

Mint Automation

Test Result
mint-xl.sh ✔️
mint-large-bucket.sh ✔️
mint-worm.sh ✔️
mint-fs.sh ✔️
mint-dist-xl.sh ✔️
mint-gateway-nas.sh ✔️
mint-gateway-azure.sh more...

9024-e8b0e61/mint-gateway-azure.sh.log:

Running with
SERVER_ENDPOINT:      minio-dev7.minio.io:31561
ACCESS_KEY:           minioazure
SECRET_KEY:           ***REDACTED***
ENABLE_HTTPS:         0
SERVER_REGION:        us-east-1
MINT_DATA_DIR:        /mint/data
MINT_MODE:            full
ENABLE_VIRTUAL_STYLE: 0

To get logs, run 'docker cp 5b9dea141f2e:/mint/log /tmp/mint-logs'

(1/15) Running aws-sdk-go tests ... done in 8 seconds
(2/15) Running aws-sdk-java tests ... done in 2 seconds
(3/15) Running aws-sdk-php tests ... done in 8 minutes and 14 seconds
(4/15) Running aws-sdk-ruby tests ... done in 19 seconds
(5/15) Running awscli tests ... done in 2 minutes and 47 seconds
(6/15) Running healthcheck tests ... done in 0 seconds
(7/15) Running mc tests ... done in 7 minutes and 27 seconds
(8/15) Running minio-dotnet tests ... done in 1 minutes and 46 seconds
(9/15) Running minio-go tests ... done in 21 minutes and 41 seconds
(10/15) Running minio-java tests ... FAILED in 8 minutes and 53 seconds
{
  "name": "minio-java",
  "function": "composeObject(String bucketName, String objectName,List<ComposeSource> composeSources, Map <String,String > headerMap, ServerSideEncryption sseTarget)",
  "args": "size: 6 MB & 6 MB",
  "duration": 11373,
  "status": "FAIL",
  "error": "error occurred\nErrorResponse(code=NotImplemented, message=A header you provided implies functionality that is not implemented, bucketName=minio-java-test-h4t6tv, objectName=minio-java-test-3cdblht, resource=/minio-java-test-h4t6tv/minio-java-test-3cdblht, requestId=16000E2880C71324, hostId=aba24c6f-85d5-47e9-ac45-d5e2fed80939)\nrequest={method=PUT, url=http://minio-dev7.minio.io:31561/minio-java-test-h4t6tv/minio-java-test-3cdblht?uploadId=8b775148bbab49c7&partNumber=1, headers=x-amz-copy-source-if-match: dda202f43c2b2df39d130f43ab1fdac7-2\nx-amz-copy-source: minio-java-test-h4t6tv/minio-java-test-30ni5eo\nAccept-Encoding: identity\nHost: minio-dev7.minio.io:31561\nUser-Agent: MinIO (amd64; amd64) minio-java/dev\nx-amz-content-sha256: e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855\nx-amz-date: 20200327T041715Z\nAuthorization: AWS4-HMAC-SHA256 Credential=*REDACTED*/20200327/us-east-1/s3/aws4_request, SignedHeaders=accept-encoding;host;x-amz-content-sha256;x-amz-copy-source;x-amz-copy-source-if-match;x-amz-date, Signature=*REDACTED*\n}\nresponse={code=501, headers=Accept-Ranges: bytes\nContent-Length: 408\nContent-Security-Policy: block-all-mixed-content\nContent-Type: application/xml\nServer: MinIO/DEVELOPMENT.GOGET\nVary: Origin\nX-Amz-Request-Id: 16000E2880C71324\nX-Xss-Protection: 1; mode=block\nDate: Fri, 27 Mar 2020 04:17:16 GMT\n}\n >>> [io.minio.MinioClient.executeReq(MinioClient.java:1220), io.minio.MinioClient.execute(MinioClient.java:1082), io.minio.MinioClient.executePut(MinioClient.java:1446), io.minio.MinioClient.executePut(MinioClient.java:1468), io.minio.MinioClient.uploadPartCopy(MinioClient.java:2619), io.minio.MinioClient.composeObject(MinioClient.java:2572), FunctionalTest.composeObject_test1(FunctionalTest.java:2587), FunctionalTest.runTests(FunctionalTest.java:3396), FunctionalTest.main(FunctionalTest.java:3514)]"
}

Executed 9 out of 15 tests successfully.

Deleting image on docker hub
Deleting image locally
