[SS-52 | SS-53 | SS-51] COPY FROM s3 testing#35313
[SS-52 | SS-53 | SS-51] COPY FROM s3 testing#35313patrickwwbutler merged 10 commits intoMaterializeInc:mainfrom
Conversation
|
Thanks for opening this PR! Here are a few tips to help make the review process smooth for everyone. PR title guidelines
Pre-merge checklist
|
def-
left a comment
There was a problem hiding this comment.
Not necessarily as part of this PR, but it would be valuable to have it in data-ingest, bounded-memory and feature-benchmark too. Mostly to make sure large amounts of data work, memory usage stays low and performance stays good (and is good already) respectively.
Added feature-benchmark and bounded-memory, will leave data-ingest for another PR as it seems it will require a bit more surrounding infrastructure for this |
ea10277 to
5ac021c
Compare
There was a problem hiding this comment.
Fresh nightly triggered: https://buildkite.com/materialize/nightly/builds/15543
Edit: Parallel Workload is red, because there is no COPY TO S3, your code should also account for it being empty, but probably has to be disabled until we can figure out that other bug: Cannot choose from an empty sequence
Feature benchmark also failed:
> COPY copy_from_s3_src TO 's3://copytos3/benchmark/copy_from_s3/18265b9b-de1c-410a-bf4b-b1247b50b6d7'
17:1: error: executing query failed: db error: ERROR: dispatch failure: other: Custom endpoint `minio:9000` was not a valid URI: URL scheme must be HTTP or HTTPS (found minio)
|
2 | > CREATE SECRET copy ... [rest of line truncated for security]
6 | SECRET ACCESS KE ... [rest of line truncated for security]
16 |
17 | > COPY copy_from_s3_src TO 's3://copytos3/benchmark/copy_from_s3/18265b9b-de1c-410a-bf4b-b1247b50b6d7'
| ^
Zippy looks like either a test bug or correctness issue:
source: /var/lib/buildkite-agent/builds/hetzner-x86-64-8cpu-16gb-aa260bc3/materialize/nightly/misc/python/materialize/zippy/copy_s3_actions.py:76
6:1: error: non-matching rows: expected:
[["0", "0", "1", "1"]]
got:
[["7001", "7001", "1", "1"]]
Poor diff:
- 0 0 1 1
+ 7001 7001 1 1
|
2 | > CREATE SECRET zipp ... [rest of line truncated for security]
3 | > CREATE CONNECTION ... [rest of line truncated for security]
5 | > COPY INTO zippy_s3_staging_24 FROM 's3://copytos3/zippy/20' (FORMAT CSV, AWS CONNECTION = zippy_aws_conn_24)
6 | > SELECT MIN(f1), MAX(f1), COUNT(f1), COUNT(DISTINCT f1) FROM zippy_s3_staging_24
| ^
I'd suggest running and iterating on tests locally first for faster turnaround times.
5ac021c to
a867917
Compare
|
triggered nightly with all modules/tests here: https://buildkite.com/materialize/nightly/builds/15556/steps/canvas |
|
@def- It seems like all the nightly pipeline failures are unrelated to these changes, with almost all of them being due to this unrecognized parameter |
a0ef4c6 to
01cd8bb
Compare
|
True, rebased on main, should be fine now: https://buildkite.com/materialize/nightly/builds/15572 |
def-
left a comment
There was a problem hiding this comment.
Test failures look unrelated, thanks!
3288bbe
into
MaterializeInc:main
Updates platform check, parallel workload, and testdrive
COPY TO S3tests to now roundtrip testCOPY TO S3andCOPY FROM S3.Adds a new zippy test for copy to & from s3 roundtrip.
Motivation
https://linear.app/materializeinc/project/copy-from-s3-da125e5e04e1/issues