Skip to content

Processing job input and output cannot be the same  #1950

@RoelantStegmann

Description

@RoelantStegmann

Describe the bug
Sagemaker requires the input and output to be different folders (for no apparent reason)

To reproduce

proc = Processor(
   ....
)
proc.run(
    inputs=[
         ProcessingInput(
            source='s3://processing-data/test/',
            destination='/opt/ml/test/'
        ),
    ],
    outputs=[
        ProcessingOutput(
            source=f'/opt/ml/processing/test/',
            destination=f's3://processing-data/test',
            s3_upload_mode="Continuous"
        ),
    ],
    arguments=[
    ],
)

Expected behavior

You would expect this to be allowed, but it runs into validation errors that apparently compares the input and output paths.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions