Skip to content

Classification : Enhance pipeline to save train and test datasets to s3 for reuse #341

@nishika26

Description

@nishika26

Right now, in our classification pipeline, the train and test data are split from the uploaded dataset, converted into an OpenAI-compatible format, and then uploaded to OpenAI for further use. While this works, it limits the pipeline because we don’t retain the actual training and testing datasets in any proper storage system other than openai.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

Projects

Status

Closed

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions