Skip to content

Custom data importer#20

Merged
anshulgoel27 merged 10 commits intomainfrom
CUSTOM-DATA-IMPORTER
Oct 10, 2024
Merged

Custom data importer#20
anshulgoel27 merged 10 commits intomainfrom
CUSTOM-DATA-IMPORTER

Conversation

@arslanprecisely
Copy link
Contributor

No description provided.

| `dataImport.config.s3AccessKeyId` | AWS S3 access key for reading the CSV data | `""` |
| `dataImport.config.s3AccessKeySecret` | AWS S3 secret key for reading the CSV data | `""` |
| `dataImport.config.s3Region` | AWS S3 region for accessing the bucket | `us-east-1` |
| `dataImport.config.csvSourceFile` | The source file for data import stored in S3 | `s3://new-bucket/data.csv` |
Copy link
Collaborator

@anshulgoel27 anshulgoel27 Sep 26, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should change the description to be generic here, not tied to S3

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

claened up

| `dataImport.image.tag` | The Docker image tag for the custom data importer | `2.0.1` |
| `dataImport.image.pullPolicy` | The image pull policy | `Always` |
| `dataImport.config.expressUrl` | The URL for the express engine used in the import job | `https://express-engine-cluster-master:9200` |
| `dataImport.config.s3AccessKeyId` | AWS S3 access key for reading the CSV data | `""` |
Copy link
Collaborator

@anshulgoel27 anshulgoel27 Sep 26, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we move them under config.aws, later on when we add azure, and gcp we can add those sections here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cleaned up

@anshulgoel27
Copy link
Collaborator

Whole version of helm chart also needs to be updated to 2.0.2.

And the main readme needs to be updated with the release information.

apiVersion: batch/v1
kind: Job
metadata:
name: "{{ .Release.Name }}-importer"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AWS jobs are immutable. we should add revision number to the JOB

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add revision number to the JOB

imagePullSecrets:
{{- toYaml . | nindent 8 }}
{{- end }}
restartPolicy: Never
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please specify resources for the job

--set dataImport.config.aws.s3AccessKeyId="NEW_AKIA_ACCESS_KEY" \
--set dataImport.config.aws.s3AccessKeySecret="NEW_SECRET_ACCESS_KEY" \
--set dataImport.config.aws.s3Region="us-west-2" \
--set dataImport.config.csvSourceFile="s3://new-bucket/new-data.csv"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please also add the nodeSelector here, because express service runs on arm64 so here we should mention the type of node needed for this importer image

@anshulgoel27 anshulgoel27 merged commit 36f9018 into main Oct 10, 2024
@anshulgoel27 anshulgoel27 deleted the CUSTOM-DATA-IMPORTER branch November 6, 2024 16:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants