-
Notifications
You must be signed in to change notification settings - Fork 4.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🎉 New Source: Apify Dataset #5069
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is looking great, thanks for the contribution Matej!
I especially like the multi-threading across batches 💪
Just a couple of minor tweaks required , see my comments.
airbyte-integrations/connectors/source-apify-dataset/build.gradle
Outdated
Show resolved
Hide resolved
airbyte-integrations/connectors/source-apify-dataset/integration_tests/configured_catalog.json
Outdated
Show resolved
Hide resolved
airbyte-integrations/connectors/source-apify-dataset/source_apify_dataset/source.py
Outdated
Show resolved
Hide resolved
airbyte-integrations/connectors/source-apify-dataset/source_apify_dataset/spec.json
Outdated
Show resolved
Hide resolved
airbyte-integrations/connectors/source-apify-dataset/source_apify_dataset/source.py
Show resolved
Hide resolved
airbyte-integrations/connectors/source-apify-dataset/source_apify_dataset/source.py
Show resolved
Hide resolved
307c72f
to
445b9cb
Compare
@Phlair I addressed the comments, can you please take another look? Thanks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Amazing, thank you Matej 🚀
I'll now be following this process #3118 (issue exists because we'll be making the process less painful soon!) to get this merged and published!
...c/main/resources/config/STANDARD_SOURCE_DEFINITION/47f17145-fe20-4ef5-a548-e29b048adf84.json
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
New changes make sense for retaining order. Very minor couple of comments :)
airbyte-integrations/connectors/source-apify-dataset/source_apify_dataset/source.py
Outdated
Show resolved
Hide resolved
...c/main/resources/config/STANDARD_SOURCE_DEFINITION/47f17145-fe20-4ef5-a548-e29b048adf84.json
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm! I'll pull latest into here: #5081
closed by #5081 |
What
Syncing the data from Apify dataset to Airbyte.
How
Adding a new connector written in Python which uses Apify Python client under the hood.
Recommended reading order
No particular order. Most of the code was auto-generating using the provided Airbyte
generate.sh
/Pre-merge Checklist
Expand the checklist which is relevant for this PR.
Connector checklist
airbyte_secret
in the connector's spec./gradlew :airbyte-integrations:connectors:<name>:integrationTest
./test connector=connectors/<name>
command as documented here is passing.README.md
docs/SUMMARY.md
if it's a new connectordocs/integrations/<source or destination>/<name>
.docs/integrations/...
. See changelog exampledocs/integrations/README.md
contains a reference to the new connector/publish
command described here